Site Reliability Engineer

last updated January 20, 2021 20:15 UTC

MobileCoin

HQ: Remote

more jobs in this category:

  • -> Website & App Tester @ PingPong
  • -> Entry Level Content Writer @ Jerry
  • -> Code Challenge Reviewer - Review Code In Your Spare Time - £50 Per Hour @ Geektastic
  • -> Frontend Developer (React) @ Cake
  • -> Frontend Engineer @ Torc

HQ:San Francisco
Location:Fully Remote anywhere in the US or Canada
Comp:Base salary $160K – $190k + coins & equity – total comp is north of $300K

Responsibilities

  • Maintain, monitor and improve our Kubernetes clusters.
  • Maintain, improve, scale and secure our Azure infrastructure and Ubuntu Linux systems.
  • Assist our development teams in running, packaging, deploying and troubleshooting applications
  • Work with developers on streamlining deployment processes with Jenkins and other tooling
  • Be responsible for maintenance and improvements to multiple internal services, for example Kubernetes, Prometheus, and Logging
  • Monitor, triage and respond to alerts in our 24/7/365 environment.
  • Participate in design and code reviews, and ensure that the foundation for our services is best in class.
  • Evaluate new technologies, design and implement as appropriate.
  • Identify automation opportunities and implement by creating custom or by using off the shelf solutions.

Requirements

  • Extensive experience of working in cloud-based systems operations
  • You’re very comfortable with Linux command line
  • You have extensive experience with Docker (building and running containers), and container orchestration (Kubernetes preferred)
  • You have experience with Prometheus and Grafana (preferred), or other monitoring systems (InfluxDB, StatsD, Graphite, etc)
  • Experience with CI pipelines and Jenkins (preferred)
  • You are security minded and follow standard security best-practices (least-privilege, common attack defenses, etc)
  • You have a good understanding of computer networking, TCP/IP, load balancing, distributed computing, web services, and the fundamental protocols used by the internet (HTTP, HTTPS, DNS, etc.).
  • You have experience supporting production workloads and are familiar with monitoring concepts and tooling.
  • You’re highly proficient in at least one scripting language (Python, Go, Rust, Bash, etc.).
  • You’re enthusiastic about working in a small, growing team, you are open, empathetic, and care about putting the best ideas forward in a collaborative and helpful manner.

Nice to Have

  • Experience with Azure
  • Experience with Rust and/or C/C++
  • Experience with advanced CPU features in a container environment (SGX, GPU, etc)
Apply info ->

To find out more about this job, please visit this link

Shopping Cart
There are no products in the cart!
Total
 0.00
0