Posted in

Staff Site Reliability Engineer

Staff Site Reliability Engineer

CompanyEarnest
LocationSan Francisco, CA, USA, Oakland, CA, USA
Salary$223000 – $253000
TypeFull-Time
Degrees
Experience LevelSenior, Expert or higher

Requirements

  • 8+ years of experience in reliability, scalability, performance, security, and enterprise system architecture, with a focus on toil reduction and best practices implementation.
  • Strong coding skills in at least one language (Go, Python, Java Spring Boot, .NET, etc.) and deep knowledge of software applications, technical processes, and emerging disciplines.
  • Hands-on experience with monitoring and telemetry tools (Grafana, Prometheus, Datadog, Splunk, etc.), SLO alerting, and CI/CD tools (Jenkins, GitHub Actions, GitLab, Terraform).
  • Expertise in containerization and orchestration (Kubernetes, Docker, ECS) and troubleshooting networking and distributed system issues.
  • Experience creating infrastructure resources using Terraform or OpenTofu, with formal training or certification in software engineering and 5+ years of applied experience.
  • Willingness to travel to the Oakland office monthly to collaborate with other Earnies.

Responsibilities

  • Ensure the reliability, scalability, performance, and security of systems while managing and optimizing infrastructure for efficiency and minimal downtime.
  • Develop, maintain, and enhance observability and CI/CD tools (Splunk, New Relic, GitHub Actions, Terraform, etc.), streamline deployments, update documentation, and improve internal tools for efficiency and scalability.
  • Lead product initiatives, conduct resiliency reviews, coordinate cross-team efforts, and manage goals, risks, and resources for successful delivery.
  • Advise on reliability, mentor engineers on best practices, facilitate cross-team communication, and translate stakeholder needs into technical solutions.
  • Lead key projects, stay current with industry trends, formalize best practices, and mentor engineers in building and troubleshooting reliable distributed systems.

Preferred Qualifications

    No preferred qualifications provided.