Posted in

Senior Infrastructure Reliability Engineer

Senior Infrastructure Reliability Engineer

CompanyAnduril
LocationNewport Beach, CA, USA
Salary$154000 – $231000
TypeFull-Time
Degrees
Experience LevelSenior

Requirements

  • Experience operating production systems using Docker and Kubernetes
  • Proficiency with at least one cloud platform (AWS, GCP, or Azure)
  • Experience managing infrastructure with Infrastructure-as-Code tools (e.g., Terraform)
  • Strong problem-solving skills with a focus on automation
  • Scripting or software development experience (e.g., Python, Go, Bash)
  • Familiarity with CI/CD pipelines and developer tooling
  • Ability to own systems end-to-end, from design to incident resolution

Responsibilities

  • Own the lifecycle of core self-hosted developer tools (e.g., GitHub Enterprise, CircleCI, Artifactory)
  • Design and implement automated systems for patching, backups (with validation), and upgrades
  • Scale infrastructure to support a fast-growing engineering org
  • Use Infrastructure-as-Code (Terraform, Pulumi, etc.) to manage environments
  • Operate and troubleshoot systems using Docker, Kubernetes, and cloud platforms (AWS, GCP, Azure)
  • Define and maintain SLOs for service availability, reliability, and performance
  • Lead and participate in incident response and root cause analysis
  • Collaborate with platform, security, and software teams to drive operational excellence

Preferred Qualifications

  • Prior experience with GitHub Enterprise Server, Artifactory, or CircleCI
  • Experience maintaining highly available, scalable internal tools
  • Exposure to security best practices, compliance requirements, or auditing
  • Experience supporting large engineering teams in a fast-paced environment
  • Background in SRE or hybrid SWE/DevOps roles