Skip to content

Senior Infrastructure Reliability Engineer
Company | Anduril |
---|
Location | Newport Beach, CA, USA |
---|
Salary | $154000 – $231000 |
---|
Type | Full-Time |
---|
Degrees | |
---|
Experience Level | Senior |
---|
Requirements
- Experience operating production systems using Docker and Kubernetes
- Proficiency with at least one cloud platform (AWS, GCP, or Azure)
- Experience managing infrastructure with Infrastructure-as-Code tools (e.g., Terraform)
- Strong problem-solving skills with a focus on automation
- Scripting or software development experience (e.g., Python, Go, Bash)
- Familiarity with CI/CD pipelines and developer tooling
- Ability to own systems end-to-end, from design to incident resolution
Responsibilities
- Own the lifecycle of core self-hosted developer tools (e.g., GitHub Enterprise, CircleCI, Artifactory)
- Design and implement automated systems for patching, backups (with validation), and upgrades
- Scale infrastructure to support a fast-growing engineering org
- Use Infrastructure-as-Code (Terraform, Pulumi, etc.) to manage environments
- Operate and troubleshoot systems using Docker, Kubernetes, and cloud platforms (AWS, GCP, Azure)
- Define and maintain SLOs for service availability, reliability, and performance
- Lead and participate in incident response and root cause analysis
- Collaborate with platform, security, and software teams to drive operational excellence
Preferred Qualifications
- Prior experience with GitHub Enterprise Server, Artifactory, or CircleCI
- Experience maintaining highly available, scalable internal tools
- Exposure to security best practices, compliance requirements, or auditing
- Experience supporting large engineering teams in a fast-paced environment
- Background in SRE or hybrid SWE/DevOps roles