Skip to content

Site Reliability Engineer – SRE
Company | SMX |
---|
Location | Boulder, CO, USA |
---|
Salary | $103200 – $172000 |
---|
Type | Full-Time |
---|
Degrees | |
---|
Experience Level | Senior |
---|
Requirements
- 5+ years experience operating high reliability environments.
- Ability to implement observability toolsets to instrument software and hardware utilization.
- Practical hands-on troubleshooting techniques with demonstrated fault isolation and root cause analysis experience.
- Experience in building automation tools (e.g. deployment automation).
- This position will require the ability to obtain a U.S. government DoD Secret Security clearance.
Responsibilities
- Lead implementation of the observability tools and analyze metrics capturing the environment utilization as well as mission performance of the software.
- Work with software teams to specify and optimize hardware environment usage.
- Lead problem diagnosis efforts of the deployed mission system.
- Develop automation for the deployment of the software stack.
Preferred Qualifications
- Currently active U.S. government DoD security clearance
- Proficiency with the following tools/technologies: K8s/OpenShift, Prometheus/Tempo/Grafana, Helm charts, Python, CI/CD pipeline configuration, NATS data layer, Container (Docker) management, Cyber security certifications (e.g. CISSP), Linux system administration (RHEL)