Skip to content

Site Reliability Engineer – Deployment Engineering
Company | Workday |
---|
Location | McLean, VA, USA |
---|
Salary | $106000 – $195500 |
---|
Type | Full-Time |
---|
Degrees | |
---|
Experience Level | Mid Level |
---|
Requirements
- 3+ years proven Systems / DevOps / SRE Engineering experience with programming experience in a high level language, e.g. Python, Java, Ruby, C#
- Experience working with deployment tools like CloudFormation, Terraform, Ansible, or Chef
- Experience in monitoring, analyzing and troubleshooting large-scale distributed systems.
- 3+ years experience working with Docker, Kubernetes, Serverless (Lambda’s).
- 3+ years experience with one or more: AWS or Google Cloud Platform.
- You have experience working in a Linux/Unix Operating
Responsibilities
- You will be responsible for preparing, automating, monitoring, triaging, refining and performing weekly maintenance activities
- Create comprehensive runbooks for both Maintenance Window and Tenant Maintenance orchestration
- Design issue remediation through automated self-healing, driving toil reduction
- Collaborate with peers & teams across the business to drive the future direction of improvements to key business functions related to the weekly maintenance window and tenant management.
- Participate in weekend work & a flexible working schedule
- Be passionate about modern technologies and advocate for innovative approaches to existing systems or work practices
- Work in a collaborative environment, openly sharing your knowledge with others and you enjoy learning from others too!
Preferred Qualifications
- Ability to resolve incidents quickly across a diverse scope of complex systems and conduct thorough RCAs
- Experience of using Confluence, Slack & Jira a plus
- Experience with MySQL databases a plus