Posted in

Site Reliability Engineer – Deployment Engineering

Site Reliability Engineer – Deployment Engineering

CompanyWorkday
LocationMcLean, VA, USA
Salary$106000 – $195500
TypeFull-Time
Degrees
Experience LevelMid Level

Requirements

  • 3+ years proven Systems / DevOps / SRE Engineering experience with programming experience in a high level language, e.g. Python, Java, Ruby, C#
  • Experience working with deployment tools like CloudFormation, Terraform, Ansible, or Chef
  • Experience in monitoring, analyzing and troubleshooting large-scale distributed systems.
  • 3+ years experience working with Docker, Kubernetes, Serverless (Lambda’s).
  • 3+ years experience with one or more: AWS or Google Cloud Platform.
  • You have experience working in a Linux/Unix Operating

Responsibilities

  • You will be responsible for preparing, automating, monitoring, triaging, refining and performing weekly maintenance activities
  • Create comprehensive runbooks for both Maintenance Window and Tenant Maintenance orchestration
  • Design issue remediation through automated self-healing, driving toil reduction
  • Collaborate with peers & teams across the business to drive the future direction of improvements to key business functions related to the weekly maintenance window and tenant management.
  • Participate in weekend work & a flexible working schedule
  • Be passionate about modern technologies and advocate for innovative approaches to existing systems or work practices
  • Work in a collaborative environment, openly sharing your knowledge with others and you enjoy learning from others too!

Preferred Qualifications

  • Ability to resolve incidents quickly across a diverse scope of complex systems and conduct thorough RCAs
  • Experience of using Confluence, Slack & Jira a plus
  • Experience with MySQL databases a plus