Sr Site Reliability Engineer – App Service Team

Company: Palo Alto Networks
Location: Santa Clara, CA, USA
Salary: Not provided
Type: Full-Time
Degrees: Bachelor’s, Master’s
Experience Level: Mid Level, Senior

Requirements

  • 4+ years as an engineer in Infrastructure, Operations, DevOps, or System Engineering
  • 2+ years building highly available, scalable cloud-native applications on AWS and GCP
  • BS or MS in Computer Science or a related field, or equivalent professional or military experience, required
  • Expertise in public cloud; experience with both GCP and AWS preferred
  • Expertise in configuration management with frameworks such as Terraform or Helm
  • Experience in Site Reliability Engineering, Production Engineering, or DevOps
  • Solid experience in container workloads and Kubernetes
  • Linux administration, internals, and network troubleshooting
  • Proficiency with programming languages like Golang or Python along with shell scripting to automate tasks
  • Proficiency with CI/CD pipelines and GitOps principles; knowledge of GitLab Runners is a plus
  • Ability to diagnose and troubleshoot complex distributed systems handling high-volume transactions
  • Excellent written and verbal communication, able to collaborate and rally support
  • Self-disciplined, self-managed, and self-motivated, with a strong sense of ownership, urgency, and drive
  • Ready to understand and dissect new technology stacks quickly

Responsibilities

  • Contribute to the success of SRE and DevOps
  • Develop expertise in new technologies
  • Work with developers, researchers, data scientists, and security experts
  • Design, build, and operate reliable, secure cloud infrastructure
  • Ensure that applications are production-ready, scalable, and reliable
  • Develop tools and automation frameworks
  • Automate the robust deployment of services
  • Orchestrate end-to-end monitoring and alerting
  • Participate with SRE and Dev teams in the on-call rotation
  • Lead root cause analysis of critical business and production issues

Preferred Qualifications

  • Expertise in public cloud; experience with both GCP and AWS preferred
  • Proficiency with CI/CD pipelines and GitOps principles; knowledge of GitLab Runners is a plus