Systems Engineer III – Site Reliability Engineering – Google Cloud
Company | |
---|---|
Location | Seattle, WA, USA, Kirkland, WA, USA |
Salary | $141000 – $202000 |
Type | Full-Time |
Degrees | Bachelor’s |
Experience Level | Mid Level, Senior |
Requirements
- Bachelor’s degree in Computer Science, a related field, or equivalent practical experience.
- 2 years of experience with programming in one or more programming languages.
- 2 years of experience working with administration (e.g. filesystems, inodes, system calls) or networking (e.g. TCP/IP, routing, network topologies and hardware, SDN).
Responsibilities
- Improve the whole lifecycle of services from inception and design, through deployment, operation, and refinement.
- Manage support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning, and launch reviews.
- Provide guidance to other team members on managing availability and performance of mission critical services, on building automation to prevent problem recurrence, and on building automated responses for non-exceptional service conditions.
- Maintain services once they are live by measuring and monitoring availability, latency, and overall system health. Lead sustainable incident response and blameless postmortems.
- Scale systems sustainably through mechanisms like automation and evolve systems by driving changes that improve reliability and velocity.
Preferred Qualifications
- Master’s degree in Computer Science or Engineering.
- 2 years of experience designing, analyzing, and troubleshooting large-scale distributed systems.