Skip to content

Senior Site Reliability Engineer – Data Infrastructure – AD/Adas
Company | Woven |
---|
Location | Palo Alto, CA, USA |
---|
Salary | $140000 – $230000 |
---|
Type | Full-Time |
---|
Degrees | Bachelor’s |
---|
Experience Level | Senior |
---|
Requirements
- Bachelor’s degree in Computer Science, a related field, or equivalent practical experience.
- 5+ years of experience with data structures/algorithms and professional software engineering in one or more programming languages (e.g., Python, Go, Java, C, C++).
- 3+ years of experience as a Site Reliability Engineer, working with Terraform, Docker, cloud-native technologies, networking and Kubernetes in production.
- Experience designing, deploying, monitoring and maintaining large-scale, fault-tolerant multi-region and/or multi-cloud distributed systems.
- Ability to debug & optimize code, to troubleshoot distributed systems and to automate routine tasks.
- Business-level proficiency in English speaking, reading and writing (e.g., technical documents, software documentation).
Responsibilities
- Design, build, maintain, optimize and support large scale, multi-region, multi-cloud compute and storage infrastructure powering our data platform and mission critical services.
- Work with fellow Data Infrastructure engineers and Site Reliability engineers to ensure our systems are scalable, reliable, fault-tolerant, highly available, highly performant, and observable.
- Manage incidents, triage product or system issues and debug/track/resolve by analyzing the root cause of these issues and the impact on users & operations.
- Work closely with other Data Infrastructure engineers, Site Reliability engineers, ML Platform engineers, Computer Vision and ML engineers on high-impact projects to create innovative solutions to problems in the self-drive space.
- Mentor junior engineers in their day to day work and drive best practices across the organization.
- Contribute to the long term strategy for several of our systems and products.
Preferred Qualifications
- Master’s degree in Computer Science.
- Experience working as a Software Engineer on data-intensive applications, data platforms, data pipelines, workflow orchestration, batch processing, and/or distributed databases.
- Experience working with RPC protocols and their formats, e.g., gRPC/protobuf, Apache Avro, etc.
- Experience with cloud-based (e.g. AWS, GCP, Azure) microservice architecture, event-driven, distributed architectures.
- Experience working in a fast-paced environment, collaborating across teams and disciplines.
- Experience with data governance, data privacy and security.
- Business-level proficiency in Japanese.