Skip to content

Software Engineer – Training Efficiency
Company | Waymo |
---|
Location | Mountain View, CA, USA |
---|
Salary | $238000 – $302000 |
---|
Type | Full-Time |
---|
Degrees | Bachelor’s |
---|
Experience Level | Expert or higher |
---|
Requirements
- B.S. in Computer Science, Math, or 8+ years equivalent real-world experience
- Proficient in distributed systems design with an understanding of ML data pipeline optimization
- Experience with ML frameworks, including TensorFlow and JAX
- Hands-on experience libraries like Grain or tf.data service
- Solid programming skills in Python and C++
- Practical familiarity with profiling tools to uncover performance bottlenecks
Responsibilities
- Design, and improve distributed input data pipelines for large-scale ML training workloads
- Collaborate with researchers and ML engineers to resolve bottlenecks in data pipeline performance
- Improve runtime goodput of ML training workload, including optimizing input data processing systems, ensuring scalability and reliability across distributed environments
- Implement and maintain advanced ML infrastructure tools, including ML Pathways, Grain, JAX, and TensorFlow
- Evaluate and integrate modern technologies to enhance the performance and scalability of ML systems
- Promote best practices for distributed systems architecture and contribute to technical leadership within the team
Preferred Qualifications
- MS in Computer Science, Math
- Familiarity with distributed dataflow frameworks like ML Pathways