Skip to content

Senior Machine Learning Engineer – Training
Company | Waymo |
---|
Location | Mountain View, CA, USA |
---|
Salary | $204000 – $259000 |
---|
Type | Full-Time |
---|
Degrees | Bachelor’s |
---|
Experience Level | Senior |
---|
Requirements
- Bachelor’s degree in Computer Science, Engineering, or related field, or 4+ years equivalent experience
- Experience building distributed systems for production environments.
- Solid Python or C++ skills
- Prior experience with Machine Learning frameworks (e.g., TensorFlow, PyTorch) and distributed training algorithms
Responsibilities
- Develop the infrastructure components necessary for distributed training
- Implement automation solutions for provisioning, deployment, monitoring, and scaling of distributed training infrastructure
- Monitor system health, diagnose and perform routine maintenance tasks to ensure the reliability of the distributed training infrastructure
- Identify performance bottlenecks and optimization opportunities
- Improve the developer experience and performance of our scalable ML framework
Preferred Qualifications
- Practical familiarity using ML accelerator profiling tools to uncover performance bottlenecks
- Experience deploying and managing distributed systems in cloud environments
- Knowledge of optimization and deep learning algorithms