Posted in

Senior Machine Learning Engineer – Training

Senior Machine Learning Engineer – Training

CompanyWaymo
LocationMountain View, CA, USA
Salary$204000 – $259000
TypeFull-Time
DegreesBachelor’s
Experience LevelSenior

Requirements

  • Bachelor’s degree in Computer Science, Engineering, or related field, or 4+ years equivalent experience
  • Experience building distributed systems for production environments.
  • Solid Python or C++ skills
  • Prior experience with Machine Learning frameworks (e.g., TensorFlow, PyTorch) and distributed training algorithms

Responsibilities

  • Develop the infrastructure components necessary for distributed training
  • Implement automation solutions for provisioning, deployment, monitoring, and scaling of distributed training infrastructure
  • Monitor system health, diagnose and perform routine maintenance tasks to ensure the reliability of the distributed training infrastructure
  • Identify performance bottlenecks and optimization opportunities
  • Improve the developer experience and performance of our scalable ML framework

Preferred Qualifications

  • Practical familiarity using ML accelerator profiling tools to uncover performance bottlenecks
  • Experience deploying and managing distributed systems in cloud environments
  • Knowledge of optimization and deep learning algorithms