Posted in

Software Engineer – Training Efficiency

Software Engineer – Training Efficiency

CompanyWaymo
LocationMountain View, CA, USA
Salary$238000 – $302000
TypeFull-Time
DegreesBachelor’s
Experience LevelExpert or higher

Requirements

  • B.S. in Computer Science, Math, or 8+ years equivalent real-world experience
  • Proficient in distributed systems design with an understanding of ML data pipeline optimization
  • Experience with ML frameworks, including TensorFlow and JAX
  • Hands-on experience libraries like Grain or tf.data service
  • Solid programming skills in Python and C++
  • Practical familiarity with profiling tools to uncover performance bottlenecks

Responsibilities

  • Design, and improve distributed input data pipelines for large-scale ML training workloads
  • Collaborate with researchers and ML engineers to resolve bottlenecks in data pipeline performance
  • Improve runtime goodput of ML training workload, including optimizing input data processing systems, ensuring scalability and reliability across distributed environments
  • Implement and maintain advanced ML infrastructure tools, including ML Pathways, Grain, JAX, and TensorFlow
  • Evaluate and integrate modern technologies to enhance the performance and scalability of ML systems
  • Promote best practices for distributed systems architecture and contribute to technical leadership within the team

Preferred Qualifications

  • MS in Computer Science, Math
  • Familiarity with distributed dataflow frameworks like ML Pathways