Software Engineer – ML Performance and Scaling

Company: Anthropic
Location: Seattle, WA, USA; San Francisco, CA, USA; New York, NY, USA
Salary: $315,000 – $560,000
Type: Full-Time
Experience Level: Senior, Expert or higher

Requirements

You may be a good fit if you:

  • Have significant software engineering or machine learning experience, particularly at supercomputing scale
  • Are results-oriented, with a bias towards flexibility and impact
  • Pick up slack, even if it goes outside your job description
  • Enjoy pair programming (we love to pair!)
  • Want to learn more about machine learning research
  • Care about the societal impacts of your work

Responsibilities

  • Identify systems problems and develop systems that optimize throughput and robustness
  • Implement low-latency, high-throughput sampling for large language models
  • Implement GPU kernels to adapt our models to low-precision inference
  • Write a custom load-balancing algorithm to optimize serving efficiency
  • Build quantitative models of system performance
  • Design and implement a fault-tolerant distributed system running with a complex network topology
  • Debug kernel-level network latency spikes in a containerized environment

Preferred Qualifications

  • High-performance, large-scale ML systems
  • GPU/Accelerator programming
  • ML framework internals
  • OS internals
  • Language modeling with transformers