Software Engineer – ML Performance and Scaling
Company | Anthropic |
---|---|
Location | Seattle, WA, USA; San Francisco, CA, USA; New York, NY, USA |
Salary | $315,000 – $560,000 |
Type | Full-Time |
Degrees | |
Experience Level | Senior, Expert or higher |
Requirements
- Have significant software engineering or machine learning experience, particularly at supercomputing scale
- Are results-oriented, with a bias towards flexibility and impact
- Pick up slack, even if it goes outside your job description
- Enjoy pair programming (we love to pair!)
- Want to learn more about machine learning research
- Care about the societal impacts of your work
Responsibilities
- Identify systems problems and develop systems that optimize throughput and robustness
- Implement low-latency, high-throughput sampling for large language models
- Implement GPU kernels to adapt our models to low-precision inference
- Write a custom load-balancing algorithm to optimize serving efficiency
- Build quantitative models of system performance
- Design and implement a fault-tolerant distributed system running with a complex network topology
- Debug kernel-level network latency spikes in a containerized environment
Preferred Qualifications
- High-performance, large-scale ML systems
- GPU/Accelerator programming
- ML framework internals
- OS internals
- Language modeling with transformers