Senior Applied Scientist – Large Language Models / Generative AI
Company | Datadog |
---|---|
Location | Boston, MA, USA; New York, NY, USA |
Salary | $187,000 – $240,000 |
Type | Full-Time |
Degrees | Bachelor’s, Master’s, PhD |
Experience Level | Senior |
Requirements
- You possess a BS/MS/PhD in Computer Science, Engineering, Machine Learning, or a related scientific field, or have equivalent experience
- You have relevant experience with language models (experience with large language models is a plus), NLP, large-scale systems and data sets, deep learning, or adjacent fields. Writing production data pipelines and applications is a plus.
- You have experience with the stack for distributed training and inference of large models, including the relevant distributed frameworks and ML development frameworks such as PyTorch and TensorFlow. Experience with CUDA is a plus.
- You possess the ability to explain complex models and ideas to non-technical audiences
- You value code simplicity and performance
- You are passionate about Generative AI and want to contribute to user-facing products
Responsibilities
- Work on a wide range of projects: building large-scale distributed fine-tuning and training infrastructure, deploying LLMs on GPU instances for real-time use cases, designing robust, secure infrastructure, or supporting cutting-edge AI research and development.
- Create new product features using advanced machine learning algorithms, LLMs, and statistical techniques
- Collaborate with a group of AI specialists and scientists to envision the future of our capabilities while helping design and deploy critical services.
- Actively participate in our journal club by reading and presenting the latest research papers in the field of LLMs and Generative AI
- Provide deeper insights into, and tell the stories behind, the massive volumes of data processed in Datadog systems
- Develop, deploy, monitor, and maintain the LLMs, services, and infrastructure managed by your team, and participate in your team’s on-call rotation
Preferred Qualifications
- Experience with CUDA is a plus
- Writing production data pipelines and applications is a plus