Machine Learning Engineer
Company | Altos Labs |
---|---|
Location | San Francisco, CA, USA, San Diego, CA, USA |
Salary | $186150 – $315100 |
Type | Full-Time |
Degrees | Master’s, PhD |
Experience Level | Senior |
Requirements
- Masters or Ph.D. degree in a quantitative/computational field such as computer science, artificial intelligence, mathematics, statistics, physics, or computational biology, or equivalent experience
- 5+ years experience in developing machine learning models
- Very strong programming skills, including experience with Python and deep learning libraries such as PyTorch or JAX
- Experience in large-scale distributed optimization of machine learning models across multiple GPUs and nodes
- Proven track record leveraging machine learning to solve real-world problems
- Expertise in one or more of the following: generative models, language models, computer vision, bayesian inference, causal reasoning & inference, transfer & multi-task learning, diffusion models, graph neural networks, active learning
- Experience writing production-quality code with modern machine learning frameworks such as PyTorch, TensorFlow, JAX, or similar
- Experience with multi-GPU and distributed training at scale
Responsibilities
- Partner with world-class scientists across Altos to help generate biological insights with the goal of developing novel therapies
- Design and implement large-scale machine learning algorithms and systems applied to biological datasets
- Train, evaluate, and optimize machine learning models at scale
- Communicate effectively with internal and external collaborators to meet ambitious research and development goals.
Preferred Qualifications
- Familiarity with biological data formats, concepts, and computational models
- Experience in cell health and rejuvenation-related research area
- Experience with identification and assessment of drug targets and/or therapeutic compounds
- Experience in the application of machine learning methods to biological data, including genomics, transcriptomics, epigenetics, proteomics, and imaging
- Track record in open-source software development, e.g., demonstrated by high-impact GitHub repository
- Track record of high-caliber scientific work, e.g., demonstrated through publications in peer-reviewed scientific journals or major ML conferences
- Experience with one lower level language (not limited to, but such as C++, Rust)
- Experience with large scale data processing and database tools such as MapReduce, Dask, SQL, Hugging Face Datasets, TileDB, Ray