Staff LLM Engineer – Tifin
Company | TIFIN |
---|---|
Location | San Francisco, CA, USA, New York, NY, USA, Boulder, CO, USA |
Salary | $190000 – $225000 |
Type | Full-Time |
Degrees | Bachelor’s, Master’s, PhD |
Experience Level | Senior, Expert or higher |
Requirements
- Ph.D./Masters/Bachelor’s degree in computer science, mathematics, statistics, engineering, or relevant field
- Experienced in the field of NLP/LLM and well-versed with the current and latest state-of-the-art research
- Hands-on experience in various LLM fine-tuning techniques (e.g. LORA), LLM inference frameworks (e.g. vLLM), advanced RAG pipelines
- Demonstrated experience in designing and implementing agentic workflows that enable autonomous AI behaviors
- Solid expertise in reinforcement learning fine tuning methodologies and frameworks to optimize AI models
- Excellent knowledge of LLM evaluation methods and metrics
- 6-8+ years of machine learning/deep learning experience within frameworks such as TensorFlow and/or PyTorch
- 2+ years of practical experience in the development of generative AI applications
- Experience using LLMs to translate different languages
- Publications at reputable machine learning conference or journal
- Proficient in Python and SQL
- Ability to visualize data in the most effective way possible for a given project or study
- Thrives in a highly demanding, entrepreneurial, and fast-paced environment
- Is a top performer and has a proactive, “doer”, and problem-solver mentality
- Is highly flexible, has a good tolerance for ambiguity, and can quickly adapt to changing priorities
- Is an exceptional team player with solid communication skills
Responsibilities
- Design and fine-tune open source and proprietary LLMs for various tasks such as answering questions, summarization, reasoning and planning, etc.
- Build advanced Retrieval Augmented Generation (RAG) pipeline including rewriting, embedding fine-tuning, hybrid search, reranking, knowledge graphs, etc.
- Develop and integrate agentic workflow systems that empower AI agents to operate autonomously, enabling proactive decision-making and dynamic interactions.
- Apply reinforcement learning techniques—including reinforcement learning fine tuning (e.g., PPO, DPO, GRPO)—to continuously optimize model performance.
- Implement a comprehensive evaluation framework and metrics for model performance
- Deploy models into production environments and ensure low latency, reliability, and scalability.
- Collaborate with product team and software engineering team to build end-to-end product systems.
Preferred Qualifications
-
No preferred qualifications provided.