Posted in

Applied AI Engineer – Voice – Enterprise

Applied AI Engineer – Voice – Enterprise

CompanyxAI
LocationPalo Alto, CA, USA, San Francisco, CA, USA
Salary$180000 – $440000
TypeFull-Time
Degrees
Experience LevelMid Level, Senior

Requirements

  • Strong engineering background.
  • Experience interfacing between technical and customer-facing teams.
  • Excellent verbal and written communication skills in English.
  • Ability to translate business and voice-specific product needs into technical solutions.
  • Proven experience implementing voice AI or machine learning products, including APIs, back-end, and front-end voice interfaces.
  • Strong proficiency in Python and/or TypeScript.
  • Solid understanding of HTTP protocol and real-time communication protocols (e.g., WebRTC).

Responsibilities

  • Designing and building end-to-end Voice AI solutions, from understanding customer pain points to scoping product specs and deploying LLM-powered voice interfaces.
  • Benchmarking voice models, writing evaluations, or analyzing performance to identify weaknesses in speech recognition, synthesis, or natural language understanding.
  • Improving model performance through system prompt tuning, fine-tuning voice-specific models, or optimizing for low-latency voice interactions.
  • Analyzing voice request logs, prompt data, or audio inputs to enhance system accuracy and user experience.
  • Building internal tools to automate voice AI workflows, such as transcription pipelines or real-time voice processing.
  • Enhancing xAI’s Voice AI SDKs or developer documentation based on customer feedback and enterprise use cases.

Preferred Qualifications

  • Building evaluations for voice AI capabilities, such as speech recognition accuracy or naturalness of synthesized speech.
  • Demonstrating expertise in machine learning fundamentals, including voice model evaluation, training, or fine-tuning.
  • Deploying voice AI models to production, optimizing for low-latency and high-reliability environments.
  • Writing developer documentation or creating voice-specific SDKs.
  • Working with large-scale audio datasets, optimizing voice processing pipelines, or scaling systems for enterprise-grade workloads.
  • Using infrastructure tools like Pulumi or Terraform for deploying voice AI systems.