Posted in

Senior Software Engineer – Infrastructure Engineering

Senior Software Engineer – Infrastructure Engineering

CompanyThe Voleon Group
LocationBerkeley, CA, USA
Salary$225000 – $255000
TypeFull-Time
DegreesBachelor’s
Experience LevelSenior

Requirements

  • Computer Science Degree or equivalent experience
  • 5+ years of software engineering experience building high-performance systems
  • Experience operating and scaling mission-critical, large-scale production systems in languages such as Python, Go and C++
  • Excellent communication and project management skills in complex technical domains
  • Track record mentoring engineers and leading technical direction

Responsibilities

  • Help design, implement, and maintain foundational services, frameworks, and libraries for data and compute management
  • Work in cooperation with multiple teams to expand and mature our cloud presence
  • Develop and enforce company-wide standards related to code quality, code organization, and codebase management
  • Work with DevOps to maintain our CI/CD pipelines and ensure that developers can build, test, and deploy changes in a fast and high-quality manner
  • Leverage telemetry toolkits to track core performance and reliability metrics of the services we own. Help fulfill rigorous SLAs and performance guarantees
  • Help the company survey, evaluate, and adopt major new technologies. Design, coordinate, and execute company-wide adoption plans
  • Lead complex multi-team projects. Collaborate across DevOps, SysOps, Engineering, and Research teams to deliver a cohesive platform across research and production
  • Mentor and develop other engineers on the team, and share your practices and knowledge with the team and company

Preferred Qualifications

  • Experience with ML research platforms and associated frameworks for data processing, batch computing, and research (e.g., Apache Airflow, Kubeflow, Slurm, AWS/GCP Batch, Spark, Dask)
  • Expertise in CI/CD, build systems, and best practices for large codebase management (Bazel, Jenkins, Github Actions)
  • Exposure to cloud platforms, cloud-native architectures, and tools like Infrastructure-as-Code for managing cloud infrastructure (Terraform, Pulumi, CloudFormation)