Skip to content

Software Engineer – ML Infrastructure
Company | Serve Robotics |
---|
Location | Toronto, ON, Canada |
---|
Salary | $119000 – $160000 |
---|
Type | Full-Time |
---|
Degrees | Bachelor’s, Master’s |
---|
Experience Level | Junior, Mid Level |
---|
Requirements
- BS or MS in computer science (or equivalent work experience) with focus in data engineering and machine learning
- 2+ years of industry experience developing big data processing and/or machine learning pipelines
- 1+ years of hands on experience with cloud platforms (AWS/GCP/Azure)
- Proficient in Python
- Solid understanding of system design fundamentals and distributed computing concepts
Responsibilities
- Develop and maintain highly scalable data processing pipelines for data curation, annotation, search and ml feature extraction.
- Build and improve our active learning pipelines. Ensure the scalability of our training jobs and inference endpoints.
- Develop and maintain our orchestration and scheduling systems.
- Collaborate with autonomy engineers to build new features for our autonomy data platform, improving data accessibility and developer productivity.
- Build integrations with annotation providers to efficiently annotate large scale datasets
- Develop infrastructure components using IaC(Infrastructure as Code) and implement CI/CD processes to streamline ongoing development of the platform.
- Develop monitoring and alerting frameworks to ensure platform reliability, stability and cost efficiency.
- Collaborate with ML Engineers in accelerating the ml development velocity.
- Work with the data team to ensure SLAs around data quality and availability. Optimise storage costs by tuning retention policies and improving data access patterns.
Preferred Qualifications
- Experience with terabyte scale data
- Experience with orchestration engines like Airflow, Prefect
- Experience with data discovery tools and methodologies like RAG, Vector Search
- Experience with databases. E.g BigQuery, Postgres, MySQL, MongoDB.
- Experience with vector search databases. E.g MongoDB Atlas, Pinecone,
- Experience with IaC and CI/CD. E.g Terraform, Jenkins, Github Actions
- Experience with big data frameworks such as Apache Spark/Beam/Hadoop