Data Engineer II
Company | Scribd |
---|---|
Location | California, USA; British Columbia, Canada; Ontario, Canada |
Salary | $126,000 – $196,000 |
Type | Full-Time |
Degrees | Bachelor’s |
Experience Level | Mid Level, Senior |
Requirements
- 4+ years of experience as a professional software engineer.
- Proficient in one or more programming languages, such as Python, Ruby, Scala, or similar.
- Hands-on experience with data processing frameworks like Apache Spark, Databricks, or similar tools for large-scale data processing.
- Experience working with systems at scale.
- Experience working with a public cloud provider (AWS, Azure, or Google Cloud).
- Hands-on experience building, deploying, and optimizing solutions using ECS, EKS, or AWS Lambda.
- Proven ability to test and optimize systems for performance and scalability.
- Bachelor’s in CS or equivalent professional experience.
- Bonus points if you have experience working with Machine Learning systems.
Responsibilities
- Design and develop data pipelines to extract, enrich, and process metadata from millions of documents, images, and other content types.
- Collaborate with cross-functional teams, including ML engineers and product managers, to deliver scalable, efficient, and reliable metadata solutions.
- Build and maintain systems that operate at massive scale, handling hundreds of millions of documents and billions of images.
- Optimize and refactor existing systems for performance, scalability, and reliability.
- Ensure data accuracy, integrity, and quality through automated validation and monitoring.
- Participate in code reviews, ensuring best practices are followed and maintaining high-quality standards in the codebase.
- Manage and maintain data pipelines, security, and infrastructure.
Preferred Qualifications
- Experience working with Machine Learning systems.