Data Engineer II

Company	Scribd
Location	California, USA; British Columbia, Canada; Ontario, Canada
Salary	$126,000 – $196,000
Type	Full-Time
Degrees	Bachelor’s
Experience Level	Mid Level, Senior

4+ years of experience as a professional software engineer.
Proficient in one or more programming languages, such as Python, Ruby, Scala, or similar.
Hands-on experience with large-scale data processing frameworks such as Apache Spark, Databricks, or similar tools.
Experience working with systems at scale.
Experience working with a public cloud provider (AWS, Azure, or Google Cloud).
Hands-on experience building, deploying, and optimizing solutions using ECS, EKS, or AWS Lambda.
Proven ability to test and optimize systems for performance and scalability.
Bachelor’s degree in Computer Science or equivalent professional experience.
Bonus points if you have experience working with machine learning systems.

Design and develop data pipelines to extract, enrich, and process metadata from millions of documents, images, and other content types.
Collaborate with cross-functional teams, including ML engineers and product managers, to deliver scalable, efficient, and reliable metadata solutions.
Build and maintain systems that operate at massive scale, handling hundreds of millions of documents and billions of images.
Optimize and refactor existing systems for performance, scalability, and reliability.
Ensure data accuracy, integrity, and quality through automated validation and monitoring.
Participate in code reviews to ensure best practices are followed and to maintain high quality standards in the codebase.
Manage and maintain data pipelines, security, and infrastructure.