Software Engineer - Data Infrastructure

Software Engineer – Data Infrastructure

Have meaningful experience with leading and building production data systems to deliver on major product initiatives.
You have built and managed highly scalable data processing solutions (e.g. Spark, Flink), data lakes or warehouses (e.g. Snowflake, Hive), authored queries (SQL), distributed storage systems (e.g., HDFS, S3), used workflow management (e.g. Airflow, Dagster), and have experience maintaining the infra that supports these.
Proficiency in at least one programming language commonly used within Data Engineering, such as Python, Scala, or Java.
Expertise with any of ETL schedulers such as Airflow, Dagster, or similar frameworks.
Experience maintaining a high quality bar for design, correctness, and testing.
Take pride in building and operating scalable, reliable, secure systems.
Have a humble attitude, an eagerness to help your colleagues, and a desire to do whatever it takes to make the team succeed.
Own problems end-to-end, and are willing to pick up whatever knowledge you’re missing to get the job done.
You have experience being the technical lead of a Data Engineering / Platform / Infrastructure Team.
Experience building ML/DL systems and/or data infrastructure that feeds into training large ML models.

Design, build and maintain highly scalable data processing solutions, while ensuring scalability, reliability, and security.
Architect, build, and deploy the back-end systems and services that power our data curation platform.
Partner with researchers and engineers to bring new features and research capabilities to our customers.
Ensure that our systems are reliable, secure, and worthy of our customers’ trust.

No preferred qualifications provided.