Posted in

Staff Data Engineer

Staff Data Engineer

CompanyAtomic Machines
LocationBerkeley, CA, USA
Salary$Not Provided – $Not Provided
TypeFull-Time
DegreesBachelor’s, Master’s
Experience LevelSenior, Expert or higher

Requirements

  • 6+ years (after Bachelor’s) of industry experience.
  • Proven experience in data engineering, data flows, and big-data processing.
  • Proficiency in Python and programming languages such as SQL.
  • Proficiency in data storage solutions (Data Lakes, Cloud Storage, SQL, NoSQL).
  • Understanding of manufacturing processes, sensors, and process automation.
  • Industry-level experience in guiding and automating data collection and processing in manufacturing environments.
  • Knowledge of DevOps practices.
  • Strong problem-solving skills and ability to work in a collaborative, fast-paced environment.
  • Bachelor’s or Master’s degree in Computer Science, Data Engineering, Data Science, or a related STEM field.

Responsibilities

  • Design, build, and maintain scalable data collection, transformation, storage, and integration systems across different manufacturing units and processes.
  • Develop data pipelines to process and prepare data for ML model training and real-time analytics.
  • Ensure real-time data handling for ML applications and process monitoring.
  • Work with inspection engineers/technicians to collect datasets and collaborate with process designers and developers to ensure quality control in manufacturing processes through data validation.
  • Guide the creation of validation tests to compare in-house inspection algorithms with commercial tools.
  • Establish process monitoring frameworks by creating data storage solutions and implementing structured labeling for process input parameters and associated outputs.
  • Develop online statistics and analytics for process control and optimization.
  • Analyze observable time-series data corresponding to different manufacturing process stages.
  • Derive required datasets, assess data availability, and select appropriate data collection systems to improve data quality and process efficiency.
  • Work with process development engineers to propose data collection requirements and communicate those requirements to hardware designers.
  • Collaborate with cross-functional teams, including process engineers, chemical engineers, materials scientists, simulation engineers, software developers, data scientists, and ML engineers, to optimize data workflows and improve operational efficiency.

Preferred Qualifications

  • Experience with real-time data processing frameworks (Apache Kafka, Spark Streaming, etc.) and with the JMP format.
  • Knowledge of ORM and interface abstraction.