Distinguished Engineer – Data Architect
Company | Capital One |
---|---|
Location | McLean, VA, USA, Richmond, VA, USA, New York, NY, USA |
Salary | $239900 – $328500 |
Type | Full-Time |
Degrees | Bachelor’s, Master’s |
Experience Level | Senior, Expert or higher |
Requirements
- Hands on experience with AWS Cloud data technologies including RDS, DynamoDB, S3 and Glue ETL
- Experience designing and implementing event based stream processing solutions using technologies such as Kafka, Kinesis, Spark and Flink
- Ability to design and implement high availability, multi-region data replication for mission critical applications
- Experience designing and implementing data management solutions that enable Data Quality, Reference Data Management, and Metadata Management.
- A track record of designing and implementing end-to-end data pipelines supporting both production and analytic use cases
- Comfortable coding with Python or Scala and proficient in SQL
- In-depth understanding of AVRO, Parquet and DeltaLake data formats
- Background using multiple data storage technologies including relational, document, key/value, graph and object stores
- Demonstrated ability to partner with internal product and intent owners to help define requirements and outcomes for data-focused initiatives
- Ability to decompose large problems and execute smaller, manageable bodies of work to demonstrate continuous architecture delivery
- Understanding of machine learning and AI data infrastructure needs
- Bachelor’s degree
- At least 7 years of experience in Data Architecture and Data Engineering
- At least 7 years of Data modeling and platform design
- At least 2 years of experience in cloud computing (building applications in AWS)
Responsibilities
- Define and Implement data architecture standards, frameworks and guidelines to ensure data platform efficiency and to ensure high quality data for gaining insights / downstream consumptions
- Lead the creation of data models and ontologies to standardize data definitions, relationships and semantics across systems
- Collaborate with extended teams and stakeholders to establish data standards, metadata management practices and data quality frameworks
- Design scalable architectures that integrate various data sources, systems, and platforms while minimizing duplication
- Partner with engineering, data analysis, data science and business teams to align data solutions with business needs. Mentor technical teams in data architecture best practices
- Develop comprehensive architectural documentation and communicate data architecture principles to both technical and non-technical stakeholders
- Conduct exploratory data analysis to elucidate deficiencies and opportunities with tangible evidence
- Partner with other Distinguished Engineers across the enterprise to identify and foster investments in shared data services and platforms
- Engage with senior business product leads to understand the business strategy, value propositions, relative priorities and criteria for success.
Preferred Qualifications
- Bachelor’s or Master’s Degree in Computer Science or a related field
- At least 2 years of experience with ontology standards for defining a domain
- At least 2 years of experience using Python, SQL or Scala
- At least 1 year of experience deploying machine learning models
- Experience implementing knowledge graphs
- Familiarity with Industry standards related to Data
- Experience with data mesh, data lakehouse architectures and real-time data pipelines