Posted in

Senior Data Scientist / Machine Learning Engineer

Senior Data Scientist / Machine Learning Engineer

CompanyRokt
LocationNew York, NY, USA
Salary$200000 – $250000
TypeFull-Time
DegreesMaster’s
Experience LevelSenior

Requirements

  • 6+ years of industry experience as a Data Scientist (or 3+ years with a PhD), including significant work with identity, fraud, or security data
  • Master’s degree in Statistics, Econometrics, Machine Learning, or related fields
  • Strong knowledge of online AB testing and experimentation
  • Strong knowledge of Python 3 (data science libraries) and SQL
  • Experience with common Python data science libraries such as pandas, numpy and scipy
  • Experience with relational databases such as Postgres, MySQL and/or SQL Server

Responsibilities

  • Spearhead complex analyses applying advanced statistical methods and machine learning techniques to large-scale identity data, uncovering actionable insights to significantly improve the effectiveness and quality of our identity solutions and drive initiatives to completion
  • Independently identify, scope, and champion high-impact opportunities for innovation within the identity space, developing novel analytical approaches and data-driven strategies
  • Test and run experiments to determine impact on model relevancy
  • Define key metrics for tracking the health and performance of identity systems; Design, build, and maintain robust data pipelines to support machine learning model deployment and large-scale data processing for identity use cases
  • Conduct proactive analysis, modeling, and automation of improvements to entity resolution in areas such as geography, linkage, fraud detection, and risk assessment
  • Partnering closely with product and engineering teams to develop the strategy for evolving our identity graph
  • Lead high-impact projects from ideation through implementation and completion, mentoring junior team members and ensuring alignment with business objectives

Preferred Qualifications

  • Experience with distributed computing frameworks, particularly Spark is preferred
  • Experience with Kubeflow is preferred
  • Understands prompts for AI and is comfortable with using AI to improve day to day productivity
  • Experience with identity-focused data science projects (e.g., authentication, fraud, user verification, KYC)
  • Applied use of visualization tools such as matplotlib, seaborn, tableau, Python visual libraries