Posted in

Senior Deep Learning Performance Architect

Senior Deep Learning Performance Architect

CompanyNVIDIA
LocationRedmond, WA, USA, Santa Clara, CA, USA
Salary$184000 – $356500
TypeFull-Time
DegreesMaster’s, PhD
Experience LevelSenior, Expert or higher

Requirements

  • MS or PhD in Computer Science, Computer Engineering, Electrical Engineering or equivalent experience
  • 6+ years of meaningful work experience
  • Strong background in GPU or Deep Learning ASIC architecture for training and/or inference
  • Experience with performance modeling, architecture simulation, profiling, and analysis
  • Solid foundation in machine learning and deep learning
  • Strong programming skills in Python, C, C++

Responsibilities

  • Develop innovative architectures to extend the state of the art in deep learning performance and efficiency
  • Analyze performance, cost and power trade-offs by developing analytical models, simulators and test suites
  • Understand and analyze the interplay of hardware and software architectures on future algorithms, programming models and applications
  • Develop, analyze, and harness groundbreaking Deep Learning frameworks, libraries, and compilers
  • Actively collaborate with software, product and research teams to guide the direction of deep learning HW and SW

Preferred Qualifications

  • Background with deep neural network training, inference and optimization in leading frameworks (e.g. Pytorch, JAX, TensorRT)
  • Experience with relevant libraries, compilers, and languages – CUDNN, CUBLAS, CUTLASS, MLIR, Triton, CUDA, OpenCL
  • Experience with the architecture of or workload analysis on other DL accelerators
  • Demonstration of self-motivation, with a knack for critical thinking and thinking outside the box