Posted Aug 2

Nvidia is hiring a
Deep Learning Performance Architect

China, Shanghai • China, Shanghai • 2 Locations
Full time

NVIDIA is developing processor and system architectures that accelerate deep learning and high-performance computing applications. We are looking for an expert deep learning system performance architect to join our AI ecosystem analysis and perf projection efforts. In this position, you will have a chance to analyze state-of-the-art AI compilers and SW stacks and their performance on various hardware architectures. You will make your contributions to our dynamic technology focused company. 

What you'll be doing:

  • Analyze state-of-the-art AI compilers and SW stacks on various hardware

  • Identify architecture and software performance bottlenecks and propose optimizations

  • Explore new features and hardware capabilities of current AI compiler and SW ecosystems

What we need to see:

  • MS or PhD in relevant discipline (CS, EE, Math, etc.,)

  • 3 years work experience

  • Background with popular AI compilers (e.g., OpenAI Triton, MLIR, TVM, XLA)

  • Be familiar with typical deep learning SW framework (e.g., Torch/JAX/TensorFlow)

  • Experience on deep learning models and operators

  • Knowledge and experience on hardware architectures for deep learning applications


Please mention that you found the job on ARVR OK. Thanks.