Nvidia is hiring a
Deep Learning Performance Architect
NVIDIA is developing processor and system architectures that accelerate deep learning and high-performance computing applications. We are looking for an expert deep learning system performance architect to join our AI ecosystem analysis and perf projection efforts. In this position, you will have a chance to analyze state-of-the-art AI compilers and SW stacks and their performance on various hardware architectures. You will make your contributions to our dynamic technology focused company.
What you'll be doing:
Analyze state-of-the-art AI compilers and SW stacks on various hardware
Identify architecture and software performance bottlenecks and propose optimizations
Explore new features and hardware capabilities of current AI compiler and SW ecosystems
What we need to see:
MS or PhD in relevant discipline (CS, EE, Math, etc.,)
3 years work experience
Background with popular AI compilers (e.g., OpenAI Triton, MLIR, TVM, XLA)
Be familiar with typical deep learning SW framework (e.g., Torch/JAX/TensorFlow)
Experience on deep learning models and operators
Knowledge and experience on hardware architectures for deep learning applications
Please mention that you found the job on ARVR OK. Thanks.