Posted Aug 16

NVIDIA is hiring a
Deep Learning Performance Architect

China, Shanghai • China, Beijing • 2 Locations
Full time

We are now looking for a Deep Learning Architect! 


Are you passionate about exploring computer architectures for deep learning? Do you like to work at the intersection of hardware and software? NVIDIA is seeking world-class programmers and performance architects who love to squeeze every cycle of performance out of deep learning code. In this role, you will develop and maintain a library that ships our best-performing GPU kernels to NVIDIA's industry-leading DL products. This position offers the opportunity to have real impact in a fast-moving, technology-focused company.


What you'll be doing: 

  • Design and develop the architecture, interface and features of the kernel library

  • Keep the library and its GPU kernels well maintained

  • Collaborate closely with DL product teams as well as kernel development teams


What we need to see: 

  • MS or PhD in a relevant discipline (CS, EE, Math), with 5+ years of work experience

  • Strong programming skills in C, C++, and Python 

  • Strong problem-solving skills and learning ability

  • Strong experience in software development best practices

  • Familiar with CUDA programming and GPU architecture

  • Good communication skills and a team player


Ways to stand out from the crowd: 

  • Familiar with TensorRT, cuDNN, cuBLAS, etc.

  • Familiar with DL fundamentals, frameworks, graph compilers, etc.

  • Experience in development on Linux and Windows platforms

  • Experience in software testing, functional verification, etc.

  • Hands-on experience with DevOps tools such as Docker, Jenkins, and Kubernetes

  • Strong experience in CMake
