Nvidia is hiring a
Deep Learning Performance Architect
We are now looking for a Deep Learning Architect!
Are you passionate about exploring computer architectures for deep learning? Do you like to work at the intersection of hardware and software? NVIDIA is seeking world-class programmers and performance architects who love to squeeze every cycle of performance out of deep learning code. In this role, you will develop and maintain a library that ships our best-performing GPU kernels to NVIDIA's industry-leading DL products. This position offers the opportunity to have a real impact in a fast-moving, technology-focused company.
What you'll be doing:
Design and develop the architecture, interface, and features of the kernel library
Keep the library and its GPU kernels in good working order
Collaborate closely with DL product teams as well as kernel development teams
What we need to see:
MS or PhD in a relevant discipline (CS, EE, Math), 5+ years of work experience
Strong programming skills in C, C++, and Python
Strong problem-solving skills and the ability to learn quickly
Strong experience in software development best practices
Familiar with CUDA programming and GPU architecture
Good communication skills and a team-player attitude
Ways to stand out from the crowd:
Familiar with TensorRT/cuDNN/cuBLAS etc.
Familiar with DL fundamentals, frameworks, graph compilers etc.
Experience in development on Linux and Windows platforms
Experience in software testing, functional verification etc.
Hands-on experience with DevOps tools such as Docker, Jenkins, and Kubernetes
Strong experience in CMake