Posted Aug 2

Nvidia is hiring a
PyTorch Distributed Developer

US, CA, Santa Clara • 5 Locations
Full time

We are looking for a Software Developer for the High-Performance Computing Software development team. The team is responsible for developing high-performance communication frameworks and applications running in production on the world’s largest supercomputers and datacenters. This position will embed you in an ambitious and diverse team that influences all areas of NVIDIA's AI platform and contributes directly to PyTorch, a premier Deep Learning framework. In this role you will work with multiple teams at NVIDIA across fields, as well as collaborate internationally with the PyTorch community to develop the best AI platform in the world! The work environment is dynamic and challenging; we are innovating and inventing software products at the forefront of technology in terms of performance, scalability, and features. In addition, our team works closely with networking chip design teams in co-designing new hardware features and software APIs.

What you'll be doing:

  • Design and build core PyTorch components to maximize network performance and utilization.

  • Collaborate with NVIDIA’s hardware and software teams to improve GPU and networking performance in PyTorch.

  • Design, build and support production AI solutions used by enterprise customers and partners.

  • Work with internal applied researchers to improve their AI tools.

  • Advise on the design of new hardware generations.

  • Design and implement new protocols and algorithms that enable the best performance possible on NVIDIA networked supercomputers and datacenters.

  • Deliver production-grade solutions and get hands-on with ground-breaking technology.

  • Work closely with technical leaders solving some of the biggest challenges in machine learning, cloud computing, and system co-design.

What we need to see:

  • 5+ years of experience programming in C/C++

  • 5+ years of experience programming in Python

  • Deep knowledge of the InfiniBand and Ethernet networking protocols

  • Experience in network performance optimizations

  • Experience with PyTorch

  • BS in computer science / software engineering (or equivalent experience)

Ways to stand out from the crowd:

  • You have a positive attitude and work well with others.

  • Demonstrated use of creative ideas, providing solutions to challenging problems.

  • PhD in CS/EE/Math/Physics

  • Experience with PyTorch distributed

  • Experience with parallel programming for CPU or GPU architectures.

  • Background in RDMA technology

  • Open source software contributions

With competitive salaries and a generous benefits package, we are widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us and, due to unprecedented growth, our exclusive engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you.

The base salary range is $144,000 - $270,250. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
