Nvidia is hiring a
Senior Research Engineer
We are now looking for a Senior Research Engineer focused on Generative AI inference. Are you excited to change the way people infuse AI into products and services? NVIDIA is at the forefront of generative AI models, from language o images. NVIDIA's Nemo LLM project provides building blocks to democratize AI and make generative AI easy to develop, integrate, and deploy. Our team is dedicated to developing optimized inferencing technologies to support our growing generative AI needs. We contribute to all steps of the machine learning lifecycle: from conceptualization, to applied research, engineering for optimized inference, and deployment.
As a research engineer on the team, you will interact with internal partners, users, and members of the open-source community to define, analyze, and implement highly optimized algorithms for speech recognition, natural language understanding, image generation and speech synthesis. The scope of these efforts includes a combination of implementing new algorithms, performance tuning and analysis, defining APIs, analyzing functionality coverage, and other general software engineering work.
What you will be doing:
Developing new models and algorithms in Speech Recognition, Speech Synthesis, Natural Language Processing and Deep Learning
Architecting and implementing features in C++, CUDA, and Python
Working with engineering teams across all of NVIDIA to ensure your software integrates seamlessly up and down the stack
What we need to see:
Understanding of modern techniques in Machine Learning, Deep Neural Networks, Natural Language Processing, or Speech Recognition
6+ years industry experience in Deep Learning frameworks (PyTorch or Tensorflow)
Passion for software engineering. We are especially looking for excellent C++ and Python development skills, with meaningful contributions to major open-source projects.
Strong communication and interpersonal skills along with the ability to work in a dynamic and distributed team. Your history of mentoring junior engineers and interns is a huge plus.
Bachelor's degree or equivalent experience.
A desire to constantly grow and learn new things.
Strong computer science fundamentals - algorithms and data structures, computational complexity, parallel and distributed computing, system software.
Ways to stand out from a crowd:
Experience architecting or developing large-scale distributed systems
Knowledge of CPU and/or GPU architecture
GPU programming (CUDA)
You will also be eligible for equity and benefits.
Please mention that you found the job on ARVR OK. Thanks.