Nvidia is hiring a
Senior Distributed Systems Engineer, AI Infrastructure
NVIDIA is hiring a senior data and distributed systems engineer to architect, lead and develop our exa-scale AI infrastructure and deep learning platform for Autonomous Vehicles. You will need to have strong programming skills, a deep understanding of cloud technologies, distributed storage & compute systems, and distributed systems architecture. You will need to have excellent communication and planning skills. You ideally have experience in securing distributed systems or willingness to learn it. Together, we will build the exa-scale software 2.0 cloud platform for one of the most ambitious problems of our time: autonomous vehicles.
What you'll be doing:
Design and implement scalable and distributed services that will help power the AI infrastructure for deep learning platforms.
Design and build infrastructure and microservices that help index, mine, transform, and compose PB sized deep learning datasets.
Design the next generation of dataset management services for real and synthetic / simulated datasets.
You will enable smart data selection - one of the key ingredients for successful machine learning!
Collaborate with multiple AI teams to understand their requirements and build a future-proof platform that improves their productivity.
Support users of the platform.
What we need to see:
BS or MS in Computer Architecture, Computer Science or related field or equivalent experience.
5+ years of work or research experience in distributed systems design and development.
Strong programming background that incorporates methodologies like data structures, algorithms, system architecture design, etc.
Strong system designing experiences to build distributed storage and compute systems, microservices, and web applications.
Proven technical foundation in distributed computing and storage, including significant experience with most of the following: server systems, storage, I/O, networking, and systems software.
Expert level programmer in at least one of Go/Java/C++/Python.
Ability to switch effectively between long-term strategic and near-term tactical topics.
Highly motivated with strong interpersonal skills, you have the ability to work successfully with multi-functional teams, principles and architects and coordinate optimally across interpersonal boundaries and geographies.
Ways to stand out from the crowd:
Knowledge of big data platform. Spark, Kafka, Flink, etc.
Experience with Kubernetes and Docker.
A proactive demeanor to investigate and understand technical requirements.
With highly competitive salaries and a comprehensive benefits package, NVIDIA is widely considered to be one of the technology industry's most desirable employers. We have some of the most forward-thinking and hardworking people in the world working with us and our engineering teams are growing fast in some of the hottest state of the art fields: Deep Learning, Artificial Intelligence, and Autonomous Vehicles. If you're a creative computer scientist/engineer with a real passion for distributed systems and autonomous driving, we want to hear from you.
Please mention that you found the job on ARVR OK. Thanks.