NVIDIA is hiring a
Senior System Software Engineer - Infrastructure
We are on the hunt for a highly skilled and passionate Senior Infrastructure Software Engineer to join our dynamic Cloud Engineering Services team. This role offers an exceptional opportunity to leave your mark on the design, construction, and optimization of large-scale infrastructure for various foundational NVIDIA unified cloud services. If you are a dedicated engineer with a deep understanding of cloud infrastructure and distributed systems, and you thrive in a challenging, innovative environment, this could be the perfect role for you.
What you'll be doing:
Engage with product engineering teams to gain a comprehensive understanding of their infrastructure use cases. Communicate design trade-offs effectively and construct scalable systems to meet their unique needs.
Develop advanced tooling to automate the build and deployment of microservices and infrastructure components, enhancing efficiency and productivity.
Proactively identify bottlenecks in the daily usage of core infrastructure and implement robust solutions to resolve them.
Reduce manual labor and increase operational efficiency through automation.
Monitor the infrastructure to alert on significant events, ensuring the highest level of system performance and reliability.
What we need to see:
A Master's or Ph.D. in Computer Science or a related field, or equivalent experience.
4+ years of hands-on experience in designing and building infrastructure to support large-scale, fault-tolerant distributed services.
Strong experience with cloud infrastructure platforms like AWS, Azure, or Google Cloud.
High level of proficiency in Infrastructure as Code and Configuration Management tools like Terraform.
Expertise in administering, operating, and configuring Kubernetes and Envoy.
Proficiency in scripting languages such as Python.
Demonstrated experience with Continuous Integration/Continuous Delivery (CI/CD) tools such as GitLab and Jenkins, and with the GitOps model.
Proficiency in various monitoring tools such as Prometheus, Grafana, CloudWatch, and Thanos.
Strong background in cloud security, Kubernetes security, and application security.
Proficiency in debugging issues involving networks, DNS, HTTP, Linux, and containers.
Ways to Stand Out from the Crowd:
You're not just someone who can follow instructions, but a true innovator who isn't afraid to challenge the status quo and bring fresh ideas to the table. You're always looking for ways to improve existing systems and processes.
Deep understanding of the latest technologies and trends in cloud infrastructure and distributed systems. You're not just familiar with the tools, but you understand the underlying principles and can leverage this knowledge to make strategic decisions.
Committed to personal and professional growth. You're always looking for opportunities to learn new skills and deepen your expertise.
By joining our team, you will be part of a forward-thinking company that values innovation and creativity. We offer a competitive salary and benefits package, a flexible work environment, and the opportunity to work with some of the brightest minds in the industry. If you're ready to take your career to the next level, we'd love to hear from you.