Posted Aug 16

Nvidia is hiring a
Hardware Failure Analysis Engineer

US, CA, Santa Clara • US, CA, Santa Clara • 2 Locations
Full time

Our technology has no boundaries! NVIDIA is building the world’s most groundbreaking, state-of-the-art compute platforms for the world to use. It’s because of our work that scientists, researchers and engineers can advance their ideas. At its core, our visual computing technology not only enables an amazing computing experience, but is also energy efficient! We pioneered a supercharged form of computing loved by the most demanding computer users in the world: scientists, designers, artists, and gamers. It’s not just technology though! It is our people, some of the brightest in the world, and our diverse company culture that make NVIDIA one of the most fun, innovative and dynamic places to work in the world! At the center of NVIDIA's culture are our core values, such as innovation, excellence, determination and team, that guide us to be the best we can be.

We are looking for a Failure Analysis Engineer in the Datacenter Systems Engineering team. You'll work closely with board designers, Silicon Solutions and AE teams to debug and root cause customer and internal build issues on NVIDIA's datacenter modules. The ideal candidates are self-motivated, very comfortable in a lab environment and demonstrate a passion for system-level product quality. Candidates should have strong debug/circuit analysis fundamentals as well as a broad understanding of HW/firmware interactions. They must be capable of growing in a fast-paced environment with evolving product definitions.

What you’ll be doing:

  • Analyze, coordinate, and perform failure analysis activities to identify root cause of customer reported issues on Datacenter products.

  • Debug systems, triage issues, perform root cause analysis, verify fixes, define new tests, and improve product test plans.

  • Provide debug support in NPI phases to internal teams and factory builds.

  • Assist the validation team with electrical and functional validation of power sensors, MCU, I2C, SPI, SMBus and PCIe interfaces.

  • Define diagnostics for engineering builds to fully test products being designed.

What we need to see:

  • BSEE/BSCE or equivalent experience

  • 3+ years of hands-on hardware/software debug experience.

  • Strong understanding of digital design, circuit design analysis and computer architecture.

  • Programming skills with experience in Python or other scripting languages (such as Perl, Shell).

  • Hands-on lab experience with board bring-up, lab debug and lab tools (logic analyzers, oscilloscopes, multimeters).

  • Motivated self-starter with excellent analytical and problem-solving skills

Ways to stand out from the crowd:

  • Ability to handle multiple high-priority tasks in parallel

  • Experience with BMCs or other server management processors.

  • Experience with Linux command line operation and updating OS/SW/FW on servers.

  • LSIO/HSIO validation experience

The base salary range is $104,000 - $195,500. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
