Senior Site Reliability Engineer - Infrastructure

Remote · USA Full-time

NVIDIA has continuously reinvented itself over two decades. Our invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI, the next era of computing. NVIDIA is a “learning machine” that constantly evolves by seeking new opportunities that are hard to solve, that only we can address, and that matter to the world. This is our life’s work, to amplify human creativity and intelligence. Make the choice to join us today!

As an SRE, or an engineer with equivalent experience, you’ll collaborate with various teams to improve the infrastructure environment within NVIDIA’s Hardware Infrastructure team. You will enable our engineers to have the best environment on the planet to make the most innovative chips in the world. You will work with your team of EDA and software experts to build new infrastructure in an agile environment. You will continuously innovate and improve scalable, reliable, high-performance systems and tools to enable the next generation of chips!

What You’ll Be Doing

  • Develop automation in order to scale infrastructure easily and reliably.
  • Use broad IT infrastructure skills to implement infrastructure innovations which accelerate chip development.
  • Design and implement network architecture, storage solutions, virtualization, and services specific to EDA workflows.
  • Work closely with EDA teams to understand their requirements and translate them into infrastructure solutions.
  • Work in a diverse team performing fast-paced investigations to empower engineers to develop at the speed of light.
  • Collaborate to improve how our chip development process utilizes our infrastructure.
  • Directly contribute to the overall quality of our next-generation chips and improve their time to market.

What We Need To See

  • Experience with automation workflows such as Ansible and Jenkins.
  • UNIX systems programming and automation using industry-standard languages, and familiarity with API calls. Python experience preferred.
  • Authoritative-level usage of UNIX and UNIX CLI utilities such as sed, awk, and grep.
  • Hands-on experience with architectural decisions in the technologies (storage, networking, compute) our chip engineers depend on.
  • Understanding of distributed UNIX system concepts such as NFS, autofs, DNS, LDAP and/or NIS.
  • Excellent planning and communication skills and a passion for improving the productivity and efficiency of other specialists.
  • Strong experience investigating and debugging complex, multi-discipline problems in a UNIX environment.
  • 5+ years of experience in a large, distributed UNIX environment.
  • History of using data analysis principles and influencing data-driven decisions.
  • MS (preferred) or BS in Computer Science or a similar degree, or equivalent experience.

Ways To Stand Out From The Crowd

  • Extensive knowledge of job schedulers (in particular IBM Spectrum LSF and/or SLURM).
  • Experience with Perl.
  • Deep understanding of distributed system principles.
  • Experience with chip design workflows, such as front end verification, back end workflows, or mixed signal workflows.
  • Experience in crafting solutions that balance security and productivity for the end user.

The base salary range is 148,000 USD - 287,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

JR1989663
