Logo
NVIDIA

Senior DevOps and SRE Engineer

NVIDIA, Santa Clara, California, us, 95053

Save Job

Overview

NVIDIA is seeking a passionate, motivated and technical Kubernetes Architect/Engineer to join its Infrastructure, Planning and Processes organization as a Senior DevOps & SRE Engineer to support the design and implementation of Kubernetes solutions for the Cloud Platform. The position is part of a fast-paced team that develops and maintains sophisticated build and test environments for multiple hardware platforms including NVIDIA GPUs and Tegra Processors across Windows, Linux and Android. The team collaborates with NVIDIA Software units such as Graphics Processors, Mobile Processors, Deep Learning, Artificial Intelligence, Robotics and Autonomous cars to support infrastructure and systems needs. What you’ll be doing

Architect, design, implement and maintain Kubernetes environments from planning to production/deployment to support CI/CD pipelines for GitLab, Jenkins and GitHub Actions. Design solutions with service discovery, networking, monitoring, logging and scheduling in Kubernetes. Play a critical role in ensuring our platform is easy to use, reliable, scalable and resistant to disruptions. Enable developers to deliver value while meeting customer needs for stability and security. Participate in product workshops, roadmaps and design sessions. Lead technical demos, whiteboard sessions and working meetings. Defend the proposed architectural design before the DevSecOps review board (security, networking, infrastructure, dev, ops). Develop automations to improve efficiency and productivity. Participate in on-call support and critical issue coverage as an SRE engineer. Contribute to prototyping, crafting and developing cloud infrastructure for NVIDIA. What we need to see

Kubernetes domain expertise with extensive experience building scalable, resilient platforms in both public and private clouds, capable of providing platform engineering/architecture standard methodologies (including architecting and implementing the overall platform, orchestration, security and monitoring ecosystem). High proficiency in administering and configuring Kubernetes. Programming background in Python, Go and/or similar scripting languages. Experience maintaining cloud infrastructure and highly available production environments. Ability to automate processes using CI/CD tools. Proficient in Configuration as Code and infrastructure-as-code tools such as Ansible, Puppet, Chef and Terraform. Strong background with GitLab, Jenkins, GitHub Actions and/or other CI/CD systems and Artifactory. Experience with databases both SQL (MySQL) and NoSQL (Elasticsearch / MongoDB / Cassandra). Experience with customer management/onboarding, data analytics/visualization and monitoring tools like Kibana, Grafana, Splunk, Zabbix, Prometheus and similar systems. 8+ years of proven experience. Bachelor’s or Master’s degree in computer science, software engineering, or equivalent experience. Ways to stand out from the crowd

Strong understanding of containerization and microservices architecture. Certifications such as Certified Kubernetes Administrator (CKA), Certified Kubernetes Security Specialist (CKS) and Certified Kubernetes Application Developer (CKAD) are preferred. Thrives in a multi-tasking, fast-paced environment with evolving priorities. Ability to break down complex problems into simple subproblems and reuse available solutions; design simple, efficient systems that require minimal support. Prior experience with large-scale operations teams, data center usage and a background in computer algorithms to meet scaling challenges. With competitive salaries and a generous benefits package, NVIDIA is widely regarded as a leading employer. If you’re a creative and autonomous engineer with a real passion for technology, we want to hear from you. Your base salary will be determined based on location, experience and pay of similar roles. The base salary range is 168,000 USD - 270,250 USD for Level 4, and 208,000 USD - 333,500 USD for Level 5. You will also be eligible for equity and benefits. Applications for this job will be accepted at least until September 30, 2025. NVIDIA is committed to fostering a diverse work environment and is proud to be an equal opportunity employer. We value diversity and do not discriminate on race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

#J-18808-Ljbffr