NVIDIA

Senior System Software Engineer - Infrastructure

NVIDIA, Santa Clara, California, US, 95053

Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIA engineer, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work.

Job Details

Seniority Level: Mid‑Senior level

Employment Type: Full‑time

Responsibilities

Designing, deploying, and maintaining scalable AWS infrastructure using EKS, EC2, S3, and related services.

Managing and optimizing Kubernetes clusters for high availability, resilience, and performance.

Creating and maintaining GitLab CI/CD pipelines to automate build, test, and deployment workflows.

Developing automation scripts and Infrastructure as Code (IaC) templates with Terraform.

Monitoring system performance and implementing logging, metrics, and alerting through LGTM, Prometheus, Datadog, or Splunk.

Implementing DevSecOps best practices, embedding security scans, compliance checks, and secret management in the CI/CD lifecycle.

Supporting platform observability, diagnosing production incidents, and enhancing self‑service for developer teams.

Collaborating with cross‑functional teams to streamline delivery and improve developer productivity.

Qualifications

BS/MS in Computer Science and/or equivalent experience.

12+ years of hands‑on experience building/supporting complex services.

Strong hands‑on experience with AWS services (VPC, IAM, EC2, EKS, Lambda, CloudWatch).

Deep knowledge of Kubernetes internals, Helm charts, and container orchestration principles.

Proficiency with GitLab CI/CD or equivalent pipeline automation tools.

Experience implementing GitOps workflows (ArgoCD, FluxCD).

Strong foundation in scripting languages such as Python, Bash, or Go.

Familiarity with networking, load balancing, and security in cloud‑native environments.

Experience enforcing cloud and container security standards and compliance practices.

Excellent documentation, problem‑solving, and communication skills for cross‑team alignment.

Ways to Stand Out

Managed multi‑cloud and hybrid Kubernetes clusters across AWS, GCP, and Azure.

Contributed to open‑source DevOps projects, including Kubernetes and GitLab initiatives.

Earned certifications such as CKA, AWS DevOps Engineer, and GitLab Certified Specialist.

Applied AI/ML tools and AIOps platforms for predictive monitoring and automation.

Led DevOps teams in platform engineering, chaos testing, disaster recovery, and process optimization.

Compensation & Benefits

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 224,000 USD - 356,500 USD.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until December 13, 2025.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. We do not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Reference ID: JR2006864
