Overview
We are seeking a highly skilled Senior Network Automation Architect to design, implement, and oversee end-to-end automation frameworks for provisioning bare-metal and Kubernetes clusters across hybrid and multi-cloud environments. This role blends deep networking expertise with infrastructure-as-code principles, enabling rapid, reliable, and secure deployment of Kubernetes clusters at scale. The successful candidate will work closely with SREs, platform engineers, network engineers, and tooling teams to automate cluster lifecycle management, integrate provisioning workflows with CI/CD pipelines, and ensure observability, resilience, and compliance across all automation systems.
What You’ll Be Doing
- Design scalable, secure, and repeatable automation for Kubernetes cluster provisioning and network configuration across multi-cloud, on-prem, and hybrid environments.
- Build Infrastructure-as-Code templates, automation pipelines, and CI/CD integrations to streamline provisioning and connectivity.
- Automate networking integrations including CNI plugins, service meshes, load balancers, and ingress controllers.
- Implement policies, firewalling, routing, and self-service provisioning through API-driven workflows.
- Deliver observability with monitoring, logging, alerting, and self-healing workflows to improve resilience and performance.
- Act as the technical authority for network automation, mentoring teams and driving cross-functional automation maturity.
What We Need To See (Qualifications)
- Deep knowledge of Kubernetes networking (CNI, services, ingress, eBPF) and service meshes (Istio, Linkerd, Consul).
- Strong background in network security, firewalling, routing, and containerized environments.
- Hands-on experience with IaC and automation tools (Terraform, Ansible, Pulumi), GitOps tooling (ArgoCD, Flux), and CI/CD systems.
- Proficiency in Python, Go, or Bash, with experience building API-driven integrations (REST, gRPC).
- Familiarity with Kubernetes lifecycle management and provisioning tools (Cluster API (CAPI), kubeadm, kops, Rancher, OpenShift), including bare-metal provisioning.
- Experience with observability and tracing stacks (Prometheus, Grafana, ELK/EFK, Jaeger, OpenTelemetry).
- 8+ years in network engineering and automation development; 3+ years of experience with Kubernetes platform engineering.
- Bachelor's degree or equivalent experience.
Ways To Stand Out From The Crowd
- Proven track record of crafting and delivering automated provisioning solutions at scale.
- Experience with multi-cloud networking (AWS VPC, Azure VNet, GCP VPC).
- Excellent architecture documentation and interpersonal skills; ability to lead cross-functional technical initiatives.
About NVIDIA
NVIDIA’s deep learning platforms have made major impacts across academia, startups, and industry. We are looking for passionate, hard-working, and creative people to help us tackle opportunities in deep learning cloud solutions. NVIDIA is an equal opportunity employer that values diversity in its current and future employees and does not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.
Additional Details
- Seniority level: Mid-Senior level
- Employment type: Full-time
- Job function: Engineering and Information Technology
- Industries: Computer Hardware Manufacturing, Software Development, and Computers and Electronics Manufacturing
Applications for this job will be accepted at least until September 2, 2025.