Pro Talent Crafter
Senior Platform DevOps HPC Engineer : : Mountain View, CA Hybrid
Pro Talent Crafter, Mountain View, California, us, 94039
Title : Senior Platform DevOps HPC Engineer
Duration : 12 months
Location : Mountain View, CA Hybrid 3 days per week onsite
Mode of Interview : Video
The position is ideal for someone with deep experience in DevOps, HPC infrastructure, and cloud technologies, especially in mission-critical environments. Below are the key details :
Role Overview
As a Senior Platform Engineer, you'll help architect and maintain secure, high-performance infrastructure supporting aerospace engineering, simulation, and mission operations. You'll work across cloud and on-prem environments, enabling rapid development and secure operations. Key Responsibilities
Architect and manage scalable cloud and HPC infrastructure. Enhance HPC clusters for simulation and modeling workloads. Build observability platforms (monitoring, logging, alerting). Automate infrastructure using Terraform and Ansible. Optimize CI / CD pipelines (GitLab preferred). Troubleshoot across cloud, on-prem, and HPC environments. Ensure performance, security, and compliance. Required Qualifications
12+ years in Platform Engineering, SRE, or DevOps. Experience with HPC clusters (Slurm, PBS, Grid Engine). Cloud infrastructure expertise (GCP / AWS preferred). Proficiency with Terraform, Ansible, Prometheus, Grafana, ELK. Strong Linux administration and scripting (Python, Bash, Go). CI / CD pipeline experience. Preferred Experience
Aerospace or defense industry background. Hybrid HPC solutions (e.g., AWS ParallelCluster). Familiarity with NIST 800-53, FedRAMP. Experience with parallel filesystems (Lustre, GPFS, PANFS, NFS). Kubernetes in production environments. GPU computing (CUDA, AI / ML). Relevant certifications (AWS, HashiCorp, CNCF).
#J-18808-Ljbffr
As a Senior Platform Engineer, you'll help architect and maintain secure, high-performance infrastructure supporting aerospace engineering, simulation, and mission operations. You'll work across cloud and on-prem environments, enabling rapid development and secure operations. Key Responsibilities
Architect and manage scalable cloud and HPC infrastructure. Enhance HPC clusters for simulation and modeling workloads. Build observability platforms (monitoring, logging, alerting). Automate infrastructure using Terraform and Ansible. Optimize CI / CD pipelines (GitLab preferred). Troubleshoot across cloud, on-prem, and HPC environments. Ensure performance, security, and compliance. Required Qualifications
12+ years in Platform Engineering, SRE, or DevOps. Experience with HPC clusters (Slurm, PBS, Grid Engine). Cloud infrastructure expertise (GCP / AWS preferred). Proficiency with Terraform, Ansible, Prometheus, Grafana, ELK. Strong Linux administration and scripting (Python, Bash, Go). CI / CD pipeline experience. Preferred Experience
Aerospace or defense industry background. Hybrid HPC solutions (e.g., AWS ParallelCluster). Familiarity with NIST 800-53, FedRAMP. Experience with parallel filesystems (Lustre, GPFS, PANFS, NFS). Kubernetes in production environments. GPU computing (CUDA, AI / ML). Relevant certifications (AWS, HashiCorp, CNCF).
#J-18808-Ljbffr