Phizenix
1 month ago Be among the first 25 applicants
Menlo Park, CA | On-Site | Full-Time/Direct Hire
Looking for ML Infra experts (Bay Area preferred) with deep experience in CUDA, GPU optimization, VLLMs, and LLM inference—pure language focus, no vision/audio.
Client Opportunity | Through Phizenix
Phizenix, a certified minority and women-led recruiting firm, is hiring on behalf of an AI startup pioneering diffusion-based large language models—built for faster generation, multimodal integration, and scalable enterprise deployment. Menlo Park, CA | On-Site | Full-Time/Direct Hire
Looking for ML Infra experts (Bay Area preferred) with deep experience in CUDA, GPU optimization, VLLMs, and LLM inference—pure language focus, no vision/audio.
Client Opportunity | Through Phizenix
Phizenix, a certified minority and women-led recruiting firm, is hiring on behalf of an AI startup pioneering diffusion-based large language models—built for faster generation, multimodal integration, and scalable enterprise deployment.
We’re looking for a
ML Infrastructure Engineer
to help build the infrastructure that powers large-scale model training and real-time inference. You’ll collaborate with world-class researchers and engineers to design high-performance, distributed systems that bring advanced LLMs into production.
Responsibilities
Design and manage distributed infrastructure for ML training at scale Optimize model serving systems for low-latency inference Build automated pipelines for data processing, model training, and deployment Implement observability tools to monitor performance in production Maximize resource utilization across GPU clusters and cloud environments Translate research requirements into robust, scalable system designs
Must-Haves
Masters or PhD in Computer Science, Engineering, or a related field (or equivalent experience) Strong foundation in software engineering, systems design, and distributed systems Experience with cloud platforms (AWS, GCP, or Azure) Proficient in Python and at least one systems-level language (C++/Rust/Go) Hands-on experience with Docker, Kubernetes, and CI/CD workflows Familiarity with ML frameworks like PyTorch or TensorFlow from a systems perspective Understanding of GPU programming and high-performance infrastructure
Nice-to-Haves
Experience with large-scale ML training clusters and GPU orchestration Knowledge of LLM-serving tools (vLLM, TensorRT, ONNX Runtime) Experience with distributed training strategies (e.g., data/model/pipeline parallelism) Familiarity with orchestration tools like Kubeflow or Airflow Background in performance tuning, system profiling, and MLOps best practices
At
Phizenix , we’re committed to supporting diverse and inclusive teams. This is your chance to shape the systems that power the next generation of AI innovation. Let’s build the future—together.
California Pay Range: $180,000 USD - $200,000 USD Seniority level
Seniority level Not Applicable Employment type
Employment type Full-time Job function
Industries Real Estate, Financial Services, and Capital Markets Referrals increase your chances of interviewing at PHIZENIX by 2x Get notified about new Infrastructure Engineer jobs in
Menlo Park, CA . Fall 2025 Onboard Infrastructure Engineer
San Jose, CA $113,600.00-$170,400.00 2 weeks ago Dublin, CA $128,000.00-$173,200.00 1 week ago Palo Alto, CA $144,000.00-$216,000.00 2 weeks ago San Jose, CA $84,000.00-$134,000.00 1 week ago San Jose, CA $130,000.00-$182,000.00 5 months ago Fremont, CA $70,000.00-$100,000.00 2 weeks ago Hayward, CA $100,000.00-$150,000.00 6 months ago San Jose, CA $84,000.00-$134,000.00 1 month ago Sunnyvale, CA $204,000.00-$247,000.00 1 month ago San Jose, CA $82,000.00-$133,000.00 1 week ago Sunnyvale, CA $168,000.00-$276,000.00 3 weeks ago Palo Alto, CA $149,500.00-$184,000.00 2 weeks ago San Mateo, CA $157,000.00-$171,500.00 1 month ago San Jose, CA $123,500.00-$212,850.00 1 week ago Data Infrastructure Engineer, Google Fi and Store
Mountain View, CA $166,000.00-$244,000.00 5 days ago Palo Alto, CA $146,000.00-$183,000.00 19 hours ago We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
#J-18808-Ljbffr
Looking for ML Infra experts (Bay Area preferred) with deep experience in CUDA, GPU optimization, VLLMs, and LLM inference—pure language focus, no vision/audio.
Client Opportunity | Through Phizenix
Phizenix, a certified minority and women-led recruiting firm, is hiring on behalf of an AI startup pioneering diffusion-based large language models—built for faster generation, multimodal integration, and scalable enterprise deployment. Menlo Park, CA | On-Site | Full-Time/Direct Hire
Looking for ML Infra experts (Bay Area preferred) with deep experience in CUDA, GPU optimization, VLLMs, and LLM inference—pure language focus, no vision/audio.
Client Opportunity | Through Phizenix
Phizenix, a certified minority and women-led recruiting firm, is hiring on behalf of an AI startup pioneering diffusion-based large language models—built for faster generation, multimodal integration, and scalable enterprise deployment.
We’re looking for a
ML Infrastructure Engineer
to help build the infrastructure that powers large-scale model training and real-time inference. You’ll collaborate with world-class researchers and engineers to design high-performance, distributed systems that bring advanced LLMs into production.
Responsibilities
Design and manage distributed infrastructure for ML training at scale Optimize model serving systems for low-latency inference Build automated pipelines for data processing, model training, and deployment Implement observability tools to monitor performance in production Maximize resource utilization across GPU clusters and cloud environments Translate research requirements into robust, scalable system designs
Must-Haves
Masters or PhD in Computer Science, Engineering, or a related field (or equivalent experience) Strong foundation in software engineering, systems design, and distributed systems Experience with cloud platforms (AWS, GCP, or Azure) Proficient in Python and at least one systems-level language (C++/Rust/Go) Hands-on experience with Docker, Kubernetes, and CI/CD workflows Familiarity with ML frameworks like PyTorch or TensorFlow from a systems perspective Understanding of GPU programming and high-performance infrastructure
Nice-to-Haves
Experience with large-scale ML training clusters and GPU orchestration Knowledge of LLM-serving tools (vLLM, TensorRT, ONNX Runtime) Experience with distributed training strategies (e.g., data/model/pipeline parallelism) Familiarity with orchestration tools like Kubeflow or Airflow Background in performance tuning, system profiling, and MLOps best practices
At
Phizenix , we’re committed to supporting diverse and inclusive teams. This is your chance to shape the systems that power the next generation of AI innovation. Let’s build the future—together.
California Pay Range: $180,000 USD - $200,000 USD Seniority level
Seniority level Not Applicable Employment type
Employment type Full-time Job function
Industries Real Estate, Financial Services, and Capital Markets Referrals increase your chances of interviewing at PHIZENIX by 2x Get notified about new Infrastructure Engineer jobs in
Menlo Park, CA . Fall 2025 Onboard Infrastructure Engineer
San Jose, CA $113,600.00-$170,400.00 2 weeks ago Dublin, CA $128,000.00-$173,200.00 1 week ago Palo Alto, CA $144,000.00-$216,000.00 2 weeks ago San Jose, CA $84,000.00-$134,000.00 1 week ago San Jose, CA $130,000.00-$182,000.00 5 months ago Fremont, CA $70,000.00-$100,000.00 2 weeks ago Hayward, CA $100,000.00-$150,000.00 6 months ago San Jose, CA $84,000.00-$134,000.00 1 month ago Sunnyvale, CA $204,000.00-$247,000.00 1 month ago San Jose, CA $82,000.00-$133,000.00 1 week ago Sunnyvale, CA $168,000.00-$276,000.00 3 weeks ago Palo Alto, CA $149,500.00-$184,000.00 2 weeks ago San Mateo, CA $157,000.00-$171,500.00 1 month ago San Jose, CA $123,500.00-$212,850.00 1 week ago Data Infrastructure Engineer, Google Fi and Store
Mountain View, CA $166,000.00-$244,000.00 5 days ago Palo Alto, CA $146,000.00-$183,000.00 19 hours ago We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
#J-18808-Ljbffr