Odyssey
Member of Technical Staff, Infrastructure Engineer
Odyssey, Santa Clara, California, us, 95053
Member of Technical Staff, Infrastructure Engineer
Join to apply for the
Member of Technical Staff, Infrastructure Engineer
role at
Odyssey
Who we are Odyssey is an AI lab pioneering interactive video models—models that dream video you can watch and interact with in real-time. This new form of general-purpose intelligence will power the next generation of gaming, film, education, social media, advertising, training, companionship, simulation, and entirely new applications.
What we're looking for A fresh perspective for building the engines that make groundbreaking research and products possible. Think in systems, love performance, and get energy from turning theoretical bottlenecks into beautifully efficient reality. Excited to design infrastructure not just for scale, but for speed, creativity, and discovery. Build the compute substrate that lets Odyssey’s models imagine, act, and interact in real time.
Who you are
Develop and operate our low-latency model inference platform, ensuring high availability, scaling, and efficient resource utilization for Odyssey’s products.
Engineer and scale our core data processing infrastructure (e.g., Flyte, Ray with k8s) to handle petabyte-scale datasets.
Design, build, and maintain our large-scale, GPU-based training clusters for deep learning, focusing on high throughput and reliability.
Automate infrastructure provisioning, configuration, monitoring, and alerting using Infrastructure as Code (IaC) principles.
Drive performance tuning, cost optimization, and reliability improvements across the entire stack.
Collaborate closely with researchers and product developers to understand their requirements, optimize their workflows, and improve platform usability.
What you’ll do
Strong programming skills (e.g., Python, Go, or similar) and a solid understanding of software engineering best practices.
Deep, hands‑on experience with containerization (e.g., Docker), container orchestration (Kubernetes) and Infrastructure as Code (Terraform).
Proven experience building and managing large-scale, distributed systems with GPU computational workloads (e.g., compute platforms, data pipelines, or high‑availability services).
Experienced in designing infrastructure for ML workloads where performance, parallelism, and data movement are critical.
A collaborative mindset and excellent communication skills, with a passion for building developer-friendly platforms.
Motivated by building for the frontier: you want to shape the compute and infrastructure foundation of a lab redefining how people create and interact with media.
Seniority level Mid‑Senior level
Employment type Full‑time
Job function Engineering and Information Technology
Industries Research Services
Referrals increase your chances of interviewing at Odyssey by 2x
#J-18808-Ljbffr
Member of Technical Staff, Infrastructure Engineer
role at
Odyssey
Who we are Odyssey is an AI lab pioneering interactive video models—models that dream video you can watch and interact with in real-time. This new form of general-purpose intelligence will power the next generation of gaming, film, education, social media, advertising, training, companionship, simulation, and entirely new applications.
What we're looking for A fresh perspective for building the engines that make groundbreaking research and products possible. Think in systems, love performance, and get energy from turning theoretical bottlenecks into beautifully efficient reality. Excited to design infrastructure not just for scale, but for speed, creativity, and discovery. Build the compute substrate that lets Odyssey’s models imagine, act, and interact in real time.
Who you are
Develop and operate our low-latency model inference platform, ensuring high availability, scaling, and efficient resource utilization for Odyssey’s products.
Engineer and scale our core data processing infrastructure (e.g., Flyte, Ray with k8s) to handle petabyte-scale datasets.
Design, build, and maintain our large-scale, GPU-based training clusters for deep learning, focusing on high throughput and reliability.
Automate infrastructure provisioning, configuration, monitoring, and alerting using Infrastructure as Code (IaC) principles.
Drive performance tuning, cost optimization, and reliability improvements across the entire stack.
Collaborate closely with researchers and product developers to understand their requirements, optimize their workflows, and improve platform usability.
What you’ll do
Strong programming skills (e.g., Python, Go, or similar) and a solid understanding of software engineering best practices.
Deep, hands‑on experience with containerization (e.g., Docker), container orchestration (Kubernetes) and Infrastructure as Code (Terraform).
Proven experience building and managing large-scale, distributed systems with GPU computational workloads (e.g., compute platforms, data pipelines, or high‑availability services).
Experienced in designing infrastructure for ML workloads where performance, parallelism, and data movement are critical.
A collaborative mindset and excellent communication skills, with a passion for building developer-friendly platforms.
Motivated by building for the frontier: you want to shape the compute and infrastructure foundation of a lab redefining how people create and interact with media.
Seniority level Mid‑Senior level
Employment type Full‑time
Job function Engineering and Information Technology
Industries Research Services
Referrals increase your chances of interviewing at Odyssey by 2x
#J-18808-Ljbffr