Machine Learning Operations Manager
RAI Institute - Cambridge, Massachusetts, us, 02140
Work at RAI Institute
Overview
- View job
Overview
Machine Learning Operations Manager
role at
RAI Institute 3 months ago Be among the first 25 applicants Join to apply for the
Machine Learning Operations Manager
role at
RAI Institute Our mission is to solve the most important and fundamental challenges in AI and Robotics to enable future generations of intelligent machines that will help us all live better lives.
Apply below after reading through all the details and supporting information regarding this job opportunity.
Who we are looking for:
We are seeking a Machine Learning Operations (ML-OPs) Manager who is both technically adept and an effective leader. In this role, you will lead a small team of engineers while also being hands-on in designing, building, and maintaining infrastructure that supports the entire lifecycle of Machine Learning (ML) projects. If you have a passion for building scalable ML infrastructure, mentoring engineers, and collaborating with world-class researchers, this is the role for you!
What You Will Do
Technical Leadership & Strategy: Drive the design, development, and maintenance of company-wide MLOps platforms and tools, leveraging Kubernetes infrastructure for ML and data processing applications. Team Management & Mentorship: Manage and mentor a small team of engineers, providing technical guidance, setting priorities, and fostering a collaborative team culture Scalability & Performance: Enable self-service access to ML-compute resources across on-prem and cloud environments, ensuring workload scalability, fault tolerance, and efficient job scheduling Monitoring & Observability: Enhance system observability through integrations with tools and services such as FluentD, Prometheus, Grafana, and DataDog to improve reliability and debugging Experiment & Model Lifecycle Management: Integrate ML applications with experiment tracking and model management services such as Weights and Biases Best Practices & Collaboration: Champion engineering best practices, drive improvements in CI/CD, infrastructure automation, and reproducibility. Work closely with ML Engineers, Data Engineers, DevOps teams, and researchers to accelerate research and deployment.
What You Will Bring
BS or MS in Computer Science, Engineering, or equivalent 5+ years of experience in an ML-Ops, DevOps, ML Engineering, or software engineering role 2+ years of experience managing engineers (can be formal management or technical leadership) Strong, hands-on experience with Kubernetes for ML applications Experience developing ML-Ops platforms (covering data/artifact management, reproducibility, fault tolerance, experiment tracking, and model serving) Proficiency in Python, Docker, and environment management tools (pip, poetry, uv, or similar)Familiarity with CI/CD tools (GitHub Actions, ArgoCD) and Infrastructure as Code (Terraform)
Skills We Value
Experience with job scheduling mechanisms like Kueue Hands-on experience with workflow orchestration tools (Airflow, Metaflow, Argo Workflows) Experience managing cloud infrastructure (GCP, AWS) and hybrid-cloud environments Knowledge of scalable AI/ML platforms like Ray or PyTorch Lightning Experience with logging & monitoring tools (FluentD, Prometheus, Grafana, DataDog or similar Exposure to ML model serving frameworks (TorchServe, ONNX Runtime, or similar) Previous experience collaborating with research teams in academic or industrial settings
We provide equal employment opportunities to all employees and applicants for employment and prohibit discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws.Seniority level
Seniority levelMid-Senior level Employment type
Employment typeFull-time Job function
Job functionManagement and Manufacturing IndustriesResearch Services Referrals increase your chances of interviewing at RAI Institute by 2x Sign in to set job alerts for “Operations Manager” roles. Marlborough, MA $113,600 - $147,700 2 weeks ago Boston, MA $130,000 - $160,000 14 hours ago Senior Operations Manager- Retail Encore Boston Harbor Boston, MA $110,000.00 - $140,000.00 2 weeks ago Operations Manager, Harvard Kennedy School Boston, MA $75,000.00 - $85,000.00 2 weeks ago Vice President, Clean Energy Deployment Operations Westford, MA $127,300.00 - $254,700.00 2 days ago Site Director – (PACE Center) – Great Opportunity Lowell, MAAssociate Inventory Replenishment Manager Boston, MA $90,095.00 - $135,143.00 1 day ago Avon, MA $80,000.00 - $125,000.00 1 month ago Boston, MA $57,500.00 - $65,000.00 2 weeks ago Cambridge, MA $70,000 - $85,000 3 months ago Boston, MA $152,000 - $228,000 2 weeks ago We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
#J-18808-Ljbffr