Synergis
Senior Machine Learning Infrastructure Engineer
Synergis, Millbrae, California, United States, 94031
Senior Machine Learning Infrastructure Engineer
Direct Hire
$195K-$295K
About the Team:
The ML Inference Platform is a vital part of the AI Compute Platforms organization within Infrastructure Platforms. Our team is dedicated to delivering a cloud-agnostic, reliable, and cost-effective infrastructure that drives our clients' AI initiatives. We proudly support teams developing autonomous vehicles (L3/L4/L5) and various groups creating AI-driven products. Our focus is on enabling rapid innovation and streamlining feature development, enhancing performance, availability, concurrency, and scalability for state-of-the-art (SOTA) machine learning models.
About the Role:
We are seeking an experienced Senior ML Infrastructure Engineer to design and scale robust compute platforms for ML workflows. In this pivotal position, you will collaborate with ML engineers and researchers to optimize model serving and inference in production environments, supporting workflows like data mining, labeling, and model distillation. This is a high-impact opportunity to shape the future of AI infrastructure, contributing to the architecture, roadmap, and user experience of our ML inference services.
What You Will Be Doing:
Design and implement essential backend software components for our platform.
Work closely with ML engineers and researchers to identify critical workflows and translate them into platform requirements, delivering incremental value.
Lead technical decision-making regarding model serving strategies, orchestration, caching, model versioning, and auto-scaling mechanisms.
Develop monitoring, observability, and metrics systems to ensure the reliability, performance, and optimization of inference services.
Research and integrate state-of-the-art model serving frameworks, hardware accelerators, and distributed computing methods.
Drive large-scale technical initiatives within the ML ecosystem.
Elevate engineering standards through technical leadership and establishing best practices.
Contribute to open-source projects and represent our organization in relevant communities.
Minimum Requirements:
8+ years of industry experience focused on machine learning systems or high-performance backend services.
Advanced proficiency in Go, Python, C++, or similar programming languages.
Expertise in ML inference and model serving frameworks (such as Triton, Ray Serve, vLLM, etc.).
Strong communication skills with a proven track record of driving cross-functional initiatives.
Experience with cloud platforms like GCP, Azure, or AWS.
Adept at thriving in a dynamic, multi-tasking environment with shifting priorities.
Preferred Qualifications:
Hands-on experience developing ML infrastructure platforms for model serving and inference.
Experience designing interfaces, APIs, and clients for ML workflows.
Familiarity with the Ray framework and/or vLLM.
Experience with distributed systems and large-scale data processing.
Knowledge of telemetry and feedback loops for product improvements.
Experience in optimizing hardware (GPUs) for inference workloads.
Active contributions to open-source ML serving frameworks.
The compensation range for this position is $195,000 to $295,000 (dependent on factors including but not limited to client requirements, experience, statutory considerations, and location).
*Note: Disclosure as required by the Equal Pay for Equal Work Act (CO), NYC Pay Transparency Law, and sb5761 (WA).
Synergis is proud to be an Equal Opportunity Employer. We value diversity and do not discriminate based on race, color, ethnicity, national origin, religion, age, gender, gender identity, political affiliation, sexual orientation, marital status, disability, military/veteran status, or any other status protected by applicable law.
For consideration, please forward your resume to dwicks@synergishr.com.
If you require assistance or accommodations in the application or employment process, please contact us at dwicks@synergishr.com.
Qualified applicants with arrest or conviction records will be considered for employment in accordance with applicable state and local laws.
Synergis is a workforce solutions partner serving businesses and job seekers nationwide, helping enhance their IT ecosystems for growth and innovation.