Logo
University of Southern California

Machine Learning (ML) Ops Engineer - IS Clinical Research - Full Time 8 Hour Day

University of Southern California, Glendale, California, us, 91222

Save Job

Overview Machine Learning (ML) Ops Engineer - IS Clinical Research - Full Time 8 Hour Days (Exempt) (Non-Union) Keck Medicine of USC, Hospital

Is this your next job Read the full description below to find out, and do not hesitate to make an application. Los Angeles, California Under the direction of Information Services Leadership, the incumbent will be responsible for the full lifecycle management of machine learning models, including design, build, and maintenance of machine learning models. The MLOps Engineer will play an integral role in implementing artificial intelligence solutions across Keck Medicine of USC. The incumbent will partner with data scientists, data team members, and clinical operations to deploy, monitor, and maintain machine learning solutions that will improve patient care, support operational excellence, and advance clinical research. The incumbent will ensure seamless integration, automation, and scaling of AI solutions within the existing infrastructure by leveraging DevOps expertise. They will maintain and continuously improve MLOps pipelines for monitoring, versioning, and deploying models in production environments. The incumbent will be responsible for the end-to-end lifecycle management of artificial intelligence solutions and comes with DevOps experience, ensuring seamless integration, deployment, and automation of systems. The MLOps Engineer will implement best practices for testing, debugging, and performance monitoring of AI systems to ensure reliability and scalability.

Essential Duties

Design, build and maintain production-grade machine learning models, with real-time inference, scalability, and reliability. Develop end-to-end scalable ML infrastructure using cloud platforms, such as AWS, GCP, or Microsoft Azure. Develop AI pipelines for various data processing needs, including data ingestion, pre-processing, and search and retrieval, ensuring solutions meet all technical and business requirements. Monitor model performance for data drift and concept drift detection, automate retraining processes where necessary to maintain model accuracy and relevance. Collaborate with data scientists, data engineers, analytics teams, and DevOps teams to design and implement robust deployment pipelines for continuous improvement of machine learning models. Implement and optimize CI/CD pipelines for machine learning models, automating testing and deployment processes. Configure and manage monitoring and logging solutions to track model performance, system health, and anomalies, enabling timely intervention and proactive maintenance. Implement version control systems for machine learning models, parameters, results and associated code to track changes and facilitate collaboration. Ensure all machine learning systems meet security and compliance standards, including data protection and privacy regulations. Lead engineering efforts in creating and implementing methods and workflows for ML/GenAI model engineering, LLM advancements, and optimizing deployment frameworks while aligning with business strategic directions. Maintain clear and comprehensive documentation of MLOps processes and configuration. Strong communication and collaboration skills, to collaborate cross-functionally and align on deployment strategies and technical requirements Other duties as assigned.

Required Qualifications

Bachelor's Degree in computer science, engineering or closely related field Proven experience with: Artificial intelligence and machine learning platforms (e.g., AWS, Azure or GCP). Containerization technologies (e.g., Docker) or container orchestration platforms (e.g., Kubernetes). CI/CD tools (e.g., Github Actions). Programming languages and frameworks (e.g., Python, R, SQL). MLOps engineering principles, agile methodologies, and DevOps lifecycle management. Technical writing and documentation for AI/ML models and processes. Healthcare data and machine learning use cases. Ability to solve complex problems through troubleshooting Deep understanding of coding, architecture, and deployment processes Strong analytical skills with the ability to collect, organize, analyze, and disseminate significant amounts of information with attention to detail and accuracy Excellent organizational skills and attention to detail Self-starter with the ability to solution when requirements are vague or ambiguous

Preferred Qualifications

Master's degree in computer science, engineering or closely related field

Required Licenses/Certifications

Fire Life Safety Training (LA City). If no card upon hire, one must be obtained within 30 days of hire and maintained by renewal before expiration date. (Required within LA City only)

Salary: The annual base salary range for this position is approximately $145,600.00 - $240,240.00. USC may consider factors such as scope and responsibilities, candidate experience and education, and other relevant criteria when extending offers.

#J-18808-Ljbffr