Jobright.ai
AI Evaluation Data Scientist - Health, Mid Level
Jobright.ai, Cupertino, California, United States, 95014
Jobright is an AI-powered career platform that helps job seekers discover top opportunities in the US. We are NOT a staffing agency. Jobright does not hire directly for these positions. We connect you with verified openings from employers you can trust.

Job Summary:
Apple is a leading technology company focused on health technologies that support users in living healthier lives. The AI Evaluation Data Scientist on the Health team will develop and validate evaluation methodologies for Generative AI systems, design human annotation frameworks, and conduct statistical analyses to enhance the quality of health products.

Responsibilities:
• Design and analyze human evaluations of AI systems to create reliable annotation frameworks, and ensure the validity and reliability of measurements of latent constructs
• Develop and refine benchmarks and evaluation protocols, using statistical modeling, test theory, and task design to capture model performance across diverse contexts and user needs
• Conduct statistical analysis of evaluation data to extract meaningful insights, identify systematic issues, and inform improvements to both models and evaluation processes
• Analyze model behavior, identify weaknesses, and drive design decisions with failure analysis. Examples include, but are not limited to: model experimentation, adversarial testing, counterfactual analysis, and creating tools to assess model behavior and user impact
• Collaborate with engineers to translate evaluation methods and analysis techniques into scalable, adaptable, and reliable solutions that can be reused across different features, use cases, and evaluation workflows
• Work cross-functionally with designers, clinical experts, and engineering teams across Hardware and Software to apply methods to real-world applications
• Independently run and analyze experiments that drive real improvements

Qualifications:

Required:
• Bachelor's degree (or equivalent experience) in an empirical field with emphasis on quantitative methodologies of human behavior, including HCI, Psychometrics, Quantitative or Experimental Psychology, Educational Measurement, Language Assessment, or a relevant field
• Proficiency in Python, with the ability to write clean, performant code and collaborate using standard software development practices (e.g. Git)
• Strong statistical analysis skills and experience in crafting experiments and validating data quality and model performance
• Experience in building and extending data and inference pipelines to process large-scale datasets

Preferred:
• MS and a minimum of 3 years of relevant industry experience, or PhD in relevant fields
• Real-world experience with LLM-based evaluation systems, human annotation, and human evaluation methodologies
• Experience in rigorous, evidence-based approaches to test development, e.g. quantitative and qualitative test design, reliability and validity analysis
• Customer-focused mindset with experience or strong interest in building consumer digital health and wellness products
• Strong communication skills and ability to work cross-functionally with technical and non-technical stakeholders

Company:
Apple is a technology company that designs, manufactures, and markets consumer electronics, personal computers, and software. Founded in 1976 and headquartered in Cupertino, California, USA, Apple is a public company with 10001+ employees. Apple has a track record of offering H1B sponsorships.
Seniority level: Mid-Senior level
Employment type: Full-time
Industries: Software Development
Benefits (inferred from the description for this job): Medical insurance, Vision insurance, 401(k)