Clarivate
Join to apply for the
Senior Data Scientist (NLP)
role at
Clarivate Join to apply for the
Senior Data Scientist (NLP)
role at
Clarivate Get AI-powered advice on this job and more exclusive features. We are seeking a Senior Data Scientist specializing in Natural Language Processing (NLP) and modern retrieval-augmented generation (RAG) architectures to join our Life Sciences & Health (LS&H) team. This is an amazing opportunity to work on large-scale AI-enabled solutions that modernize and enhance our content delivery systems. Youll be at the intersection of innovation, architecture, and real-world AI integration. The team consists of several domain and technical experts and reports to the VP of AI, Content. We would love to speak with you if you have deep expertise across text processing pipelines including indexing, vectorization, prompting, fine-tuning, summarization and context management and bring hands-on experience with frameworks like LangChain and LangGraph. Familiarity with architectures such as VRAG and GraphRAG is highly desirable.
About You Experience, Education, Skills, And Accomplishments
Bachelors degree in Computer Science, Data Science, Computational Linguistics, or a related field At least 5 years of hands-on experience in data science, focused on natural language processing (NLP) At least 5 years of experience using Python, with expertise in NLP libraries such as LangChain, LangGraph, or other Lang-based toolkits Proven experience in model development and applying machine learning techniques to real-world problems
It Would Be Great If You Also Had
Expertise in retrieval-based LLM workflows (RAG, VRAG, GraphRAG) Deep understanding of embedding models, semantic search, and vector stores (e.g., FAISS, Pinecone) Experience with document loaders and text splitters/document splitting strategies Familiarity with MLOps practices and production-level deployment of AI pipelines Experience with cloud platforms (e.g., AWS, Azure, or GCP) Experience applying Graph Neural Networks (GNNs) to retrieval-enhanced generation Knowledge of LangSmith and vector orchestration platforms Familiarity with multilingual NLP and cross-lingual embeddings Exposure to real-time knowledge graphs and stream-based RAG systems A Masters or PhD in a technical field (Computer Science, Data Science, etc.)
What will you be doing in this role?
Design NLP Workflows: Develop scalable pipelines for text ingestion, cleaning, normalization, and tokenization to support downstream applications. Implement Indexing and Vectorization Strategies: Architect and maintain robust indexing systems and vector databases for semantic search and retrieval. Develop Prompting and Finetuning Frameworks: Create reusable prompting strategies and lead fine-tuning initiatives for LLMs tailored to business-specific tasks. Build LangChain/LangGraph Applications: Construct dynamic knowledge systems and agentic workflows using LangChain and LangGraph. Integrate Advanced RAG Architectures: Apply VRAG and GraphRAG design patterns to enrich information retrieval and contextual understanding. Conduct Performance Optimization: Perform benchmark testing and model evaluations to improve accuracy, efficiency, and scalability of NLP systems. Collaborate Across Teams: Work closely with engineering, product, and research stakeholders to deliver integrated AI-driven features. Provide Technical Leadership: Mentor junior data scientists, guide best practices, and drive innovation across AI projects.
About The Team
This role sits within the Life Sciences & Healthcare (LS&H) segment under the Content Technology team. The team is focused on driving innovation through large-scale AI solutions. Youll work closely with the VP of Content Technology, Solutions Architects, and internal stakeholders who are SMEs and the end users of the platform. This role offers the chance to contribute meaningfully to cutting-edge AI projects that have visibility across leadership teams. The work is focused, impactful, and offers excellent career advancement opportunities in a fast-evolving AI space.
Hours of Work
Full-time permanent position, primarily working core business hours in your time zone, with flexibility to adjust to various global time zones as needed Fully remote position based in the US
Compensation - US Only
The expected base salary for this position is $117,000 - $147,000 USD per year.This role is eligible for bonus incentive earnings.Individual pay is based upon experience, education, skill and ability, expertise, and relevant factors.
In addition to a competitive remuneration package, you will be eligible to participate in a benefits package that includes medical, dental, prescription drug, life insurance, 401k with match, long term disability coverage, vacation, sick time, volunteer time, discount programs, and many more.
At Clarivate, we are committed to providing equal employment opportunities for all qualified persons with respect to hiring, compensation, promotion, training, and other terms, conditions, and privileges of employment. We comply with applicable laws and regulations governing non-discrimination in all locations.
Seniority level
Seniority level
Mid-Senior level Employment type
Employment type
Full-time Job function
Job function
Engineering and Information Technology Industries
Information Services Referrals increase your chances of interviewing at Clarivate by 2x Sign in to set job alerts for Senior Data Scientist roles.
Scottsdale, AZ $177,000.00-$284,000.00 3 weeks ago Senior Compliance Data Analyst, NAST S&C DAS
Tempe, AZ $42,000.00-$79,800.00 2 weeks ago Senior Compliance Data Analyst, NAST S&C DAS
Tempe, AZ $42,000.00-$79,800.00 2 weeks ago SAP - Analytics - Senior - Consulting - Location OPEN
Phoenix, AZ $102,500.00-$187,900.00 1 week ago Phoenix, AZ $165,000.00-$315,000.00 1 day ago Scottsdale, AZ $123,500.00-$212,850.00 4 months ago Scottsdale, AZ $123,500.00-$212,850.00 4 months ago Scottsdale, AZ $110,000.00-$150,000.00 2 weeks ago Scottsdale, AZ $123,500.00-$212,850.00 1 month ago Scottsdale, AZ $123,500.00-$212,850.00 1 month ago Phoenix, AZ $164,780.00-$314,960.00 4 days ago Principal Data Scientist, Marketing Analytics and Data Science (Hybrid)
Scottsdale, AZ $131,182.00-$195,709.00 2 weeks ago Screening Platform & Filter Sr Data Scientist
Sr. Data Scientist, WW Insurance & Claim
Tempe, AZ $143,300.00-$247,600.00 4 days ago Teradata ML Data Scientist With Clear Scape Analytics
Data Scientist Senior Actuary & Analytics
Phoenix, AZ $143,320.00-$257,970.00 1 week ago Director, Data Science, Consumer and Credit Fraud
Scottsdale, AZ $158,000.00-$230,000.00 14 hours ago Senior Data Scientist, Specialist Senior - SFL Scientific
Tempe, AZ $107,600.00-$198,400.00 2 weeks ago Scottsdale, AZ $158,000.00-$230,000.00 20 hours ago Tempe, AZ $80,000.00-$110,000.00 2 months ago Data Governance Senior Data Quality Analyst
Senior Data Analyst (Statistical Programming)
Were unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI. #J-18808-Ljbffr
Senior Data Scientist (NLP)
role at
Clarivate Join to apply for the
Senior Data Scientist (NLP)
role at
Clarivate Get AI-powered advice on this job and more exclusive features. We are seeking a Senior Data Scientist specializing in Natural Language Processing (NLP) and modern retrieval-augmented generation (RAG) architectures to join our Life Sciences & Health (LS&H) team. This is an amazing opportunity to work on large-scale AI-enabled solutions that modernize and enhance our content delivery systems. Youll be at the intersection of innovation, architecture, and real-world AI integration. The team consists of several domain and technical experts and reports to the VP of AI, Content. We would love to speak with you if you have deep expertise across text processing pipelines including indexing, vectorization, prompting, fine-tuning, summarization and context management and bring hands-on experience with frameworks like LangChain and LangGraph. Familiarity with architectures such as VRAG and GraphRAG is highly desirable.
About You Experience, Education, Skills, And Accomplishments
Bachelors degree in Computer Science, Data Science, Computational Linguistics, or a related field At least 5 years of hands-on experience in data science, focused on natural language processing (NLP) At least 5 years of experience using Python, with expertise in NLP libraries such as LangChain, LangGraph, or other Lang-based toolkits Proven experience in model development and applying machine learning techniques to real-world problems
It Would Be Great If You Also Had
Expertise in retrieval-based LLM workflows (RAG, VRAG, GraphRAG) Deep understanding of embedding models, semantic search, and vector stores (e.g., FAISS, Pinecone) Experience with document loaders and text splitters/document splitting strategies Familiarity with MLOps practices and production-level deployment of AI pipelines Experience with cloud platforms (e.g., AWS, Azure, or GCP) Experience applying Graph Neural Networks (GNNs) to retrieval-enhanced generation Knowledge of LangSmith and vector orchestration platforms Familiarity with multilingual NLP and cross-lingual embeddings Exposure to real-time knowledge graphs and stream-based RAG systems A Masters or PhD in a technical field (Computer Science, Data Science, etc.)
What will you be doing in this role?
Design NLP Workflows: Develop scalable pipelines for text ingestion, cleaning, normalization, and tokenization to support downstream applications. Implement Indexing and Vectorization Strategies: Architect and maintain robust indexing systems and vector databases for semantic search and retrieval. Develop Prompting and Finetuning Frameworks: Create reusable prompting strategies and lead fine-tuning initiatives for LLMs tailored to business-specific tasks. Build LangChain/LangGraph Applications: Construct dynamic knowledge systems and agentic workflows using LangChain and LangGraph. Integrate Advanced RAG Architectures: Apply VRAG and GraphRAG design patterns to enrich information retrieval and contextual understanding. Conduct Performance Optimization: Perform benchmark testing and model evaluations to improve accuracy, efficiency, and scalability of NLP systems. Collaborate Across Teams: Work closely with engineering, product, and research stakeholders to deliver integrated AI-driven features. Provide Technical Leadership: Mentor junior data scientists, guide best practices, and drive innovation across AI projects.
About The Team
This role sits within the Life Sciences & Healthcare (LS&H) segment under the Content Technology team. The team is focused on driving innovation through large-scale AI solutions. Youll work closely with the VP of Content Technology, Solutions Architects, and internal stakeholders who are SMEs and the end users of the platform. This role offers the chance to contribute meaningfully to cutting-edge AI projects that have visibility across leadership teams. The work is focused, impactful, and offers excellent career advancement opportunities in a fast-evolving AI space.
Hours of Work
Full-time permanent position, primarily working core business hours in your time zone, with flexibility to adjust to various global time zones as needed Fully remote position based in the US
Compensation - US Only
The expected base salary for this position is $117,000 - $147,000 USD per year.This role is eligible for bonus incentive earnings.Individual pay is based upon experience, education, skill and ability, expertise, and relevant factors.
In addition to a competitive remuneration package, you will be eligible to participate in a benefits package that includes medical, dental, prescription drug, life insurance, 401k with match, long term disability coverage, vacation, sick time, volunteer time, discount programs, and many more.
At Clarivate, we are committed to providing equal employment opportunities for all qualified persons with respect to hiring, compensation, promotion, training, and other terms, conditions, and privileges of employment. We comply with applicable laws and regulations governing non-discrimination in all locations.
Seniority level
Seniority level
Mid-Senior level Employment type
Employment type
Full-time Job function
Job function
Engineering and Information Technology Industries
Information Services Referrals increase your chances of interviewing at Clarivate by 2x Sign in to set job alerts for Senior Data Scientist roles.
Scottsdale, AZ $177,000.00-$284,000.00 3 weeks ago Senior Compliance Data Analyst, NAST S&C DAS
Tempe, AZ $42,000.00-$79,800.00 2 weeks ago Senior Compliance Data Analyst, NAST S&C DAS
Tempe, AZ $42,000.00-$79,800.00 2 weeks ago SAP - Analytics - Senior - Consulting - Location OPEN
Phoenix, AZ $102,500.00-$187,900.00 1 week ago Phoenix, AZ $165,000.00-$315,000.00 1 day ago Scottsdale, AZ $123,500.00-$212,850.00 4 months ago Scottsdale, AZ $123,500.00-$212,850.00 4 months ago Scottsdale, AZ $110,000.00-$150,000.00 2 weeks ago Scottsdale, AZ $123,500.00-$212,850.00 1 month ago Scottsdale, AZ $123,500.00-$212,850.00 1 month ago Phoenix, AZ $164,780.00-$314,960.00 4 days ago Principal Data Scientist, Marketing Analytics and Data Science (Hybrid)
Scottsdale, AZ $131,182.00-$195,709.00 2 weeks ago Screening Platform & Filter Sr Data Scientist
Sr. Data Scientist, WW Insurance & Claim
Tempe, AZ $143,300.00-$247,600.00 4 days ago Teradata ML Data Scientist With Clear Scape Analytics
Data Scientist Senior Actuary & Analytics
Phoenix, AZ $143,320.00-$257,970.00 1 week ago Director, Data Science, Consumer and Credit Fraud
Scottsdale, AZ $158,000.00-$230,000.00 14 hours ago Senior Data Scientist, Specialist Senior - SFL Scientific
Tempe, AZ $107,600.00-$198,400.00 2 weeks ago Scottsdale, AZ $158,000.00-$230,000.00 20 hours ago Tempe, AZ $80,000.00-$110,000.00 2 months ago Data Governance Senior Data Quality Analyst
Senior Data Analyst (Statistical Programming)
Were unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI. #J-18808-Ljbffr