Logo
Highmark Health

Lead Data Scientist - Research and Development - Graph Intelligence

Highmark Health, Austin, Texas, us, 78716

Save Job

Overview

Company :

Highmark Health Job Title:

Lead Data Scientist, Research & Development, Graph Intelligence Are you an architect of interconnected data, driven by the belief that relationships hold the key to uncovering society's most complex challenges? Highmark Health is seeking a groundbreaking Lead Data Scientist, Research & Development, specializing in Graph Intelligence, who will not just work with graph data, but will define the future of how we harness relational insights in healthcare. This is a premier R&D position where you will lead the charge in inventing and proving out transformative graph-native analytical solutions. Your core mission is to push the boundaries of what's possible in advanced analytics by pioneering novel methodologies that leverage network science, knowledge graphs, and Graph Machine Learning (GML) to solve critical problems across the healthcare continuum. From personalizing patient care pathways to detecting complex fraud rings and understanding population health dynamics, your work will directly impact millions of lives. As our Lead Graph Intelligence specialist, you will be the spearhead of cutting-edge research projects. This means deeply engaging with graph theory, building and enriching large-scale knowledge graphs, and developing next-generation Graph Neural Networks (GNNs), graph convolutional networks (GCNs), graph attention networks (GATs), and other advanced network algorithms. You'll architect unique graph embeddings, perform sophisticated link prediction, community detection, and anomaly detection on complex healthcare data. Your responsibilities will include designing rigorous experiments, building robust proof-of-concept models, and meticulously evaluating the performance and interpretability of these novel graph algorithms to ensure their real-world applicability. You are not merely a data scientist; you are a relational data innovator with a strategic mindset. You inherently understand that the explicit modeling of entities and their relationships unlocks a deeper layer of intelligence that traditional tabular or sequential data models cannot. You will proactively identify opportunities to construct and leverage comprehensive healthcare knowledge graphs, integrating diverse patient, provider, claims, and clinical data to uncover hidden patterns, propagate insights through networks, and develop groundbreaking analytical solutions that exploit the rich, multi-modal structures within healthcare. Leveraging your profound expertise in graph databases (e.g., Neo4j, ArangoDB, Amazon Neptune, Ontotext GraphDB), distributed graph processing frameworks (e.g., Apache Spark GraphX, Dask-Graph), and leading GML libraries (e.g., PyTorch Geometric, DGL, Spektral), you will conduct in-depth research, construct sophisticated predictive, prescriptive, and diagnostic models directly on graph structures. You will drive initiatives from theoretical concept to validated, scalable prototypes. You are a vigilant scout of the graph AI landscape, continuously scanning, rigorously evaluating, and championing the adoption of emerging graph platforms, algorithms, and tools. Furthermore, you will actively foster collaborations with leading academic institutions, healthcare research experts, and the broader graph community. Your contributions will extend to publishing seminal research findings in top-tier conferences and leading the dialogue on the transformative power of graph intelligence in healthcare.

Responsibilities

Work directly with the business to understand their business processes and aims, then identify how analytical solutions could help deliver value for them. This would include being accountable for:

Outlining complex new use cases + creating high level impact estimates.

Identifying data elements needed and where to get them (including proxies).

Assembling data sets independently using knowledge of Highmark operational and analytic data structures.

Delivering the analytical solution to several complex business problems simultaneously.

Documenting objectives, assumptions and processes and enriching/expanding our standards as needed

Select and apply the appropriate advanced modeling/machine learning techniques to these data sets to deliver business insight, ensuring that the final analysis is well researched, accurate, and documented. This requires: Proficiency of a substantial number of advanced analytical techniques and mastery of a few, evidenced by in-depth knowledge and delivery record (for example regression models, tree-based learning, neural networks, clustering techniques, natural language processing)

Consult with the business to contextualize and translate the results of our analysis in a form which the business can understand and act upon. This will include: Written reports, presentation and data visualizations, and draws clear lines between the high-level problem specifications for a broad range of audiences, the analyses performed, and how the results link directly back to business objectives, and lead implementation which drives frontline workflow.

Plan, prepare and deliver/coordinate all elements of several analyses largely independently in such a way that it is delivered on time, to a high standard and ready to implement on a production basis (including dissemination through the Organization's user systems). This includes identifying the best route to implementation (developing the analytical solution accordingly).

Expertise and in-depth understanding of subject, be the face of major projects within ED&A, external presence/earned credibility (conferences, white papers, local/national associations); mentoring/teaching others

Other duties as assigned or requested.

Education & Experience

Education Required Master's degree in Analytics, Mathematics, Physics, Computer and Information Science, Engineering Technology or related field

OR

Bachelor's Degree + 3 years of relevant work experience in lieu of a Master's Degree Preferred Doctoral degree (Ph.D.) in Analytics, Mathematics, Physics, Computer and Information Science, Engineering Technology, or a related field. Experience Required 5 years of Data Science 3 years Data Science (if PhD Education) Preferred Deep Expertise in Graph Theory & Network Science:

Comprehensive understanding of fundamental graph algorithms (centrality, community detection, pathfinding, clustering), knowledge graph principles, and network analysis techniques for complex systems. Advanced Graph Machine Learning (GML):

Proven experience designing, implementing, and optimizing various Graph Neural Network (GNN) architectures, graph convolutional networks, and other graph-specific machine learning models for tasks like node classification, link prediction, and anomaly detection in graphs. Knowledge Graph Engineering:

Hands-on experience in the entire lifecycle of knowledge graphs, including schema design (ontologies, RDF, OWL), data ingestion, graph construction, data cleaning, entity resolution, and advanced graph querying (e.g., Cypher, SPARQL). Graph Database & Platform Experience:

Practical experience with one or more leading graph databases (e.g., Neo4j, Google Spanner Graph) and distributed graph processing frameworks. GML Libraries & Frameworks:

Strong command of specialized GML libraries like PyTorch Geometric (PyG), Deep Graph Library (DGL), Spektral, or StellarGraph. Cloud Platform & MLOps:

Experience deploying and managing ML models, particularly GML pipelines, in cloud environments (e.g., AWS, Azure, GCP) and familiarity with MLOps principles for research projects. Research & Publication Acumen:

A track record of contributing to cutting-edge research, including peer-reviewed publications in top-tier conferences or demonstrated experience in driving novel solution development from concept to prototype. Healthcare Data Familiarity:

Understanding of healthcare data domains (claims, clinical, EMR) and related ontologies or standards (e.g., SNOMED CT, ICD) is a significant plus. Experimental Design & Rigor:

Demonstrated ability to design robust experiments, rigorously evaluate model performance, interpret complex results, and contribute to the scientific understanding of graph-based solutions. Licenses or Certifications Required None Preferred None Skills Analysis of business problems/needs

Analytical and Logical Reasoning/Thinking

Collaborative Problem Solving

Data Analysis with SQL, BigQuery

Statistical Analysis with Python, R

Written & Oral Presentation Skills

Basic proto-typing/front end skills

Travel 0% - 25% Compliance Disclaimer: This position adheres to the ethical and legal standards and behavioral expectations as set forth in the code of business conduct and company policies, and HIPAA/privacy requirements. Pay Range Minimum: $108,000.00 Maximum: $201,800.00 Base pay is determined by a variety of factors including a candidate’s qualifications, experience, and expected contributions, as well as internal peer equity, market, and business considerations. The displayed salary range does not reflect any geographic differential Highmark may apply for certain locations based upon comparative markets. Highmark Health and its affiliates prohibit discrimination against qualified individuals based on protected characteristics and provide contact information for accessibility accommodations as found on the original posting. Req ID: J267786

#J-18808-Ljbffr