Logo
Novartis Group Companies

Clinical Data Engineering Lead

Novartis Group Companies, Cambridge, Massachusetts, us, 02140

Save Job

Job Description Summary About the role:

#LI:Onsite

Are you ready to make a lasting impact by building the future of oncology research? Novartis Biomedical Research (NBR) is searching for a visionary Associate Director to lead Clinical Data Engineering within our Oncology Data Science (OncDS) team. In this pivotal role, you'll be at the forefront of shaping early clinical development by building innovative biomarker data infrastructure, championing translational research, unlocking AI-powered discoveries, and raising the bar for operational excellence in biomarker data and multimodal analytics across Novartis' oncology trials.

Job Description Key responsibilities: Define and implement the clinical data engineering roadmap in alignment with Novartis' data and digital strategy, collaborating with SMEs and OncDS leadership.

Integrate advanced tools and AI/ML-ready infrastructure to support predictive modeling, multimodal analytics, and real-world data applications.

Align clinical and pre-clinical data engineering initiatives with the broader oncology strategy.

Lead, manage, and develop a high-performing clinical data engineering team, fostering collaboration and growth.

Drive strategic initiatives and partnerships across a matrixed organization.

Oversee data ingestion, transformation, and validation processes for clinical trial data, ensuring compliance with GCP/GxP, CDISC, and SOPs.

Collaborate with CROs and internal teams to optimize data flow, versioning, and retention policies.

Build and optimize data pipelines for both structured and unstructured clinical data to enable advanced analytics and informed decision-making.

Deploy scalable solutions for data harmonization, metadata management, and interoperability across platforms such as Foundry, Domino, Snowflake, and POSIT Connect.

Develop and manage applications and visualization tools, contributing to novel data products that support clinical decision-making and enable AI-driven initiatives in oncology trials.

Essential Requirements: This position will be located at the Cambridge, MA site and will not have the ability to be located remotely. This position will require 0-3% travel as defined by the business (domestic and/ or international).

Master's degree in computer science, Bioinformatics, Data Engineering, Software Engineering or a closely related discipline; PhD preferred.

Minimum 10 years of hands-on experience architecting and managing clinical data engineering, data management, and bioinformatics solutions in pharmaceutical or biotechnology industry.

Demonstrated expertise in designing, implementing, and scaling data infrastructure to support clinical development-including Artificial Intelligence (AI) / Machine Learning (ML) -driven analytics and multimodal data integration.

Proven ability to define, document, and operationalize end-to-end assay data generation and processing pipelines, with a focus on automation, orchestration, and compliance.

Extensive experience with oncology clinical trials, including regulatory-compliant management of clinical biomarker data and application of data standards (e.g., Clinical Data Interchange Standards Consortium [CDISC], Study Data Tabulation Model [SDTM], Analysis Data Model [ADaM]).

Deep familiarity with FAIR (Findable, Accessible, Interoperable, Reusable) data principles, data harmonization, and enterprise data governance frameworks.

Strong leadership in technical teams, with advanced communication and stakeholder management skills.

Desirable requirements: Extensive experience leading cross-functional data science initiatives in oncology, including translational science, biomarker analysis, real-world data, and exploratory clinical research; proven expertise with NGS technologies, and modern bioinformatics tools.

Advanced proficiency in cloud-native architectures, data lakes, and visualization frameworks (e.g., RShiny, Dash, Spotfire); strong programming and engineering skills (R, Python, Java, shell scripting, Linux, HPC), with a deep understanding of GxP, Agile methodologies, AI/ML operations, and architecting/managing AI agents in large clinical data environments.

The salary for this position is expected to range between $176,400 and $327,600 per year.

The final salary offered is determined based on factors like, but not limited to, relevant skills and experience, and upon joining Novartis will be reviewed periodically. Novartis may change the published salary range based on company and market factors.

Your compensation will include a performance-based cash incentive and, depending on the level of the role, eligibility to be considered for annual equity awards.

US-based eligible employees will receive a comprehensive benefits package that includes health, life and disability benefits, a 401(k) with company contribution and match, and a variety of other benefits. In addition, employees are eligible for a generous time off package including vacation, personal days, holidays and other leaves.

EEO Statement:

The Novartis Group of Companies are Equal Opportunity Employers. We do not discriminate in recruitment, hiring, training, promotion or other employment practices for reasons of race, color, religion, sex, national origin, age, sexual orientation, gender identity or expression, marital or veteran status, disability, or any other legally protected status.

Accessibility and reasonable accommodations

The Novartis Group of Companies are committed to working with and providing reasonable accommodation to individuals with disabilities. If, because of a medical condition or disability, you need a reasonable accommodation for any part of the application process, or to perform the essential functions of a position, please send an e-mail to

or call

and let us know the nature of your request and your contact information. Please include the job requisition number in your message.

Salary Range $176,400.00 - $327,600.00

Skills Desired Apache Spark, Artificial Intelligence (AI), Big Data, Data Governance, Data Literacy, Data Management, Data Quality, Data Science, Data Strategy, Data Visualization, Machine Learning (Ml), Master Data Management, Python (Programming Language), R (Programming Language), Statistical Analysis