Logo
Jobs via Dice

Data Scientist VI

Jobs via Dice, San Diego, California, United States, 92189

Save Job

Join to apply for the

Data Scientist VI

role at

Jobs via Dice

4 days ago Be among the first 25 applicants

Join to apply for the

Data Scientist VI

role at

Jobs via Dice

Get AI-powered advice on this job and more exclusive features.

Job Summary This senior individual contributor is primarily responsible for overseeing the design and development of data pipelines and automation for data acquisition and ingestion of raw data from multiple data sources and data formats. This role is also responsible for leading and overseeing the development of detailed problem statements outlining hypotheses and their effect on target clients/customers, serving as a lead expert in the analysis and investigation of complex data sets, overseeing the selection, manipulation and transformation of data into features used in machine learning algorithms, training statistical models, directing the deployment and maintenance of reliable and efficient models through production, ensuring and leading the verification of model performance, and building and maintaining partnerships with internal and external stakeholders across domains to develop and deliver statistical driven outcomes.

Essential Responsibilities

Promotes learning in others by communicating information and providing advice to drive projects forward; builds collaborative, cross-functional relationships. Solicits and acts on performance feedback; provides actionable feedback to others, including upward feedback to leadership; influences, mentors, and coaches team members. Practices self-leadership; creates, evaluates, and responds to the strengths and weaknesses of self and unit or team members. Leads the adaptation to competing demands and new responsibilities; adapts to and learns from change, challenges, and feedback. Fosters open dialogue amongst team members.

Drives the execution of multiple work streams by identifying member and operational needs; translates business strategy into actionable business requirements; develops and updates new procedures and policies. Gains cross-functional support for objectives and priorities; determines and carries out processes and methodologies; solves highly complex issues; escalates and resolves issues as appropriate; sets standards and measures progress. Develops work plans to meet business priorities and deadlines; coordinates, obtains and distributes resources. Removes obstacles that impact performance; guides performance and develops contingency plans accordingly; influences the completion of project tasks by others.

Leads and oversees the development of detailed problem statements outlining hypotheses and their effect on target clients/customers by ensuring comprehensive and accurate definitions of scope, objectives, outcome statements and metrics.

Oversees the design and development of data pipelines and automation for data acquisition and ingestion of raw data from multiple data sources and data formats by driving the transformation, cleansing, and storing of data for consumption by downstream processes; overseeing the development and optimization of diverse and complex SQL queries; and demonstrating advanced expertise of database fundamentals.

Serves as a lead expert in the analysis and investigation of complex data sets by ensuring optimum data visualization methods are employed; determining and communicating how best to manipulate data sources to discover patterns, spot anomalies, test hypotheses, and/or check assumptions; and reviewing and verifying summaries of key dataset characteristics.

Oversees the selection, manipulation, and transformation of data into features used in machine learning algorithms by leveraging and demonstrating advanced expertise in techniques to conduct dimensionality reduction, feature importance, and feature selection.

Trains statistical models by selecting and leveraging algorithms and data mining techniques; overseeing model testing; ensuring the proper use of various algorithms to assess the input dataset and related features; and ensuring appropriate techniques are used to prevent overfitting such as cross-validation.

Directs the deployment and maintenance of reliable and efficient models through production.

Ensures and leads the verification of model performance by demonstrating advanced expertise in the practice of a variety of model validation techniques to assess and discriminate the goodness of model fit; and leveraging feedback and output to direct and strengthen model performance.

Builds and maintains partnerships with internal and external stakeholders across domains to develop and deliver statistical driven outcomes by generating and delivering insights and values from heterogeneous data to investigate complex problems for multiple use cases; driving informed decision-making; and presenting findings to both technical and non-technical senior leadership.

Minimum Qualifications

Minimum three (3) years experience working with Exploratory Data Analysis (EDA) and visualization methods.

Minimum seven (7) years machine learning and/or algorithmic experience.

Minimum seven (7) years statistical analysis and modeling experience.

Minimum seven (7) years programming experience.

Minimum five (5) years experience in a leadership role with or without direct reports.

Bachelors degree in Mathematics, Statistics, Computer Science, Engineering, Economics, Public Health, or related field AND Minimum ten (10) years experience in data science or a directly related field. Additional equivalent work experience in a directly related field may be substituted for the degree requirement. Advanced degrees may be substituted for the work experience requirements.

Additional Requirements

Knowledge, Skills, and Abilities (KSAs): Strategic Thinking; Advanced Quantitative Data Modeling; Algorithms; Applied Data Analysis; Data Extraction; Data Visualization Tools; Deep Learning/Neural Networks; Machine Learning; Relational Database Management; Project Management; Microsoft Excel; Design Thinking; Business Intelligence Tools; Data Manipulation/Wrangling; Data Ensemble Techniques; Feature Analysis/Engineering; Open Source Languages & Tools; Model Optimization; Data Architecture; Data Engineering.

Preferred Qualifications

Doctorate degree in Mathematics, Statistics, Computer Science, Engineering, Economics, Public Health, or related field.

Six (6) years healthcare experience.

Five (5) years experience working with SQL.

Ten (10) years experience working with Open Source Tools (e.g., R, Python).

Four (4) years experience with Huggingface transformers.

At least three (3) years experience fine-tuning language models.

Five (5) years experience working with Docker.

Five (5) years experience developing data science pipelines with workflow orchestration frameworks (e.g., Airflow, Flyte, Kubeflow).

Primary Location:

California, San Diego, El Camino Real Administration

Scheduled Weekly Hours:

40

Shift:

Day

Workdays:

Mon, Tue, Wed, Thu, Fri

Working Hours Start:

08:00 AM

Working Hours End:

04:30 PM

Job Schedule:

Full-time

Job Type:

Standard

Worker Location:

Remote

Employee Status:

Regular

Employee Group/Union Affiliation:

NUE-SCAL-01|NUE|Non Union Employee

Job Level:

Individual Contributor

Department:

Parsons West Annex - Prj Mgmt-Innovtn Proj-Qlty&Svc - 0806

Pay Range:

$182,100 - $235,620 / year Kaiser Permanente is committed to pay equity and transparency. The posted pay range is based on possible base salaries for the role and does not include the value of our total rewards package. Actual pay determined at offer will be based on years of relevant work experience, education, certifications, skills and geographic location along with a review of current employees in similar roles to ensure that pay equity is achieved and maintained across Kaiser Permanente.

Travel:

Yes, 10 % of the Time Kaiser Permanente is an equal opportunity employer committed to fair, respectful, and inclusive workplaces. Applicants will be considered for employment without regard to race, religion, sex, age, national origin, disability, veteran status, or any other protected characteristic or status.

Kaiser Permanente is an equal opportunity employer committed to fair, respectful, and inclusive workplaces.

Consistently supports compliance and the Principles of Responsibility (Kaiser Permanente's Code of Conduct) by maintaining the privacy and confidentiality of information, protecting the assets of the organization, acting with ethics and integrity, reporting non-compliance, and adhering to applicable federal, state, and local laws and regulations, accreditation, and licensure requirements (where applicable), and Kaiser Permanente's policies and procedures.

Models and reinforces ethical behavior in self and others in accordance with the Principles of Responsibility, adheres to organizational policies and guidelines; supports compliance initiatives; maintains confidences; admits mistakes; conducts business with honesty, shows consistency in words and actions; follows through on commitments.

Job duties with at least occasional or possible access to: (1) patients, the general public, or other employees; (2) confidential protected health information and other confidential KP information (including employee, proprietary, financial or trade secret information); (3) KP property and assets, for example, electronic assets, medical instruments, or devices; (4) controlled substances regulated by federal law or potentially subject to diversion.

Note:

This position is currently active and accepting applications.

#J-18808-Ljbffr