Puertoricoindex

Principal Data Scientist - Data Services & Solutions

Puertoricoindex, San Juan, San Juan, US, 00902

Our client, a leading global top Fortune company, is offering a unique opportunity to join its Global Shared Services operation in Puerto Rico. This start-up operation offers a once-in-a-lifetime professional opportunity to gain global experience and exposure while working from Puerto Rico.

Be part of the brain and heart of this multinational operation's mission of delivering solutions through cutting-edge technology to its worldwide client base. Contact Careers to join an exciting and attractive global company without needing to relocate from Puerto Rico.

The Principal Data Scientist is the most senior individual contributor position within this Global Data Operation Organization. This position leads the scientific design and continuous improvement of global data pipelines and analytics models for Global Data Services & Solutions.

This role is responsible for the end-to-end data lifecycle. The Principal Data Scientist ensures the performance and operational excellence of the company's data products and AI systems, including data lifecycle management, automation, and optimization.

Responsibilities

Serve as the scientific authority for data-driven decision intelligence within Global Puerto Rico Data Services & Solutions.

Develop and execute the enterprise AI and advanced analytics strategy.

Design and implement predictive, prescriptive, and generative AI models that improve global supply chain efficiency.

Partner with data engineers to develop data pipeline workloads and automated monitoring using Databricks, Apache Spark, and/or Delta Lake.

Drive innovation by researching and evaluating emerging AI techniques.

Own the architecture and optimization of data ingestion, transformation, and loading pipelines, ensuring the highest standards.

Define and execute standards for data modeling, store management, tracing and governance.

Collaborate with Data Governance to ensure full compliance with regulations and standards (e.g., GDPR, FDA 21 CFR Part 11, SOC 2, ISO 27001).

Integrate explainable AI (XAI) frameworks such as LIME and SHAP into model designs.

Requirements

Master's degree required in Data Science, Computer Science, Applied Mathematics, or a related quantitative field

PhD preferred

8+ years of experience in data science, data engineering, or applied AI

Core Platforms & Tools: Databricks (Delta Lake, MLflow), Apache Spark, Azure Synapse Analytics, and Kubernetes for AI workloads

Programming languages: Python, SQL, Scala, Java

Databases: PostgreSQL, MySQL, SQL Server, NoSQL (MongoDB, Cosmos DB)

Graph technologies: Neo4j and RDF triple store modeling

AI Frameworks: TensorFlow, PyTorch, scikit-learn, Keras, XGBoost, Hugging Face Transformers

Data Visualization & BI: Power BI, Tableau, Plotly, or equivalent

Data Governance & Observability: Microsoft Purview, Collibra, or Alation for lineage; Datadog, Grafana, OpenTelemetry, Dynatrace, or equivalents for monitoring

Cloud Platforms: Azure, AWS, and multicloud data architectures

English proficiency required
