Puertoricoindex
Principal Data Scientist-Data Services & Solutions
Puertoricoindex, San Juan, San Juan, us, 00902
Our Client, a Global Leading Top Fortune Company is offering a unique opportunity to be part of a Global Share services in Puerto Rico. This unique start-up offers a lifetime professional opportunity to obtain a Global experience and exposure working from Puerto Rico.
Be part of the Brain and Heart of this Multinational Operations ‘mission of offering solutions through edge technology to their worldwide client base. Contact Careers to become part of an exciting and attractive Global Company without the need to relocate out of Puerto Rico.
The Principal Data Scientist
is the most senior individual contributor position within this Global Data Operation Organization. This position is the Leader of the scientific designand continuous improvements of global data pipelines, analytics models for Global Data Services & Solutions.
This role is responsible for end-to-end data lifecycle. The Principal Data Scientist ensures that Company’s Data Products and AI systems performance and operational excellence including data lifecycle management, automation, and optimization.
Responsibilities
Serve as the Global Puerto Rico Data Services & Solutions scientific authority for data driven decision intelligence.
Develop and execute the enterprise AI and advanced analytics strategy
Design and implement predictive, prescriptive and generative AI models that improve a Global supply chain efficiency
Partnering with data engineers to develop data pipelines workloads and automated monitoring using Databricks, Apache Spark, and /or Delta Lake.
Drive innovations by researching and evaluating emerging AI techniques.
Owner of the Architect and optimization of data ingestion, transformation and loading of pipelines ensuring the highest standards
Define and execute standards for data modeling, store management, tracing and governance.
Collaborate with Data Governance to ensure full compliance with regulations (i.e. GDPR, FDA 21, CFR Part 11, SOC2, ISO 27001)
Integrate AI frameworks (XAI, LIME, SHARP) into model designs
Requirements
Master Degree required in Data Science, Computer Science, Applied Mathematics or a related quantitative field
PHD preferred
8 plus years of experience in data science, data engineering or applied AI
Core platforms & Tools: Databricks (Delta Lake, MLflow), Apache Spark, Azure Synapse Analytics and Kubernetes for AI workloads
Programming languages: Python, SQL, Scala, Java
Databases: PostgreSQL, MySQL, SQL Server, NoSQL (Mongo DB, Cosmos DB)
Graph technologies: Neo4j and RDF triple store modeling
AI frameworks: Tensor Flow, Py Torch, Scikit-Learn, Keras, XGBoost, Hugging Face transformers
Data Visualization & BI: Power BI, Tableau, Ploty or equivalent
Data Governance: Microsoft Purview, Collibra, or Alation for lineage, Datadog, Grafana, Open Telemetry, Dynatrace or equivalents
Cloud Platforms: Azure, AWS, and Multicloud data architectures
English Proficiency required
#J-18808-Ljbffr
Be part of the Brain and Heart of this Multinational Operations ‘mission of offering solutions through edge technology to their worldwide client base. Contact Careers to become part of an exciting and attractive Global Company without the need to relocate out of Puerto Rico.
The Principal Data Scientist
is the most senior individual contributor position within this Global Data Operation Organization. This position is the Leader of the scientific designand continuous improvements of global data pipelines, analytics models for Global Data Services & Solutions.
This role is responsible for end-to-end data lifecycle. The Principal Data Scientist ensures that Company’s Data Products and AI systems performance and operational excellence including data lifecycle management, automation, and optimization.
Responsibilities
Serve as the Global Puerto Rico Data Services & Solutions scientific authority for data driven decision intelligence.
Develop and execute the enterprise AI and advanced analytics strategy
Design and implement predictive, prescriptive and generative AI models that improve a Global supply chain efficiency
Partnering with data engineers to develop data pipelines workloads and automated monitoring using Databricks, Apache Spark, and /or Delta Lake.
Drive innovations by researching and evaluating emerging AI techniques.
Owner of the Architect and optimization of data ingestion, transformation and loading of pipelines ensuring the highest standards
Define and execute standards for data modeling, store management, tracing and governance.
Collaborate with Data Governance to ensure full compliance with regulations (i.e. GDPR, FDA 21, CFR Part 11, SOC2, ISO 27001)
Integrate AI frameworks (XAI, LIME, SHARP) into model designs
Requirements
Master Degree required in Data Science, Computer Science, Applied Mathematics or a related quantitative field
PHD preferred
8 plus years of experience in data science, data engineering or applied AI
Core platforms & Tools: Databricks (Delta Lake, MLflow), Apache Spark, Azure Synapse Analytics and Kubernetes for AI workloads
Programming languages: Python, SQL, Scala, Java
Databases: PostgreSQL, MySQL, SQL Server, NoSQL (Mongo DB, Cosmos DB)
Graph technologies: Neo4j and RDF triple store modeling
AI frameworks: Tensor Flow, Py Torch, Scikit-Learn, Keras, XGBoost, Hugging Face transformers
Data Visualization & BI: Power BI, Tableau, Ploty or equivalent
Data Governance: Microsoft Purview, Collibra, or Alation for lineage, Datadog, Grafana, Open Telemetry, Dynatrace or equivalents
Cloud Platforms: Azure, AWS, and Multicloud data architectures
English Proficiency required
#J-18808-Ljbffr