CloudTech Innovations
Join to apply for the
Data Scientist
role at
CloudTech Innovations Join to apply for the
Data Scientist
role at
CloudTech Innovations Job Description
Job Description
Job Title:
Data Scientist – Machine Learning, Big Data, GenAI (8–10 Years Experience)
Location:
Remote
Employment Type:
Contract
About The Role
We are seeking a highly experienced
Data Scientist
with 8–10 years of expertise delivering
production-grade AI/ML solutions
at scale. This role requires deep technical proficiency in
Machine Learning, Big Data, Generative AI, Large Language Models (LLMs), and Retrieval-Augmented Generation (RAG) , combined with
hands-on cloud experience
(AWS, Azure, or GCP) and
migration expertise
for modernizing data and AI platforms.
The ideal candidate can lead projects end-to-end, from architecture design to deployment, while mentoring teams, optimizing for performance and cost, and ensuring alignment with business objectives.
Key Responsibilities
Design, develop, and deliver end-to-end ML/AI solutions in cloud-native environments from design to deployment and monitoring. Architect and implement Generative AI solutions leveraging LLMs (e.g., GPT, LLaMA, Claude, Mistral) and RAG pipelines with vector search. Build and optimize Big Data pipelines using Apache Spark, PySpark, and Delta Lake integrated with cloud storage (AWS S3, Azure Data Lake, GCP Cloud Storage). Design and maintain data lakehouse architectures with Databricks, Snowflake, or Delta Lake. Deploy scalable MLOps pipelines using MLflow, SageMaker, Azure ML, or Vertex AI with Docker, Kubernetes (EKS, AKS, GKE), and CI/CD. Implement and manage vector databases (Pinecone, FAISS, Milvus, Weaviate, ChromaDB) for RAG applications. Oversee ETL/ELT workflows and pipeline orchestration using Airflow, dbt, or Azure Data Factory. Migration projects, on-prem to cloud, cross-cloud, or legacy platform upgrades (e.g., Hadoop to Databricks, Hive to Delta Lake) , ensuring data integrity and minimal downtime. Integrate streaming data solutions using Apache Kafka and real-time analytics frameworks. Conduct feature engineering, hyperparameter tuning, and model optimization for performance and scalability. Mentor junior data scientists and guide best practices for AI/ML development and deployment. Collaborate with product, engineering, and executive teams to align AI solutions with business KPIs and compliance requirements.
Required Skills & Experience
8–10 years in data science, machine learning, and AI/ML solution delivery. Strong hands-on expertise in at least one major cloud platform (AWS, Azure, or GCP) with proven production deployments. Proficiency in Python, PySpark, and SQL. Proven experience with Apache Spark, Hadoop ecosystem, and Big Data processing. Hands-on experience with Generative AI, Hugging Face Transformers, LangChain, or LlamaIndex. Expertise in RAG architectures and vector databases (Pinecone, FAISS, Milvus, Weaviate, ChromaDB). Experience with MLOps workflows using MLflow, Docker, Kubernetes, and CI/CD tools (Jenkins, GitHub Actions, GitLab CI). Migration experience involving AI/ML workloads, big data pipelines, and data platforms to modern cloud-based architectures. Knowledge of data services (AWS S3, Redshift; Azure Synapse; GCP BigQuery) and infrastructure-as-code (Terraform, CloudFormation, ARM templates). Familiarity with streaming technologies (Kafka) and query engines (Hive, Presto, Trino). Strong foundation in statistics, probability, and ML algorithms.
Preferred Qualifications
Experience with knowledge graphs and semantic search. Background in NLP, transformer architectures, and deep learning frameworks (TensorFlow, PyTorch). Exposure to BI tools (Power BI, Tableau, Looker). Domain expertise in finance, healthcare, or e-commerce.
Seniority level
Seniority level Mid-Senior level Employment type
Employment type Full-time Job function
Job function Engineering and Information Technology Industries IT Services and IT Consulting Referrals increase your chances of interviewing at CloudTech Innovations by 2x Sign in to set job alerts for “Data Scientist” roles.
Boston, MA $111,800.00-$175,670.00 6 days ago Boston, MA $119,000.00-$169,000.00 2 months ago Boston, MA $90,000.00-$130,000.00 3 weeks ago Machine Learning (ML) Applications Engineer- Chemical/Process Engineering
Boston, MA $119,000.00-$169,000.00 2 months ago Research Scientist, Machine Learning for Human-Machine Interactions
Boston, MA $72,000.00-$108,000.00 3 days ago Product Data Scientist, Education Data Science
Cambridge, MA $156,000.00-$229,000.00 2 weeks ago Machine Learning Scientist, Open-Endedness (Level Flexible)
Boston, MA $130,000.00-$170,000.00 1 month ago Boston, MA $150,000.00-$220,000.00 2 months ago Machine Learning Scientist, LLM Training & Inference Research
Business Data Scientist, Cloud Learning Services
Cambridge, MA $166,000.00-$244,000.00 3 weeks ago Amazon Robotics - Data Scientist (New Grad), Amazon Robotics, Software Research and Science
Associate Data Scientist - Fraud Analytics
We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
#J-18808-Ljbffr
Data Scientist
role at
CloudTech Innovations Join to apply for the
Data Scientist
role at
CloudTech Innovations Job Description
Job Description
Job Title:
Data Scientist – Machine Learning, Big Data, GenAI (8–10 Years Experience)
Location:
Remote
Employment Type:
Contract
About The Role
We are seeking a highly experienced
Data Scientist
with 8–10 years of expertise delivering
production-grade AI/ML solutions
at scale. This role requires deep technical proficiency in
Machine Learning, Big Data, Generative AI, Large Language Models (LLMs), and Retrieval-Augmented Generation (RAG) , combined with
hands-on cloud experience
(AWS, Azure, or GCP) and
migration expertise
for modernizing data and AI platforms.
The ideal candidate can lead projects end-to-end, from architecture design to deployment, while mentoring teams, optimizing for performance and cost, and ensuring alignment with business objectives.
Key Responsibilities
Design, develop, and deliver end-to-end ML/AI solutions in cloud-native environments from design to deployment and monitoring. Architect and implement Generative AI solutions leveraging LLMs (e.g., GPT, LLaMA, Claude, Mistral) and RAG pipelines with vector search. Build and optimize Big Data pipelines using Apache Spark, PySpark, and Delta Lake integrated with cloud storage (AWS S3, Azure Data Lake, GCP Cloud Storage). Design and maintain data lakehouse architectures with Databricks, Snowflake, or Delta Lake. Deploy scalable MLOps pipelines using MLflow, SageMaker, Azure ML, or Vertex AI with Docker, Kubernetes (EKS, AKS, GKE), and CI/CD. Implement and manage vector databases (Pinecone, FAISS, Milvus, Weaviate, ChromaDB) for RAG applications. Oversee ETL/ELT workflows and pipeline orchestration using Airflow, dbt, or Azure Data Factory. Migration projects, on-prem to cloud, cross-cloud, or legacy platform upgrades (e.g., Hadoop to Databricks, Hive to Delta Lake) , ensuring data integrity and minimal downtime. Integrate streaming data solutions using Apache Kafka and real-time analytics frameworks. Conduct feature engineering, hyperparameter tuning, and model optimization for performance and scalability. Mentor junior data scientists and guide best practices for AI/ML development and deployment. Collaborate with product, engineering, and executive teams to align AI solutions with business KPIs and compliance requirements.
Required Skills & Experience
8–10 years in data science, machine learning, and AI/ML solution delivery. Strong hands-on expertise in at least one major cloud platform (AWS, Azure, or GCP) with proven production deployments. Proficiency in Python, PySpark, and SQL. Proven experience with Apache Spark, Hadoop ecosystem, and Big Data processing. Hands-on experience with Generative AI, Hugging Face Transformers, LangChain, or LlamaIndex. Expertise in RAG architectures and vector databases (Pinecone, FAISS, Milvus, Weaviate, ChromaDB). Experience with MLOps workflows using MLflow, Docker, Kubernetes, and CI/CD tools (Jenkins, GitHub Actions, GitLab CI). Migration experience involving AI/ML workloads, big data pipelines, and data platforms to modern cloud-based architectures. Knowledge of data services (AWS S3, Redshift; Azure Synapse; GCP BigQuery) and infrastructure-as-code (Terraform, CloudFormation, ARM templates). Familiarity with streaming technologies (Kafka) and query engines (Hive, Presto, Trino). Strong foundation in statistics, probability, and ML algorithms.
Preferred Qualifications
Experience with knowledge graphs and semantic search. Background in NLP, transformer architectures, and deep learning frameworks (TensorFlow, PyTorch). Exposure to BI tools (Power BI, Tableau, Looker). Domain expertise in finance, healthcare, or e-commerce.
Seniority level
Seniority level Mid-Senior level Employment type
Employment type Full-time Job function
Job function Engineering and Information Technology Industries IT Services and IT Consulting Referrals increase your chances of interviewing at CloudTech Innovations by 2x Sign in to set job alerts for “Data Scientist” roles.
Boston, MA $111,800.00-$175,670.00 6 days ago Boston, MA $119,000.00-$169,000.00 2 months ago Boston, MA $90,000.00-$130,000.00 3 weeks ago Machine Learning (ML) Applications Engineer- Chemical/Process Engineering
Boston, MA $119,000.00-$169,000.00 2 months ago Research Scientist, Machine Learning for Human-Machine Interactions
Boston, MA $72,000.00-$108,000.00 3 days ago Product Data Scientist, Education Data Science
Cambridge, MA $156,000.00-$229,000.00 2 weeks ago Machine Learning Scientist, Open-Endedness (Level Flexible)
Boston, MA $130,000.00-$170,000.00 1 month ago Boston, MA $150,000.00-$220,000.00 2 months ago Machine Learning Scientist, LLM Training & Inference Research
Business Data Scientist, Cloud Learning Services
Cambridge, MA $166,000.00-$244,000.00 3 weeks ago Amazon Robotics - Data Scientist (New Grad), Amazon Robotics, Software Research and Science
Associate Data Scientist - Fraud Analytics
We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
#J-18808-Ljbffr