Logo
CloudTech Innovations

Data Scientist

CloudTech Innovations, WorkFromHome

Save Job

Join to apply for the Data Scientist role at CloudTech Innovations

Join to apply for the Data Scientist role at CloudTech Innovations

Job Description

Job Description
Job Title: Data Scientist – Machine Learning, Big Data, GenAI (8–10 Years Experience)
Location: Remote
Employment Type: Contract
About The Role
We are seeking a highly experienced Data Scientist with 8–10 years of expertise delivering production-grade AI/ML solutions at scale. This role requires deep technical proficiency in Machine Learning, Big Data, Generative AI, Large Language Models (LLMs), and Retrieval-Augmented Generation (RAG) , combined with hands-on cloud experience (AWS, Azure, or GCP) and migration expertise for modernizing data and AI platforms.
The ideal candidate can lead projects end-to-end, from architecture design to deployment, while mentoring teams, optimizing for performance and cost, and ensuring alignment with business objectives.
Key Responsibilities

  • Design, develop, and deliver end-to-end ML/AI solutions in cloud-native environments from design to deployment and monitoring.
  • Architect and implement Generative AI solutions leveraging LLMs (e.g., GPT, LLaMA, Claude, Mistral) and RAG pipelines with vector search.
  • Build and optimize Big Data pipelines using Apache Spark, PySpark, and Delta Lake integrated with cloud storage (AWS S3, Azure Data Lake, GCP Cloud Storage).
  • Design and maintain data lakehouse architectures with Databricks, Snowflake, or Delta Lake.
  • Deploy scalable MLOps pipelines using MLflow, SageMaker, Azure ML, or Vertex AI with Docker, Kubernetes (EKS, AKS, GKE), and CI/CD.
  • Implement and manage vector databases (Pinecone, FAISS, Milvus, Weaviate, ChromaDB) for RAG applications.
  • Oversee ETL/ELT workflows and pipeline orchestration using Airflow, dbt, or Azure Data Factory.
  • Migration projects, on-prem to cloud, cross-cloud, or legacy platform upgrades (e.g., Hadoop to Databricks, Hive to Delta Lake) , ensuring data integrity and minimal downtime.
  • Integrate streaming data solutions using Apache Kafka and real-time analytics frameworks.
  • Conduct feature engineering, hyperparameter tuning, and model optimization for performance and scalability.
  • Mentor junior data scientists and guide best practices for AI/ML development and deployment.
  • Collaborate with product, engineering, and executive teams to align AI solutions with business KPIs and compliance requirements.
Required Skills & Experience
  • 8–10 years in data science, machine learning, and AI/ML solution delivery.
  • Strong hands-on expertise in at least one major cloud platform (AWS, Azure, or GCP) with proven production deployments.
  • Proficiency in Python, PySpark, and SQL.
  • Proven experience with Apache Spark, Hadoop ecosystem, and Big Data processing.
  • Hands-on experience with Generative AI, Hugging Face Transformers, LangChain, or LlamaIndex.
  • Expertise in RAG architectures and vector databases (Pinecone, FAISS, Milvus, Weaviate, ChromaDB).
  • Experience with MLOps workflows using MLflow, Docker, Kubernetes, and CI/CD tools (Jenkins, GitHub Actions, GitLab CI).
  • Migration experience involving AI/ML workloads, big data pipelines, and data platforms to modern cloud-based architectures.
  • Knowledge of data services (AWS S3, Redshift; Azure Synapse; GCP BigQuery) and infrastructure-as-code (Terraform, CloudFormation, ARM templates).
  • Familiarity with streaming technologies (Kafka) and query engines (Hive, Presto, Trino).
  • Strong foundation in statistics, probability, and ML algorithms.
Preferred Qualifications
  • Experience with knowledge graphs and semantic search.
  • Background in NLP, transformer architectures, and deep learning frameworks (TensorFlow, PyTorch).
  • Exposure to BI tools (Power BI, Tableau, Looker).
  • Domain expertise in finance, healthcare, or e-commerce.

Seniority level

  • Seniority level

    Mid-Senior level

Employment type

  • Employment type

    Full-time

Job function

  • Job function

    Engineering and Information Technology
  • Industries

    IT Services and IT Consulting

Referrals increase your chances of interviewing at CloudTech Innovations by 2x

Sign in to set job alerts for “Data Scientist” roles.

Boston, MA $111,800.00-$175,670.00 6 days ago

Boston, MA $119,000.00-$169,000.00 2 months ago

Boston, MA $90,000.00-$130,000.00 3 weeks ago

Machine Learning (ML) Applications Engineer- Chemical/Process Engineering

Boston, MA $119,000.00-$169,000.00 2 months ago

Research Scientist, Machine Learning for Human-Machine Interactions

Boston, MA $72,000.00-$108,000.00 3 days ago

Product Data Scientist, Education Data Science

Cambridge, MA $156,000.00-$229,000.00 2 weeks ago

Machine Learning Scientist, Open-Endedness (Level Flexible)

Boston, MA $130,000.00-$170,000.00 1 month ago

Boston, MA $150,000.00-$220,000.00 2 months ago

Machine Learning Scientist, LLM Training & Inference Research

Business Data Scientist, Cloud Learning Services

Cambridge, MA $166,000.00-$244,000.00 3 weeks ago

Amazon Robotics - Data Scientist (New Grad), Amazon Robotics, Software Research and Science

Associate Data Scientist - Fraud Analytics

We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

#J-18808-Ljbffr