Rulesiq

Data Scientist

Rulesiq, Germantown, Ohio, United States

Role Summary

We are seeking a highly skilled and versatile

Data Scientist/Machine Learning Engineer

to join our team. The ideal candidate will have a strong foundation in machine learning, data science, and software engineering, coupled with the ability to design and implement end-to-end systems. This role involves working with cutting-edge technologies, including LLMs, recommendation models, and NLP, while leveraging big data engineering, system design, and cloud infrastructure to deliver impactful solutions. Key Responsibilities

Machine Learning & Data Science

Develop and deploy machine learning models for various use cases such as recommendation systems, propensity scoring, and NLP. Design, train, and fine-tune large language models (LLMs) and integrate them into production workflows. Conduct exploratory data analysis (EDA), feature engineering, and statistical modeling to derive actionable insights.

Big Data Engineering

Build and optimize data pipelines and workflows for large-scale data processing using tools like Apache Spark, EMR, or similar. Collaborate with the data engineering team to ensure data integrity, scalability, and efficiency.

System Design & Development

Architect and implement end-to-end ML systems, from data ingestion to model deployment and monitoring. Develop robust and scalable APIs for model integration and data access. Ensure seamless integration with backend systems (MongoDB) and cloud infrastructure (AWS).

Infrastructure & DevOps

Containerize applications and ML models using Docker, ensuring portability and consistency across environments. Orchestrate and manage deployments using Kubernetes. Monitor and optimize system performance, ensuring high availability and reliability.

Cloud Computing & Database Management

Utilize AWS services such as S3, Lambda, SageMaker, and ECS for building and deploying solutions. Design efficient and scalable data storage solutions using MongoDB and related tools.

Collaboration & Communication

Work closely with cross-functional teams, including data engineers, software developers, and product managers. Translate business requirements into technical solutions.

Requirements

Technical Skills

Proficiency in Python, with expertise in libraries such as NumPy, Pandas, Scikit-learn, TensorFlow, PyTorch, and Hugging Face Transformers. Strong understanding of machine learning algorithms, deep learning architectures, and NLP techniques. Hands-on experience with recommendation systems, propensity scoring, and statistical methods. Knowledge of big data tools (e.g., Spark, Hadoop) and stream processing. Solid experience with API development and integration. Expertise in Docker, Kubernetes, and CI/CD practices. Familiarity with AWS services and cloud-native architectures. Analytical & Design Skills

Strong grasp of data science concepts, including predictive modeling, clustering, and classification. Experience with LLM fine-tuning and deployment for NLP applications. Sound understanding of system design principles and infrastructure best practices. Education & Experience

Bachelor’s or Master’s degree in Computer Science, Data Science, or a related field. 3+ years of professional experience in machine learning engineering or data science roles. Previous experience in building and deploying end-to-end ML pipelines in production environments. Nice-to-Have Skills

Experience with MongoDB Atlas and serverless architectures. Knowledge of MLOps tools and practices for productionizing ML models. Familiarity with monitoring and observability tools (e.g., Prometheus, Grafana).

#J-18808-Ljbffr