Strategic Staffing Solutions
Generative AI Engineer
Strategic Staffing Solutions, Charlotte, North Carolina, United States, 28245
Job Title:
Generative AI Engineer Location:
Charlotte, NC Pay:
[Full-Time / Contract]
Job Overview:
We are seeking a
Generative AI Engineer
with deep technical expertise across Generative AI, MLOps, and scalable distributed systems. In this role, you'll lead the design, development, deployment, and optimization of AI/ML solutions powered by LLMs, embedding models, and Retrieval-Augmented Generation (RAG) frameworks. You'll also be instrumental in driving production-grade MLOps workflows, managing big data pipelines, and integrating cloud-native tools across AWS, GCP, and Azure.
You'll work closely with cross-functional teams and mentor engineers to deliver transformative AI solutions for enterprise environments, including the Microsoft ecosystem.
Key Responsibilities: Design and implement
Generative AI solutions
using LLMs, vector databases, embedding models, vector search, and RAG techniques. Build and maintain robust
MLOps pipelines
for model training, testing, and deployment using
AWS SageMaker ,
Ray , and modern CI/CD practices. Engineer
distributed data pipelines
and streaming systems using
Apache Spark ,
Kafka ,
Hadoop ,
HBase , and
Cassandra . Apply
machine learning
and
deep learning
frameworks such as
Scikit-learn ,
TensorFlow ,
Keras , and
Spark MLlib . Conduct advanced
NLP
tasks using
spaCy ,
nltk , and embedding strategies. Analyze large datasets using the
Python data ecosystem
(Pandas, Scikit-learn, etc.). Optimize performance of distributed systems and applications in
Python ,
Java , and
Scala . Manage and scale
data stores : vector stores (e.g.,
Milvus ,
MongoDB Atlas ), NoSQL (e.g.,
Redis ,
Cassandra ), and SQL (e.g.,
Postgres ,
MySQL ). Utilize DevOps and infrastructure tools:
Docker ,
Kubernetes ,
Ansible ,
Terraform ,
Linux , and
Git . Monitor and tune the performance of AI models, applications, and distributed systems. Lead, mentor, and inspire high-performing technical teams. Collaborate on AI integrations with
Microsoft Copilot Studio ,
Power Platform , and
Dynamics 365 . Required Qualifications:
7+ years of experience in AI/ML, data engineering, or distributed systems. Proven experience with
LLMs ,
RAG architectures , and
vector databases . Hands-on expertise with
MLOps
tools, especially
SageMaker
and
Ray . Strong programming skills in
Python ,
Java ,
Scala , and
R . Solid understanding of
cloud platforms : AWS (preferred), GCP, and Azure. Demonstrated success leading engineering teams or AI initiatives. Proficient with modern
DevOps
toolchains and best practices. Preferred Qualifications:
Experience with enterprise AI integrations using
Microsoft Copilot Studio ,
Power Platform , or
Dynamics 365 . Background in building real-time AI solutions and observability tooling. Contributions to open-source AI projects or research in Generative AI.
Generative AI Engineer Location:
Charlotte, NC Pay:
[Full-Time / Contract]
Job Overview:
We are seeking a
Generative AI Engineer
with deep technical expertise across Generative AI, MLOps, and scalable distributed systems. In this role, you'll lead the design, development, deployment, and optimization of AI/ML solutions powered by LLMs, embedding models, and Retrieval-Augmented Generation (RAG) frameworks. You'll also be instrumental in driving production-grade MLOps workflows, managing big data pipelines, and integrating cloud-native tools across AWS, GCP, and Azure.
You'll work closely with cross-functional teams and mentor engineers to deliver transformative AI solutions for enterprise environments, including the Microsoft ecosystem.
Key Responsibilities: Design and implement
Generative AI solutions
using LLMs, vector databases, embedding models, vector search, and RAG techniques. Build and maintain robust
MLOps pipelines
for model training, testing, and deployment using
AWS SageMaker ,
Ray , and modern CI/CD practices. Engineer
distributed data pipelines
and streaming systems using
Apache Spark ,
Kafka ,
Hadoop ,
HBase , and
Cassandra . Apply
machine learning
and
deep learning
frameworks such as
Scikit-learn ,
TensorFlow ,
Keras , and
Spark MLlib . Conduct advanced
NLP
tasks using
spaCy ,
nltk , and embedding strategies. Analyze large datasets using the
Python data ecosystem
(Pandas, Scikit-learn, etc.). Optimize performance of distributed systems and applications in
Python ,
Java , and
Scala . Manage and scale
data stores : vector stores (e.g.,
Milvus ,
MongoDB Atlas ), NoSQL (e.g.,
Redis ,
Cassandra ), and SQL (e.g.,
Postgres ,
MySQL ). Utilize DevOps and infrastructure tools:
Docker ,
Kubernetes ,
Ansible ,
Terraform ,
Linux , and
Git . Monitor and tune the performance of AI models, applications, and distributed systems. Lead, mentor, and inspire high-performing technical teams. Collaborate on AI integrations with
Microsoft Copilot Studio ,
Power Platform , and
Dynamics 365 . Required Qualifications:
7+ years of experience in AI/ML, data engineering, or distributed systems. Proven experience with
LLMs ,
RAG architectures , and
vector databases . Hands-on expertise with
MLOps
tools, especially
SageMaker
and
Ray . Strong programming skills in
Python ,
Java ,
Scala , and
R . Solid understanding of
cloud platforms : AWS (preferred), GCP, and Azure. Demonstrated success leading engineering teams or AI initiatives. Proficient with modern
DevOps
toolchains and best practices. Preferred Qualifications:
Experience with enterprise AI integrations using
Microsoft Copilot Studio ,
Power Platform , or
Dynamics 365 . Background in building real-time AI solutions and observability tooling. Contributions to open-source AI projects or research in Generative AI.