Genesis10

AI/ML Engineer - LLM & Model

Genesis10, Charlotte, North Carolina, United States, 28245

Overview

Genesis10 is currently seeking an AI/ML Engineer – LLM & Model with our client in the financial industry located in Charlotte, NC. This is a 12+ month contract position.

Responsibilities

- Develop, optimize, and deploy cutting-edge AI models in production environments, leveraging GPUs and modern container platforms
- Design, develop, and deploy solutions using LLMs (LLaMA 3 or other major LLM frameworks) for enterprise-scale applications
- Implement retrieval-augmented generation (RAG) pipelines to enhance LLM capabilities
- Work with vector databases (Redis and others) for efficient embedding storage and retrieval
- Write and optimize SQL queries for data preprocessing and analysis
- Develop, maintain, and optimize Python code for AI/ML workflows
- Manage and monitor NVIDIA Triton Inference Server deployments for high-performance model inference
- Utilize Unix/Linux skills for system operations, automation, and troubleshooting
- Leverage GPU acceleration and work with containerized environments (Docker, OpenShift) for scalable deployments
- Implement CI/CD pipelines for AI/ML solutions using XLR (XL Release) and Datical for automated deployments
- Work in an Agile environment, collaborating with cross-functional teams including data scientists, DevOps engineers, and product managers

Requirements

- Deep expertise in Large Language Models (LLMs), model deployment, and high-performance computing environments
- Strong skills in Python, RAG (retrieval-augmented generation) frameworks, vector databases, and CI/CD pipeline automation
- Proven experience with LLaMA 3 or other major LLMs (OpenAI GPT, Claude, Mistral, etc.)
- Strong understanding of RAG techniques and vector database architectures
- Proficiency in Python
- Solid knowledge of SQL and relational database concepts
- Experience deploying models using NVIDIA Triton Inference Server
- Familiarity with GPU hardware optimization for AI workloads
- Strong Unix/Linux command-line and scripting skills
- Hands-on experience with Docker, OpenShift, and container orchestration
- Experience with Agile methodologies
- Familiarity with XLR and Datical for CI/CD pipeline management

Desired skills

- Experience in large-scale AI production environments
- Background in optimizing inference latency and throughput
- Knowledge of cloud AI services and infrastructure
- Secondary experience with Java

Pay Range: $55.06-$63.06

Only candidates available and ready to work directly as Genesis10 employees will be considered for this position. If you have the described qualifications and are interested in this exciting opportunity, please apply!

About Genesis10

Genesis10 has been ranked a Top Staffing Firm in the U.S. by Staffing Industry Analysts for six consecutive years. Genesis10 puts thousands of consultants and employees to work across the United States every year in contract, contract-for-hire, and permanent placement roles. With more than 300 active clients, Genesis10 provides access to many Fortune 100 firms and a variety of mid-market organizations across the full spectrum of industry verticals.

Genesis10 is an Equal Opportunity Employer. Candidates will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.
