Logo
Dechen Consulting

Machine Learning Engineering Engineer

Dechen Consulting, Dearborn, Michigan, United States, 48120

Save Job

About Dechen Consulting Group (DCG)

Dechen Consulting Group (DCG) is a rapidly expanding, innovative IT Professional Services and Management Consulting company with a track record of more than twenty-five years in delivering skilled professionals to our clients across diverse sectors.

Job Opportunity in Dearborn, MI

We are currently offering a W2 contract opportunity in Dearborn, MI, with the potential to extend over multiple years and the chance to transition to a direct hire position with our client. We provide healthcare, vacation, relocation assistance, and visa sponsorship/transfer. This is a W2 position, not C2C. THIRD PARTIES NEED NOT APPLY. This role offers excellent prospects for career progression!

Position Description

Employees in this role are responsible for designing, developing, and deploying cutting-edge Generative AI solutions, with a particular emphasis on Retrieval-Augmented Generation (RAG) systems. This involves leveraging various AI techniques, including vector databases and robust API development frameworks like FastAPI, and ensuring efficient deployment through containerization and MLOps practices, to build intelligent applications that enhance user experience and automate complex processes.

Skills Required GCP - Mid Level Big Data - Entry level Artificial Intelligence & Expert Systems - Entry Level API: Mid level Skills Preferred

Google Cloud Platform Experience Required

3 years of experience in software engineering with a focus on Generative AI, Machine Learning, or related AI fields. Experience Preferred

Experience deploying AI/ML models into production environments at scale. Previous experience in a large enterprise or fast-paced technology environment. Education Required

Bachelor's Degree Education Preferred

Certification Program Additional Information

POSITION IS HYBRID

Design, develop, and implement Generative AI models and applications, specifically focusing on building and optimizing RAG systems, including the integration and management of vector databases, using various technology stacks, with a preference for the OpenAI SDK. Apply fundamental Machine Learning concepts, including model fine-tuning, to improve the performance and accuracy of AI solutions, and deploy them via efficient APIs, such as those built with FastAPI, utilizing containerization for consistent environments. Perform data engineering tasks to prepare, process, and manage data pipelines essential for training, evaluating, and deploying Generative AI models, including data ingestion for vector databases, ensuring data quality and accessibility. Utilize advanced prompt engineering techniques to optimize interactions with large language models and achieve desired outputs, and expose these capabilities through well-designed APIs. Collaborate with cross-functional teams to integrate AI solutions into existing products and services, ensuring scalability, reliability, and maintainability on cloud platforms, particularly Google Cloud Platform (GCP), adhering to MLOps principles and continuous integration/continuous deployment (CI/CD) practices.

We are a people-focused company with a deep emphasis on family values and look forward to working with you.