Amazon Web Services (AWS)
Machine Learning Engineer, Model Customization, Generative AI Innovation Center
Amazon Web Services (AWS), New York, New York, us, 10261
Overview
Machine Learning Engineer, Model Customization, Generative AI Innovation Center – AWS
The Generative AI Innovation Center at AWS empowers customers to harness state of the art AI technologies for transformative business opportunities. Our multidisciplinary team collaborates with customers across industries to fine-tune and deploy customized generative AI applications at scale, and to optimize AI models for Amazon Silicon. As an SDE on our team, you will drive the development of custom Large Language Models (LLMs) across languages, domains, and modalities. You will be responsible for fine-tuning state-of-the-art LLMs for diverse use cases while optimizing models for high-performance deployment on AWS’s custom AI accelerators. This role offers the opportunity to innovate at the forefront of AI, tackling end-to-end LLM training pipelines at massive scale and delivering next-generation AI solutions for top AWS clients. Key responsibilities
Large-Scale Training Pipelines: Design and implement distributed training pipelines for LLMs using tools such as Fully Sharded Data Parallel (FSDP) and DeepSpeed, ensuring scalability and efficiency LLM Customization & Fine-Tuning: Adapt LLMs for new languages, domains, and vision applications through continued pre-training, fine-tuning, and Reinforcement Learning with Human Feedback (RLHF) Model Optimization on AWS Silicon: Optimize AI models for deployment on AWS Inferentia and Trainium, leveraging the AWS Neuron SDK and developing custom kernels for enhanced performance Customer Collaboration: Interact with enterprise customers and foundational model providers to understand their business and technical challenges, co-developing tailored generative AI solutions Basic Qualifications
3+ years of non-internship professional software development experience 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience Bachelor\'s degree in computer science or equivalent Hands-on experience with deep learning and/or machine learning methods (e.g. for training, fine tuning, and inference) Hands-on experience with generative AI technology Preferred Qualifications
3+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience 1+ years of programming with at least one software programming language experience 1+ years of experience hands-on experience with developing, deploying, or optimizing machine learning models using a recognized ML library or framework EEO & Accommodation Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner. Compensation & Benefits Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $129,300/year in our lowest geographic market up to $223,600/year in our highest geographic market. Pay is based on factors including market location and job-related knowledge, skills, and experience. Amazon is a total compensation company. Depending on the position offered, equity, sign-on payments, and other forms of compensation may be provided as part of a total package. For more information, please visit https://www.aboutamazon.com/workplace/employee-benefits. This position will remain posted until filled. Applicants should apply via our internal or external career site. Company – Amazon Web Services, Inc. | Job ID: A3081156
#J-18808-Ljbffr
The Generative AI Innovation Center at AWS empowers customers to harness state of the art AI technologies for transformative business opportunities. Our multidisciplinary team collaborates with customers across industries to fine-tune and deploy customized generative AI applications at scale, and to optimize AI models for Amazon Silicon. As an SDE on our team, you will drive the development of custom Large Language Models (LLMs) across languages, domains, and modalities. You will be responsible for fine-tuning state-of-the-art LLMs for diverse use cases while optimizing models for high-performance deployment on AWS’s custom AI accelerators. This role offers the opportunity to innovate at the forefront of AI, tackling end-to-end LLM training pipelines at massive scale and delivering next-generation AI solutions for top AWS clients. Key responsibilities
Large-Scale Training Pipelines: Design and implement distributed training pipelines for LLMs using tools such as Fully Sharded Data Parallel (FSDP) and DeepSpeed, ensuring scalability and efficiency LLM Customization & Fine-Tuning: Adapt LLMs for new languages, domains, and vision applications through continued pre-training, fine-tuning, and Reinforcement Learning with Human Feedback (RLHF) Model Optimization on AWS Silicon: Optimize AI models for deployment on AWS Inferentia and Trainium, leveraging the AWS Neuron SDK and developing custom kernels for enhanced performance Customer Collaboration: Interact with enterprise customers and foundational model providers to understand their business and technical challenges, co-developing tailored generative AI solutions Basic Qualifications
3+ years of non-internship professional software development experience 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience Bachelor\'s degree in computer science or equivalent Hands-on experience with deep learning and/or machine learning methods (e.g. for training, fine tuning, and inference) Hands-on experience with generative AI technology Preferred Qualifications
3+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience 1+ years of programming with at least one software programming language experience 1+ years of experience hands-on experience with developing, deploying, or optimizing machine learning models using a recognized ML library or framework EEO & Accommodation Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner. Compensation & Benefits Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $129,300/year in our lowest geographic market up to $223,600/year in our highest geographic market. Pay is based on factors including market location and job-related knowledge, skills, and experience. Amazon is a total compensation company. Depending on the position offered, equity, sign-on payments, and other forms of compensation may be provided as part of a total package. For more information, please visit https://www.aboutamazon.com/workplace/employee-benefits. This position will remain posted until filled. Applicants should apply via our internal or external career site. Company – Amazon Web Services, Inc. | Job ID: A3081156
#J-18808-Ljbffr