Logo
Amazon

Software Engineer II - AI/ML at AWS Neuron

Amazon, Arlington, Virginia, United States, 22201

Save Job

Join the AWS Neuron team as a Software Engineer II, where you will be at the forefront of advancing the complete software stack for AWS's cloud-scale machine learning accelerators, including Inferentia and Trainium. As part of the Machine Learning Applications (ML Apps) team, you will work on the development, performance tuning, and enablement of cutting-edge machine learning models, including state-of-the-art generative AI models and large language models like Llama-4 and Deepseeq. In this role, you will collaborate closely with chip architects, compiler engineers, and runtime engineers to optimize both the performance and accuracy of advanced models. Your work will involve automating machine learning techniques to evaluate and resolve accuracy issues that arise during model migration to AI accelerators. This position provides a unique opportunity to develop a sophisticated AI toolchain aimed at enhancing model performance and ease of use. Key Responsibilities: Lead efforts to integrate distributed inference support into PyTorch using XLA and the Neuron compiler and runtime stacks. Identify optimization opportunities through benchmarking and comparative analysis of alternative solutions. Build automated solutions to ensure accuracy of AI accelerators while enhancing their performance. Create deep AI toolchains that abstract and simplify interaction with low-level AI accelerator modules. A Day in the Life: At AWS, we prioritize work-life balance, encouraging you to create a harmonious flow between work and personal life. Our team values flexibility in working hours and supports you in finding a balance that suits your lifestyle. Mentorship & Career Growth: We are committed to fostering the growth of new team members through knowledge sharing and mentorship. With a diverse mix of experience within the team, you will receive guidance from senior engineers through one-on-one mentoring and comprehensive code reviews. We actively assign projects tailored to help you develop your engineering skills and prepare for more complex challenges in the future. Basic Qualifications: 3+ years programming experience in a modern language like Java, C++, or C#, with a strong background in object-oriented design. 3+ years in leading design or architecture for new and existing systems, focusing on design patterns, reliability, and scalability. 3+ years of experience in the full software development lifecycle, including coding standards, code reviews, source control management, build processes, testing, and operations. Foundational knowledge of machine learning and deep learning models, their architectures, training, and inference lifecycles, along with optimization experience for model execution. Preferred Qualifications: Bachelor's degree in computer science or a related field. Our commitment to inclusivity ensures that all qualified applicants will receive equal opportunity during the hiring process. If you have a disability and require accommodations, please inform us during your application process. The compensation for this position ranges from $129,300 to $223,600, depending on experience and location. This role will be open until filled, and we encourage all interested candidates to apply.