Logo
Amazon

Machine Learning Compiler Engineer II, AWS Neuron

Amazon, Seattle, Washington, us, 98127

Save Job

Do you want to be part of the AI revolution? At AWS, our vision is to democratize access to AI hardware and software infrastructure, making deep learning accessible for everyday developers. To achieve this vision, we have created innovative software and hardware solutions. One of these is AWS Neuron, the SDK designed to optimize the performance of complex ML models executed on AWS Inferentia and Trainium, our custom chips for accelerating deep-learning workloads. This role focuses on developing the next generation of the Neuron compiler, which transforms ML models from various frameworks (e.g., PyTorch, TensorFlow, and JAX) for deployment on AWS Inferentia and Trainium servers in the cloud. Your responsibilities will include: Solving complex compiler optimization problems to maximize performance for a range of ML models, including large language models, vision transformers, and multi-model architectures. Understanding the internal workings of these models to inform compiler decisions for optimal instruction generation. Collaborating with internal and external stakeholders to ensure our solutions meet user needs and help drive product adoption. Participating in pre-silicon design and launching new products/features to market. Key Responsibilities: Design, implement, test, and maintain innovative software solutions aimed at improving the performance and user experience of the Neuron compiler. Collaborate with chip architects, runtime/OS engineers, scientists, and ML application teams to deploy advanced ML models efficiently on AWS accelerators. Engage with open-source projects to enhance optimization techniques for complex ML workloads. Develop features that enhance the user's experience and reduce difficulties faced by developers globally. About the Team: Our team is committed to hosting a supportive environment for new members. We cherish knowledge-sharing, mentorship, and learning from each other. Senior team members provide generous mentoring and insightful code reviews. We’re focused on your career growth, and every project will enhance your skills and empower you to tackle more complex tasks in the future. Qualifications: We encourage you to apply if you meet the following qualifications: 3+ years of professional software development experience. 2+ years of design or architecture experience in creating and maintaining systems. Proficiency in one or more object-oriented programming languages such as C++/Java. Preferred: Master’s degree or PhD in Computer Science or a related technical field, and familiarity with compiler design and various ML technologies (e.g., PyTorch, JAX). AWS values diverse experiences and encourages all qualified candidates to apply, regardless of their background. Our compensation reflects labor costs across several US geographic markets. The base pay for this position ranges from $129,300 to $223,600 per year, depending on location and candidate experience. This position is located at our AWS offices. For more information on AWS and our employee benefits, please visit our workplace resources.