Amazon
Software Engineering Manager, ML Kernel Performance, AWS Neuron, Annapurna Labs
Amazon, Cupertino, California, United States, 95014
Overview
The Annapurna Labs team at AWS builds AWS Neuron, the SDK used to accelerate deep learning and GenAI workloads on AWS Inferentia and Trainium accelerators. The Acceleration Kernel Library team focuses on maximizing performance for AWS custom ML accelerators by crafting high‑performance kernels for ML functions across hardware and software boundaries. The Neuron SDK provides a compiler, runtime, and application framework that integrates with popular ML frameworks like PyTorch to boost inference and training performance.
As part of the Neuron Compiler organization, this role works across frameworks, compilers, runtime, and collectives to optimize performance and help shape future architectures. You will work at the intersection of machine learning, high‑performance computing, and distributed architectures, contributing to cutting‑edge AI acceleration technology.
Responsibilities
Design and implement high‑performance compute kernels for ML operations, leveraging the Neuron architecture and programming models.
Analyze and optimize kernel‑level performance across multiple generations of Neuron hardware.
Conduct detailed performance analysis using profiling tools to identify and resolve bottlenecks.
Implement compiler optimizations such as fusion, sharding, tiling, and scheduling.
Work directly with customers to enable and optimize their ML models on AWS accelerators.
Collaborate across teams to develop innovative kernel optimization techniques.
A day in the life
In design and coding activities, you will create metrics, implement automation and other improvements, and resolve the root causes of software defects. You will also:
Build high‑impact solutions for a large customer base.
Participate in design discussions and code reviews, and communicate with stakeholders.
Work cross‑functionally to drive technical decisions with your input.
Operate in a startup‑like development environment, focusing on the most important work.
About the team
AWS is one of the world’s leading cloud platforms, committed to inclusion and innovation. We value work‑life balance, mentorship, and diverse experiences. We encourage applicants who may not meet every bullet in the qualifications to apply.
Qualifications
3+ years of engineering team management experience
7+ years of experience working directly within engineering teams
3+ years of designing or architecting systems (design patterns, reliability, scaling)
8+ years of leading multi‑tier web services development
Knowledge of the full software/hardware/network development lifecycle, including coding standards, code reviews, source control, build processes, testing, certification, and live site operations
Experience partnering with product or program management teams
Experience communicating with users, other technical teams, and senior leadership to gather requirements and describe features and design
Experience recruiting, mentoring, and managing teams of software engineers
Amazon is an equal opportunity employer and does not discriminate on the basis of protected status. Los Angeles County postings include safety and compliance expectations and related notices. If you need a workplace accommodation during the application or interview process, please review the accommodations information with Amazon Careers. For more information about benefits, visit the employee benefits page. This position will remain posted until filled.