Amazon
Description:
Join our dynamic team focused on advancing the AWS Neuron software stack, designed for the AWS Inferentia and Trainium cloud-scale machine learning accelerators. As a Senior Software Engineer in our Machine Learning Applications (ML Apps) team, you will be at the forefront of developing, enabling, and optimizing various ML model families, including cutting-edge GEN-AI models and large-scale language models like llama-4, Deepseeq, and stable diffusion. You will collaborate closely with chip architects, compiler engineers, and runtime engineers to enhance the performance and accuracy of state-of-the-art models. Key Responsibilities: Lead the integration of distributed inference support into PyTorch utilizing XLA and the Neuron compiler and runtime stacks. Leverage your strong programming skills in C++ and Python alongside your machine learning expertise to identify and implement optimization opportunities through comparative analysis and benchmarking. Develop and automate solutions that ensure the accuracy of AI accelerators while maximizing their performance. Create deep AI toolchains to simplify and abstract complex low-level AI accelerator modules. A Day in the Life: We value work-life balance and understand the importance of a healthy equilibrium between your professional and personal lives. Our flexible working hours allow you to establish a routine that energizes both aspects of your life. Mentorship & Career Growth: Our team is committed to supporting the development of new members through a blend of experience levels within a culture that promotes knowledge sharing and mentorship. Senior team members offer one-on-one mentorship and constructive code reviews, fostering an environment where you can grow as a professional and tackle progressively complex tasks. Basic Qualifications: 3+ years of experience programming in a modern language such as Java, C++, or C#, including a foundation in object-oriented design. 3+ years of experience in leading system design or architecture, focusing on design patterns, reliability, and scalability. 3+ years of full software development life cycle experience, covering coding standards, code reviews, source control management, build processes, testing, and operations. Solid understanding of machine learning and deep learning models, encompassing their architecture, training, and inference, along with experience in optimizing model execution. Preferred Qualifications: Bachelor's degree in computer science or equivalent field. Join us in delivering exceptional results for our customers while preserving an inclusive culture that empowers all team members. If you require workplace accommodations due to a disability, please reach out to our Recruiting Partner. This position is based in the United States, and the base salary ranges from $129,300 to $223,600 per year, depending on geographic location and relevant qualifications. Additional compensation may include equity, sign-on bonuses, and comprehensive benefits.
Join our dynamic team focused on advancing the AWS Neuron software stack, designed for the AWS Inferentia and Trainium cloud-scale machine learning accelerators. As a Senior Software Engineer in our Machine Learning Applications (ML Apps) team, you will be at the forefront of developing, enabling, and optimizing various ML model families, including cutting-edge GEN-AI models and large-scale language models like llama-4, Deepseeq, and stable diffusion. You will collaborate closely with chip architects, compiler engineers, and runtime engineers to enhance the performance and accuracy of state-of-the-art models. Key Responsibilities: Lead the integration of distributed inference support into PyTorch utilizing XLA and the Neuron compiler and runtime stacks. Leverage your strong programming skills in C++ and Python alongside your machine learning expertise to identify and implement optimization opportunities through comparative analysis and benchmarking. Develop and automate solutions that ensure the accuracy of AI accelerators while maximizing their performance. Create deep AI toolchains to simplify and abstract complex low-level AI accelerator modules. A Day in the Life: We value work-life balance and understand the importance of a healthy equilibrium between your professional and personal lives. Our flexible working hours allow you to establish a routine that energizes both aspects of your life. Mentorship & Career Growth: Our team is committed to supporting the development of new members through a blend of experience levels within a culture that promotes knowledge sharing and mentorship. Senior team members offer one-on-one mentorship and constructive code reviews, fostering an environment where you can grow as a professional and tackle progressively complex tasks. Basic Qualifications: 3+ years of experience programming in a modern language such as Java, C++, or C#, including a foundation in object-oriented design. 3+ years of experience in leading system design or architecture, focusing on design patterns, reliability, and scalability. 3+ years of full software development life cycle experience, covering coding standards, code reviews, source control management, build processes, testing, and operations. Solid understanding of machine learning and deep learning models, encompassing their architecture, training, and inference, along with experience in optimizing model execution. Preferred Qualifications: Bachelor's degree in computer science or equivalent field. Join us in delivering exceptional results for our customers while preserving an inclusive culture that empowers all team members. If you require workplace accommodations due to a disability, please reach out to our Recruiting Partner. This position is based in the United States, and the base salary ranges from $129,300 to $223,600 per year, depending on geographic location and relevant qualifications. Additional compensation may include equity, sign-on bonuses, and comprehensive benefits.