Amazon Web Services (AWS)

Applied Scientist, AWS Neuron Science Team

Amazon Web Services (AWS), Santa Clara, California, US, 95053

The AWS Neuron Science Team is looking for talented scientists to enhance our software stack, accelerating customer adoption of Trainium and Inferentia accelerators. In this role, you will work directly with external and internal customers to identify key adoption barriers and optimization opportunities, collaborate closely with our engineering teams to implement innovative solutions, and engage with academic and research communities to advance state‑of‑the‑art ML systems. This is a strategic growth area for AWS and offers an exciting and impactful environment.

Responsibilities

Develop and apply ML/RL approaches for kernel and code generation and optimization.

Create advanced compiler techniques for ML workloads.

Build tools for accuracy and reliability validation to improve system robustness.

Design high‑performance kernels optimized for our ML accelerator architectures.

A day in the life includes supporting the development and management of compute, database, storage, and platform services in AWS, with potential exposure to Amazon’s growing suite of generative AI services and other cloud computing offerings.

About the Team

AWS Neuron is the software stack for Trainium and Inferentia, the AWS machine learning chips. Inferentia delivers best‑in‑class ML inference performance at the lowest cost, while Trainium enables best‑in‑class training performance. The Neuron stack includes an ML compiler and integrates natively with popular ML frameworks. Our products are used at scale by external customers such as Anthropic and Databricks and by internal customers such as Alexa, Amazon Bedrock, Amazon Robotics, Amazon Ads, and Amazon Rekognition.

Basic Qualifications

PhD in computer science, computer engineering, or a related field.

Experience with patents or publications at top‑tier peer‑reviewed conferences or journals.

Strong background in algorithms and data structures, parsing, numerical optimization, data mining, parallel and distributed computing, or high‑performance computing.

Programming experience in Java, C++, Python, or related languages.

Experience using Unix/Linux.

Preferred Qualifications

Experience investigating, designing, prototyping, and delivering new and innovative system solutions.

Experience with state‑of‑the‑art deep learning model architecture design, training and optimization, and model pruning.

Experience with popular deep learning frameworks such as MXNet and TensorFlow.

Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.

This position will remain posted until filled. Applicants should apply via our internal or external career site.