Logo
Amazon

Software Development Engineer - Generative AI, AGIF | Inference Engine

Amazon, Boston, Massachusetts, us, 02298

Save Job

Software Development Engineer - Generative AI, AGIF | Inference Engine Job ID: 3109601 | Amazon.com Services LLC

Overview Are you interested in advancing Amazon's Generative AI capabilities? Come work with a talented team of engineers and scientists in a highly collaborative and friendly team. We are building state‑of‑the‑art Generative AI technology that will benefit all Amazon businesses and customers.

Key Job Responsibilities

Design, develop, test, and deploy high‑performance model inference capabilities across multi‑modality, SOTA model architectures, latency, throughput, and cost.

Collaborate closely with engineers and scientists to influence strategy and define the team’s roadmap.

Drive system architecture, spearhead best practices, and mentor junior engineers.

A Day in the Life You will consult with scientists to gain inspiration from emerging techniques and incorporate them into the roadmap; design and experiment with new algorithms from public and internal papers, benchmark the latency and accuracy of implementations; implement production‑grade solutions and see them deployed swiftly; collaborate with other science and engineering teams to get things done; uphold the highest bar in operational excellence and support production systems, continually creating solutions to reduce ops load.

About the Team Our mission is to build best‑in‑class, fast, accurate, and cost‑efficient frontier model inference solutions and infrastructure that will enable Amazon businesses to deliver more value to their customers.

Basic Qualifications

3+ years of non‑internship professional software development experience

Experience with software performance optimization

Knowledge of Deep Learning and Transformer architectures

Preferred Qualifications

3+ years covering the full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience

Bachelor's degree in computer science or equivalent

Experience with Large Language Model inference

Experience with GPU programming (TensorRT‑LLM)

Experience with Python, PyTorch, and C++ programming and performance optimization

Experience with Trainium and Inferentia development

Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.

Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information.

Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $129,300/year in our lowest geographic market up to $223,600/year in our highest geographic market. Pay is based on a number of factors including market location and may vary depending on job‑related knowledge, skills, and experience. Amazon is a total compensation company. Dependent on the position offered, equity, sign‑on payments, and other forms of compensation may be provided as part of a total compensation package, in addition to a full range of medical, financial, and/or other benefits. For more information, please visit https://www.aboutamazon.com/workplace/employee-benefits.

#J-18808-Ljbffr