Amazon Web Services (AWS)
Systems Development Eng (AWS Generative AI & ML Servers), AWS Hardware Engineeri
Amazon Web Services (AWS), Seattle, Washington, us, 98127
Systems Development Eng (AWS Generative AI & ML Servers), AWS Hardware Engineering Accelerators
Join to apply for the Systems Development Eng (AWS Generative AI & ML Servers), AWS Hardware Engineering Accelerators role at Amazon Web Services (AWS).
Description Do you want to build the backbone of Generative AI cloud at AWS? Do you want to build the future of the cloud for AI training and inference? Do you want to deliver continuous price performance improvements in the cloud for AI model training for multi‑billion variable LLMs? Come join us in designing, delivering and operating AWS cloud offerings that enable high performance and scalability in AI/ML and HPC workloads.
The AWS Hardware Engineering team creates server designs for Amazon’s innovative web services. Our designs are industry‑leading in frugality and operational excellence, and are critical to the success of the AWS business and millions of customers who use AWS today. Our engineers solve challenging technology problems, building architecturally sound, high‑quality components to enable AWS to realize critical business strategies.
The ideal candidate for this role will be an innovative self‑starter knowledgeable of the full technical stack - from baremetal server hardware up to the software in userland, and everything in between. You are passionate about cloud scale, curious how systems and software decisions impact the user, insist on highest standards, and can develop tools to diagnose and fix issues. You are an excellent systems debugger and a leader with strong organizational, planning, and communication skills. You are a builder!
Key job responsibilities You will work with engineers across the company to deliver the next‑generation AWS platforms. You will have a direct impact on the bottom line and the ability to deliver improvements for AWS. You will have ownership for the implementation of your work and see direct product improvements based on the results of your work.
You will be a technical leader solving complex architectural problems that may not have been defined beforehand. You will own the team’s system, proactively identify deficiencies, write tactical code to solve issues before they impact customers, and work with your team to scale the solution. You will decompose difficult server‑system testability, reliability and diagnosis problems into straightforward tasks and lead to deliver them yourself and through others in parallel. You will use knowledge of hardware, software, system designs, x86 architecture, processes, diagnosis and operations.
A day in the life: Working with a variety of job roles (SDEs, SDETs, Hardware Engineers, TPMs, Managers, Principals) and groups (AWS Hardware Engineering, EC2, other AWS services) through server conception, test, launch, and operations. Driving high quality and reliability into future/new designs for AWS Accelerated server solutions for AWS Cloud.
Basic Qualifications
2+ years of non‑internship professional software development experience
1+ years of designing or architecting (design patterns, reliability and scaling) of new and existing systems experience
7+ years of administrative experience in networking, storage systems, operating systems and hands‑on systems engineering experience
Knowledge of systems engineering fundamentals (networking, storage, operating systems)
Experience programming with at least one modern language such as C++, C#, Java, Python, Golang, PowerShell, Ruby
Preferred Qualifications
Experience with PowerShell (preferred), Python, Ruby, or Java
Experience working in an Agile environment using the Scrum methodology
Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.
#J-18808-Ljbffr
Description Do you want to build the backbone of Generative AI cloud at AWS? Do you want to build the future of the cloud for AI training and inference? Do you want to deliver continuous price performance improvements in the cloud for AI model training for multi‑billion variable LLMs? Come join us in designing, delivering and operating AWS cloud offerings that enable high performance and scalability in AI/ML and HPC workloads.
The AWS Hardware Engineering team creates server designs for Amazon’s innovative web services. Our designs are industry‑leading in frugality and operational excellence, and are critical to the success of the AWS business and millions of customers who use AWS today. Our engineers solve challenging technology problems, building architecturally sound, high‑quality components to enable AWS to realize critical business strategies.
The ideal candidate for this role will be an innovative self‑starter knowledgeable of the full technical stack - from baremetal server hardware up to the software in userland, and everything in between. You are passionate about cloud scale, curious how systems and software decisions impact the user, insist on highest standards, and can develop tools to diagnose and fix issues. You are an excellent systems debugger and a leader with strong organizational, planning, and communication skills. You are a builder!
Key job responsibilities You will work with engineers across the company to deliver the next‑generation AWS platforms. You will have a direct impact on the bottom line and the ability to deliver improvements for AWS. You will have ownership for the implementation of your work and see direct product improvements based on the results of your work.
You will be a technical leader solving complex architectural problems that may not have been defined beforehand. You will own the team’s system, proactively identify deficiencies, write tactical code to solve issues before they impact customers, and work with your team to scale the solution. You will decompose difficult server‑system testability, reliability and diagnosis problems into straightforward tasks and lead to deliver them yourself and through others in parallel. You will use knowledge of hardware, software, system designs, x86 architecture, processes, diagnosis and operations.
A day in the life: Working with a variety of job roles (SDEs, SDETs, Hardware Engineers, TPMs, Managers, Principals) and groups (AWS Hardware Engineering, EC2, other AWS services) through server conception, test, launch, and operations. Driving high quality and reliability into future/new designs for AWS Accelerated server solutions for AWS Cloud.
Basic Qualifications
2+ years of non‑internship professional software development experience
1+ years of designing or architecting (design patterns, reliability and scaling) of new and existing systems experience
7+ years of administrative experience in networking, storage systems, operating systems and hands‑on systems engineering experience
Knowledge of systems engineering fundamentals (networking, storage, operating systems)
Experience programming with at least one modern language such as C++, C#, Java, Python, Golang, PowerShell, Ruby
Preferred Qualifications
Experience with PowerShell (preferred), Python, Ruby, or Java
Experience working in an Agile environment using the Scrum methodology
Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.
#J-18808-Ljbffr