Amazon Web Services (AWS)

Network Development Engineer, Annapurna Labs Infrastructure

Amazon Web Services (AWS), Austin, Texas, US, 78716


Overview

Annapurna Labs is an organization within AWS responsible for building innovation in silicon and software for AWS customers. With development centers in the U.S. and Israel, Annapurna is at the forefront of innovation by combining cloud scale with the world’s most talented engineers. The Annapurna team covers silicon engineering, hardware design and verification, software, and operations. The team has contributed to AWS cloud infrastructure in networking and security with products such as AWS Nitro, Enhanced Network Adapter (ENA), and Elastic Fabric Adapter (EFA), as well as in compute (e.g., AWS Graviton and F1 EC2 Instances), machine learning (AWS Neuron, Inferentia and Trainium ML Accelerators), and scalable NVMe storage.

As part of the Annapurna Labs Infrastructure team, you will have the opportunity to contribute to the next generation of cloud computing infrastructure. The role involves a fast-paced, innovative environment focused on delivering high-impact infrastructure for Machine Learning Accelerators, including on-premises and cloud deployments for accelerated computing.

Key responsibilities

The Network Development Engineering role involves developing a broad range of skills. Leverage Linux expertise to troubleshoot, implement fixes and workarounds, keep software up to date, and provide data and metrics to manage services. Design networks, develop network monitoring, and troubleshoot connectivity issues. Communicate clearly and collaborate with others to deliver results. Be a self-starter, comfortable with ambiguity and change. Be customer-obsessed, understanding customer pain points and delivering resolutions quickly and completely.

Lead across teams to develop and execute infrastructure plans that enable customers and engineering teams developing the Machine Learning Acceleration product family. Dive deep to solve critical infrastructure issues involving networking, high-performance compute clusters, infrastructure automation of hardware/software/firmware testing, and ASIC/EDA development. Influence within your team, customers, and AWS service teams to drive and develop technical implementations for overall infrastructure designs. Identify and implement process improvements that increase agility and operational efficiency across design, automation, development, test, and operations.

Define new mechanisms for system health monitoring, diagnostics, repair, and automation. Develop, document, and update operational runbooks as you participate in on-call rotations. Work with customers to translate requirements into cloud and on-premises infrastructure solutions. Define infrastructure requirements for labs and server rooms, and liaise with contractors and vendors for infrastructure. Take ownership of testing, deployments, and measuring infrastructure health; support silicon development workflows including ATE testers, emulators, and lab debug equipment.

Day-in-the-life / On-site expectations

Collaborate with top engineers to develop Machine Learning Accelerators. Work backwards from customers to develop infrastructure requirements for cloud and on-premises environments. Deliver on-premises infrastructure that meets customer needs; own testing, deployments, and health metrics. Participate in on-call rotations and maintain runbooks.

Basic Qualifications

4+ years of experience with major internet routing protocols.
4+ years of experience in a Linux/Unix environment.

Preferred Qualifications

1+ years of automation scripting using Python, Bash, Shell and/or Perl.

Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status. Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.

Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $127,400/year in our lowest geographic market up to $212,800/year in our highest geographic market. Pay is based on factors including market location and job-related knowledge, skills, and experience. Amazon is a total compensation company. Depending on the position offered, equity, sign-on payments, and other forms of compensation may be provided as part of a total compensation package, in addition to a full range of medical, financial, and/or other benefits. For more information, please visit https://www.aboutamazon.com/workplace/employee-benefits.

This position will remain posted until filled. Applicants should apply via our internal or external career site.

Company - Annapurna Labs (U.S.) Inc.
Job ID: A3079852
