AMD

AI/ML Sr. Compiler Development Engineer

AMD, San Jose, California, United States, 95199

Overview

Join to apply for the

AI/ML Sr. Compiler Development Engineer

role at

AMD . This range is provided by AMD. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. Compensation

Base pay range: $187,280.00/yr - $280,920.00/yr The Role

We are looking for a dynamic, energetic Sr. Compiler Development Engineer to join our growing team in the AI Group. In this role, you will be responsible for architecting and defining AI workload models, dataflow, block level and system level performance of Neural Processing Unit (NPU), NPU network performance modeling, and performance bottleneck analysis on pre/post silicon platforms. As a member of our dynamic team, you will have the opportunity to shape the future of AI model development. The Person

We are looking for a candidate who possesses strong engineering skills to tackle complex challenges on AI model development work. You should have experience in optimizing and accelerating CNN/Generative AI models. Person needs excellent cross team collaboration skills to succeed in this role. Strong experience in developing ML compiler for efficient network mapping on NPU Work with cross-functional teams to optimize various parts of the SW stack - AI Compiler, AI frameworks, device drivers, and firmware. Bring up emerging ML models based on CNN, transformers and characterize performance. Work on quantization, sparsity, and architecture search methods to optimize and enhance the performance, efficiency, and accuracy of Generative AI models. Collaborate closely with software engineers, data scientists, and researchers to integrate AI models into software applications and platforms. Key Responsibilities

Research, design, and implement novel methods for efficient CNN and GEN AI models. Model optimization method design including quantization, sparsity, NAS, etc. Collaborate with other team members and teams. Collaborate with the compiler team to develop optimization strategies for the compiler. Preferred Experience

Experience with deep learning frameworks, e.g., PyTorch/ONNX/TensorFlow. Experience on model compression, quantization, and end-to-end inference optimization. Strong coding skills in C/C++, Python required. Experience with any of the following also a plus: LLMs, stable diffusion, NeRF, or text-to-video generation. Solid knowledge of AI and ML concepts and techniques. Practical experience applying these concepts to solve real-world problems in the context of research or work experience. Understanding the performance implications on AI acceleration of different compute, memory, and communication configurations and hardware and software implementation choices. Developing and optimizing code for VLIW processors. Deep understanding of AI frameworks, preferably ONNX. Academic Credentials

Minimum of a BS degree, MS or above preferred. Location

San Jose, CA (Hybrid) Benefits offered are described: AMD benefits at a glance. AMD is an equal opportunity, inclusive employer. We will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout the recruitment process.

#J-18808-Ljbffr