Ampere

AI Accelerator Software Engineer-Graph Optimization

Ampere, Santa Clara, California, US, 95053


Description

Invent the future with us.

Ampere is a semiconductor design company for a new era, leading the future of computing with an innovative approach to CPU design focused on high-performance, energy-efficient, sustainable cloud computing.

By providing a new level of predictable performance, efficiency, and sustainability, Ampere is working with leading cloud suppliers and a growing partner ecosystem to deliver cloud instances, servers, and embedded/edge products that can handle the compute demands of today and tomorrow.

Join us at Ampere and work alongside a passionate and growing team - we'd love to have you apply!

About the role:

In this role as an AI Accelerator Software Engineer-Graph Optimization, you will drive the development and optimization of cutting-edge AI frameworks. You will be at the forefront of advancing AI capabilities, helping to pave the way for high-performance and efficient computing solutions that will meet future AI demands.

What you'll achieve:

In this role, you will optimize the computational graph to fully unlock the potential of Ampere's deep learning accelerator.

Go deep into the entire SW/HW stack to accelerate deep learning, including but not limited to inference serving, framework integration, compiler, runtime library, communication and compute kernel development, and performance tuning.

Enable deep learning models, with strong performance and accuracy, on popular frameworks like PyTorch and llama.cpp and on serving platforms like vLLM and SGLang, positioning you at the forefront of AI innovation.

Apply HW/SW co-design to optimize existing AI architectures: enhance computational efficiency, increase throughput, reduce latency, and improve scalability, pushing the boundaries of what's possible in AI technology.

Be a key team member in building state-of-the-art software and hardware AI co-processors/accelerators, and contribute to a collaborative and dynamic work environment that supports continuous improvement and excellence.

Collaborate with cross-functional teams to integrate AI solutions into Ampere's cloud-native processor platforms and accelerators.

About you:

BS in Computer Science, Mathematics, or a related technical field & 12 years of related experience; or MS degree & 8 years; or PhD & 5 years.

Previous development experience with LLVM/MLIR/XLA for deep learning.

Strong expertise in programming languages such as Python and C/C++, with a strong background in performance tuning.

Deep knowledge of graph IRs, such as Torch FX Graph IR and MLIR dialects, is a must.

Experience with pattern recognition and fusion strategies is also required.

Solid understanding of AI and machine learning concepts, including neural networks and data processing frameworks, is preferred.

What we'll offer:

At Ampere, we believe in taking care of our employees and providing a competitive total rewards package that includes base pay, bonus (i.e., variable pay tied to internal company goals), long-term incentive, and comprehensive benefits. The full base pay range for this role is between $169,500 and $282,500, except in the San Francisco Bay Area, where the range is between $178,500 and $297,500.

Our benefits include health, wellness, and financial programs that support employees through every stage of life, with full benefits eligibility at 20 hours per week.

Benefit highlights include:

Premium medical insurance, dental insurance, vision insurance, as well as income protection and a 401(k) retirement plan, so that you can feel secure in your health and financial future.

Unlimited Flextime and 10+ paid holidays so that you can embrace a healthy work-life balance.

A variety of healthy snacks, energizing espresso, and refreshing drinks to keep you fueled and focused throughout the day.

And there is much more than compensation and benefits. At Ampere, we foster an inclusive culture that empowers our employees to do more and grow more. We are passionate about inventing industry-leading cloud-native designs that contribute to a more sustainable future. We are excited to share more about our career opportunities with you through the interview process.


Ampere is an inclusive and equal opportunity employer and welcomes applicants from all backgrounds. All qualified applicants will receive consideration for employment without regard to race, color, national origin, citizenship, religion, age, veteran and/or military status, sex, sexual orientation, gender, gender identity, gender expression, physical or mental disability, or any other basis protected by federal, state or local law.