SoC NPU Architect665 Clyde Avenue, Mountain View, CA, USA
Samsung Research America - Mountain View, California, United States, 94043
Work at Samsung Research America
Overview
- View job
Overview
The Samsung SoC Architecture Lab explores innovative SoC architecture, key IP blocks (CPU, GPU, NPU), bus / memory subsystem and multimedia subsystems for future Samsung Galaxy products (Smartphones, IoT and future devices) echoing Samsung's AI for All strategy. We are defining the high performance SoC architecture development for various Galaxy device lineups. This lab collaborates with Samsung MX headquarter team, key R&D teams around the globe and Samsung's strategic SoC partners to innovate and reinvent technology that will positively impact hundred-millions of people around the world via the Galaxy flagship products. The NPU Architect role at Samsung Research America focuses on advancing technology solutions for machine learning applications and mobile devices. The position involves driving the architectural definition of Neural Processing Units (NPUs) and developing new architectures that prioritize low power, real-time and high performance to meet the rapid-growing demand of generative AI use cases emerged from low level camera vision, computational photography, photo editing, LLM and multimodal language-vision models targeting on-device inference. This role is integral to the SoC Architecture Lab, which is part of Samsung Research America and the extension of Samsung's Mobile Experience Division. Position Responsibilities
Analyze the state-of-the-art AI algorithms including camera and Gen AI use cases (LLM and LVM) Drive architectural definition of NPUs for mobile AI applications Create NPU simulator to simulate real NPU Map Gen AI use cases to NPU and NPU simulator with compiler concept in mind to find the bottlenecks of NPU Optimize and co-design of SW and HW to develop new architectures for low power and high performance Lead PPA (power, performance, and area) tradeoff analysis to achieve requirements Work closely with AI use case developers and NPU compiler and software architects to come up with optimized architectural solutions Support the development of NPUs to make sure new NPU architecture succeed Work closely with design verification team for debug, to meet code and functional coverage targets Required Skills
Bachelor's, Masters or PhD degree in Electrical, Computer Science, related Science or equivalent combination of education, training and experience 15+ years of working experience in NPU architecture & development Good knowledge and experience in architecture/design of NPUs Good knowledge in CNN and Transformer Network with hands on experience on LLM/LVM Well versed in processor architecture like ARM and RISC-V Experience in ML frameworks (TensorFlow, PyTorch, ONNX) Good knowledge in ML compilers Experience in micro architecture of SoC, interface subsystems, logic modules Experience in complex logic designs and timing closure on large, sophisticated designs Self-motivated problem-solver with an ability to work well in a team Inclusive, adapting style to diverse global norms Avid learner with curiosity and resilience Collaborative, building relationships and offering support Innovative and creative, exploring innovative ideas and adapting quickly to change Proficient programming skills in Python and/or C++ Special Attributes
Experience in AI use case development and deployment (training, quantization, deployment) Experience in modeling for performance/power estimates using SystemC, Platform Architect, ModelSim, RTL or similar tools/expertise Experience in CPU, GPU, DSP, DRAM and interconnect IPs Good knowledge in ASIC design flow Highly motivated with good verbal and written communication skills Creativity in problem solving Base Pay Range $188,400 - $282,450 USD