Quadric

AI Kernel Engineer

Quadric, San Francisco, California, United States, 94199

Quadric has created an innovative general-purpose neural processing unit (GPNPU) architecture. Quadric’s co‑optimized software and hardware are designed to run neural network (NN) inference workloads in a wide variety of edge and endpoint devices, ranging from battery‑operated smart‑sensor systems to high‑performance automotive and autonomous vehicle systems. Unlike other NPUs or neural network accelerators in the industry today, which can accelerate only a portion of a machine learning graph, the Quadric GPNPU executes both NN graph code and conventional C++ DSP and control code.

Role Overview

The AI Kernel Engineer at Quadric plays a key role in enabling a large number of AI kernels/operators to run efficiently on the Quadric platform. The AI Kernel Engineer will (1) develop a highly efficient Quadric kernel library for a variety of AI/LLM models and (2) analyze kernel performance and optimize kernels for different hardware configurations. This senior technical role demands deep knowledge of hardware architecture, compiler toolchains, and optimization techniques.

Responsibilities

Develop AI/LLM kernels/operators on the Quadric platform for efficient inference

Optimize kernel performance for different hardware configurations and workloads

Profile and analyze kernel performance in terms of compute, data, and parallelism; identify micro‑architecture and software bottlenecks and provide optimization solutions

Optimize kernel C/C++ code to maximize hardware utilization

Improve the Quadric toolchain, compiler, and runtime

Provide technical support and documentation to customers and the developer community

Requirements

Bachelor’s or Master’s in Computer Science and/or Electrical Engineering.

5+ years of experience in AI kernel development and optimization.

Experience with model and kernel inference performance profiling.

Experience with at least one of the following compute development environments: CUDA, DSP, NEON, Triton‑lang.

Proficiency in C/C++ and Python; experience with assembly language is a plus.

Demonstrated strong problem-solving, debugging, and communication skills.

Benefits

Life Insurance (Basic, Voluntary & AD&D)

Paid Time Off (Vacation, Sick & Public Holidays)

Family Leave (Maternity, Paternity)

Short Term & Long Term Disability

Training & Development

Work From Home

Free Food & Snacks

Stock Option Plan
