Quadric
Quadric GPNPU Software Development Lead
Quadric has created an innovative general-purpose neural processing unit (GPNPU) architecture. Quadric's co-optimized software and hardware are targeted to run neural network (NN) inference workloads in a wide variety of edge and endpoint devices, ranging from battery-operated smart-sensor systems to high-performance automotive or autonomous vehicle systems. Unlike other NPUs or neural network accelerators in the industry today, which can only accelerate a portion of a machine learning graph, the Quadric GPNPU executes both NN graph code and conventional C++ DSP and control code.
What We Value: Integrity, Humility, Happiness
What We Expect: Initiative, Collaboration, Completion
Role:
Reporting directly to the VP, Engineering, you will own the entire software team and will be responsible for execution on Quadric's Chimera SDK. Join us in our shared vision of shipping world-class model optimization and compilers for AI inference where it matters most: on the factory floor, in the vehicle, and across consumer devices and enterprise.
Responsibilities:
Own end-to-end delivery of the software stack: SDK, graph compiler, kernel libraries, and developer tooling.
Set technical direction and a multi-release roadmap for compiler, kernels, and SDK; align with silicon and architecture.
Own the inference optimization stack (Quantization -> Compile -> Accelerated Performance) on the Chimera architecture for Vision Models, LLMs, VLMs, etc.
Build and track execution plans, milestones, and KPIs (operator coverage, latency/throughput, accuracy deltas, compile times).
Manage and mentor an engineering org (managers + ICs), grow the team, and develop leaders.
Be hands-on for critical designs, reviews, and code, especially around IR design, codegen, and kernels.
Partner with customers and field teams to unblock POCs, prioritize the roadmap, and drive production wins.
Establish quality bars and release engineering (CI/CD, testing, benchmarking, reproducible builds, docs, and samples).