Data Scientist - Model Optimization
ZipRecruiter - Burlingame
Overview
Job Description

Quadric has created an innovative general-purpose neural processing unit (GPNPU) architecture. Quadric's co-optimized software and hardware are designed to run neural network (NN) inference workloads on a wide variety of edge and endpoint devices, from battery-operated smart sensors to high-performance automotive or autonomous vehicle systems. Unlike other NPUs or neural network accelerators that can only accelerate part of a machine learning graph, the Quadric GPNPU executes both NN graph code and conventional C++ DSP and control code.

What We Value: Integrity, Humility, Happiness
What We Expect: Initiative, Collaboration, Completion

Role: You will join the data science team focused on model optimization, researching, prototyping, and validating low-precision techniques to make neural networks leaner and faster on the Chimera GPNPU. Your analyses will determine quantization recipes in the Chimera SDK and influence future hardware features.

Responsibilities:
- Design statistically rigorous experiments to compare PTQ, QAT, pruning, and mixed-precision schemes on vision, language, and multimodal models.
- Build calibration datasets; develop Python notebooks/dashboards to track accuracy, latency, power, and memory trade-offs.
- Perform layer- and token-level error analysis to guide numerical-format choices.
- Partner with the compiler team to convert findings into SDK flows and reference configurations.
- Publish internal whitepapers and external benchmarks, and present results to customers and at industry events.
- Monitor academic literature on compression and efficient inference; translate promising ideas into reproducible prototypes.

Requirements:
- M.S./Ph.D. in CS, EE, Applied Math, or similar, with 5+ years in ML model optimization or data-driven research.
- Deep understanding of fixed-point arithmetic, quantization theory, and statistical calibration.
- Fluent in Python, PyTorch or TensorFlow, NumPy/Pandas/SciPy, and data visualization tools like Matplotlib or Plotly.
- Hands-on experience with at least one quantization toolkit (PyTorch FX/PTQ/QAT, TF-Lite, ONNX-Runtime, TVM, MLIR Quant).
- Working knowledge of CNNs, Transformers, and DNN architectures.

Benefits:
- Competitive salaries and meaningful equity
- Health Care Plan (Medical, Dental & Vision)
- Retirement Plan (401k, IRA)
- Life Insurance (Basic, Voluntary & AD&D)
- Paid Time Off (Vacation, Sick & Public Holidays)
- Family Leave (Maternity, Paternity)
- Work From Home
- Free Food & Snacks

Founded in 2016 and based in downtown Burlingame, California, Quadric is building the world’s first supercomputer designed for the real-time needs of edge devices. Quadric aims to empower developers across industries with superpowers to create tomorrow’s technology today. The company was co-founded by technologists from MIT and Carnegie Mellon, previously the technical co-founders of the Bitcoin computing company 21.

Quadric is proud to be an equal opportunity workplace and an affirmative action employer. We are committed to equal employment opportunity regardless of race, gender, age, religion, citizenship, marital status, or other protected characteristics.