XPENG & Volkswagen Group
Senior Computer Vision Engineer - Deployment
XPENG & Volkswagen Group, Santa Clara, California, us, 95053
Overview
XPENG is a leading smart technology company at the forefront of innovation, integrating AI and autonomous driving technologies into its vehicles, including electric vehicles (EVs), eVTOL aircraft, and robotics. XPENG focuses on intelligent mobility and reshaping the future of transportation through R&D in AI, machine learning, and smart connectivity. We are seeking a passionate and skilled Senior Computer Vision Engineer to lead the development and deployment of high-performance, large-scale AI models. You will optimize model inference, implement compression techniques (quantization, pruning, distillation), and ensure efficient on-device deployment across GPU and custom AI accelerator platforms. Your work will enable the next generation of intelligent systems in autonomous driving and beyond. Responsibilities
Optimize large-scale multimodal models for low-latency inference and efficient memory usage across diverse hardware platforms. Apply state-of-the-art model compression techniques, including quantization (e.g., INT8/FP16), pruning, and knowledge distillation. Develop and integrate custom inference kernels targeting GPU or custom AI accelerators. Build profiling tools and performance models to analyze bottlenecks and guide optimization strategies. Contribute to real-world deployment efforts in autonomous driving systems, including on-vehicle testing and iteration. Track the latest research in efficient ML inference and integrate relevant techniques into production pipelines. Qualifications
Master’s or Ph.D. in Computer Science, Electrical Engineering, or related field. Open to recent graduates. Strong coding skills in C++ and Python with a focus on performance and scalability. Proficient in deploying deep learning models using TensorRT, ONNX Runtime, or TVM. Familiarity with CUDA programming and parallel computing principles. Solid understanding of model inference workflows and system-level performance tuning. Effective communicator and collaborative team player. Preferred Qualifications
Hands-on experience with deploying vision-language or large multimodal models. Familiarity with low-precision inference (INT8/FP16), kernel fusion, and operator-level optimization. Experience in autonomous driving, robotics, or edge AI applications. Track record of open-source contributions or publications in ML/AI conferences (e.g., NeurIPS, ICML, CVPR). Background in system profiling, latency modeling, or compiler-level optimization. What we provide
A fun, supportive and engaging environment Infrastructures and computational resources to support your work Opportunity to work on cutting-edge technologies with top talents in the field Opportunity to make a meaningful impact on the transportation revolution through advancing autonomous driving Competitive compensation package Snacks, lunches, dinners, and fun activities Compensation
The base salary range for this full-time position is $174,720 - $295,680, in addition to bonus, equity and benefits. Salary ranges are determined by role, level and location. The range displayed reflects the minimum and maximum target for new hire salaries for the position across all US locations. Within the range, pay is determined by work location and factors including skills, experience, and education or training. Equal Opportunity
We are an Equal Opportunity Employer. It is our policy to provide equal employment opportunities to all qualified persons without regard to race, age, color, sex, sexual orientation, religion, national origin, disability, veteran status or marital status or any other category protected by federal or state regulations.
#J-18808-Ljbffr
XPENG is a leading smart technology company at the forefront of innovation, integrating AI and autonomous driving technologies into its vehicles, including electric vehicles (EVs), eVTOL aircraft, and robotics. XPENG focuses on intelligent mobility and reshaping the future of transportation through R&D in AI, machine learning, and smart connectivity. We are seeking a passionate and skilled Senior Computer Vision Engineer to lead the development and deployment of high-performance, large-scale AI models. You will optimize model inference, implement compression techniques (quantization, pruning, distillation), and ensure efficient on-device deployment across GPU and custom AI accelerator platforms. Your work will enable the next generation of intelligent systems in autonomous driving and beyond. Responsibilities
Optimize large-scale multimodal models for low-latency inference and efficient memory usage across diverse hardware platforms. Apply state-of-the-art model compression techniques, including quantization (e.g., INT8/FP16), pruning, and knowledge distillation. Develop and integrate custom inference kernels targeting GPU or custom AI accelerators. Build profiling tools and performance models to analyze bottlenecks and guide optimization strategies. Contribute to real-world deployment efforts in autonomous driving systems, including on-vehicle testing and iteration. Track the latest research in efficient ML inference and integrate relevant techniques into production pipelines. Qualifications
Master’s or Ph.D. in Computer Science, Electrical Engineering, or related field. Open to recent graduates. Strong coding skills in C++ and Python with a focus on performance and scalability. Proficient in deploying deep learning models using TensorRT, ONNX Runtime, or TVM. Familiarity with CUDA programming and parallel computing principles. Solid understanding of model inference workflows and system-level performance tuning. Effective communicator and collaborative team player. Preferred Qualifications
Hands-on experience with deploying vision-language or large multimodal models. Familiarity with low-precision inference (INT8/FP16), kernel fusion, and operator-level optimization. Experience in autonomous driving, robotics, or edge AI applications. Track record of open-source contributions or publications in ML/AI conferences (e.g., NeurIPS, ICML, CVPR). Background in system profiling, latency modeling, or compiler-level optimization. What we provide
A fun, supportive and engaging environment Infrastructures and computational resources to support your work Opportunity to work on cutting-edge technologies with top talents in the field Opportunity to make a meaningful impact on the transportation revolution through advancing autonomous driving Competitive compensation package Snacks, lunches, dinners, and fun activities Compensation
The base salary range for this full-time position is $174,720 - $295,680, in addition to bonus, equity and benefits. Salary ranges are determined by role, level and location. The range displayed reflects the minimum and maximum target for new hire salaries for the position across all US locations. Within the range, pay is determined by work location and factors including skills, experience, and education or training. Equal Opportunity
We are an Equal Opportunity Employer. It is our policy to provide equal employment opportunities to all qualified persons without regard to race, age, color, sex, sexual orientation, religion, national origin, disability, veteran status or marital status or any other category protected by federal or state regulations.
#J-18808-Ljbffr