Logo
Qualcomm

Sr. Staff Engineer, Machine Learning Engineering (Quantization SW)

Qualcomm, San Diego, California, United States, 92189

Save Job

Sr. Staff Engineer, Machine Learning Engineering (Quantization SW) Join to apply for the

Sr. Staff Engineer, Machine Learning Engineering (Quantization SW)

role at

Qualcomm .

Overview Qualcomm Technologies, Inc. is seeking a Sr. Staff Engineer in Machine Learning Engineering focused on Quantization software. We focus on on-device AI enabling edge solutions, including model fine tuning, hardware acceleration, model quantization, and edge inference for the intelligent edge.

Responsibilities

Work in a dynamic research environment.

Be part of a multi-disciplinary team of researchers and software engineers who work with cutting edge AI frameworks and tools.

Architect, design, develop and test model optimization techniques that include - but are not limited to - graph optimization, pruning and quantization.

Minimum Qualifications

Bachelor's degree in Computer Science, Engineering, Information Systems, or related field and 6+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience.

OR Master’s degree in Computer Science, Engineering, Information Systems, or related field and 5+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience.

OR PhD in Computer Science, Engineering, Information Systems, or related field and 4+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience.

Preferred Skills And Experience

Strong Software Engineering/Development skills combined with a solid foundation in AI and general ML techniques.

Proven hands-on experience evaluating and optimizing Generative AI workflows for accuracy, performance, and other key metrics.

Prior experience with ML model optimization frameworks and familiarity with quantization, pruning, etc.

Knowledge of neural networks, with hands-on experience using ML frameworks such as PyTorch, ONNX, etc.

Strong Python design and implementation skills.

Strong general analytical and debugging skills.

Prior experience working in agile environments.

Prior experience collaborating with multi-disciplinary teams across time zones.

Strong leadership skills as a mentor, team player, communicator and presenter.

Experience deploying GenAI LLM/LVM models on edge devices.

Prior experience with model quantization, profiling and running models on edge devices.

Prior experience with frameworks like HuggingFace Optimum, ONNX Runtime and OpenVINO.

Proven hands-on experience establishing a high-quality software delivery process using industry best-practices (code review, CI/CD, automation, etc.).

Qualcomm is an equal opportunity employer. If you are an individual with a disability and need an accommodation during the application/hiring process, Qualcomm is committed to providing an accessible process. You may e-mail disability-accommodations@qualcomm.com or call Qualcomm's toll-free number found here. Qualcomm will provide reasonable accommodations to support individuals with disabilities to participate in the hiring process. Qualcomm is also committed to making our workplace accessible for individuals with disabilities.

EEO Employer: Qualcomm is an equal opportunity employer; all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or any other protected classification.

Pay Range And Other Compensation & Benefits: $178,400.00 - $267,600.00. The pay scale reflects the broad minimum to maximum for this job code location. Salary is one component of total compensation; Qualcomm also offers a discretionary bonus program and RSU grants, along with a comprehensive benefits package.

For more information about this role, please contact Qualcomm Careers.

#J-18808-Ljbffr