AIML - ML Engineer, Machine Learning Platform & Infra
Santa Clara, California, United States | Machine Learning and AI
Description
Work alongside the Foundation Model Research team to optimize inference for cutting-edge model architectures. Collaborate with product teams to develop production-grade solutions for launching models that serve millions of customers in real time. Build tools to identify bottlenecks in inference across various hardware and use cases. Mentor and guide engineers within the organization.
Minimum Qualifications
- 5+ years of experience leading and managing complex, ambiguous projects.
- Experience with high-throughput services at supercomputing scale.
- Proficiency in deploying applications on Cloud platforms (AWS / Azure or equivalent) using Kubernetes, Docker, etc.
- Knowledge of GPU programming concepts using CUDA.
- Familiarity with popular ML frameworks like PyTorch and TensorFlow.
Preferred Qualifications
- Proficiency in building and maintaining systems in modern languages (e.g., Golang, Python).
- Understanding of deep learning architectures such as Transformers and Encoder/Decoder models.
- Experience with Nvidia TensorRT-LLM, vLLLM, DeepSpeed, Nvidia Triton Server, etc.
- Experience developing custom CUDA kernels using CUDA or OpenAI Triton.
At Apple, the base pay range for this role is between $175,800 and $312,200, depending on skills, qualifications, experience, and location. Apple offers comprehensive benefits including medical and dental coverage, retirement plans, stock programs, educational reimbursement, and more. Eligibility for bonuses, stock options, and relocation assistance may apply. Learn more about Apple Benefits.
Apple is an equal opportunity employer committed to diversity and inclusion. We promote equal opportunity without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, veteran status, or other protected characteristics. Learn more about your EEO rights as an applicant.
#J-18808-Ljbffr