ML Framework Software Engineer (PhD)

Meta, Menlo Park, California, United States, 94029

Summary

This role is about developing the core PyTorch 2.0 technologies, innovating and advancing the state-of-the-art of ML compilers, and accelerating PT2 adoption through direct engagements with OSS and industry users. The PyTorch Compiler team is dedicated to making PyTorch run faster and more resource-efficient without sacrificing its flexibility and ease of use. The team is the driving force behind PT2, a step function change in PyTorch’s history that brought compiler technologies to the core of PyTorch. PT2 technologies have gained industry-wide recognition since their first release in March 2023. The team is committed to building the PT2 compiler that withstands the test of time while striving to become the #1 ML framework compiler in the industry. Our work is open source, cutting-edge, and industry leading. Responsibilities

Develop the PT2 compiler (e.g., TorchDynamo, TorchInductor, PyTorch Distributed, PyTorch Core) Improve PyTorch performance via systematic solutions for the entire community Explore the intersection of the PyTorch compiler and PyTorch distributed Optimize Generative AI models across the stack (pre-training, fine-tuning, and inference) Collaborate with users of PyTorch to enable new use cases of PT2 technologies both inside and outside Meta Minimum Qualifications

Currently has, or is in the process of obtaining a Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta Currently has or is in the process of obtaining a PhD degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta Research or industry experience in developing compilers, ML systems, ML accelerators, GPU performance, and similar Advanced in Python or C++ programming Preferred Qualifications

Experience in developing PyTorch/PT2, Triton, MLIR, JAX, XLA, TVM is a huge plus Knowledge in GPU architecture, ML accelerator performance, and developing high-performance kernels Experience in building OSS communities and extensive social media presence in the ML Sys domain Experience with training models, end-to-end model optimizations, or applying ML to systems Knowledge of communication collectives, PyTorch distributed, and parallelism Experience in developing inside other ML frameworks like Caffe2, TensorFlow, ONNX, TensorRT First-authored publications at peer-reviewed conferences (e.g. NeurIPS, MLSys, ASPLOS, PLDI, ICML, or similar) Public Compensation

$56.25/hour to $173,000/year + bonus + equity + benefits Industry

Internet Equal Opportunity

Meta is proud to be an Equal Employment Opportunity and affirmative action employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, transgender status, sexual stereotypes, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics. We also consider qualified applicants with criminal histories, consistent with applicable federal, state and local law. Meta participates in the E-Verify program in certain locations, as required by law. Please note that Meta may leverage artificial intelligence and machine learning technologies in connection with applications for employment. Meta is committed to providing reasonable accommodations for candidates with disabilities in our recruiting process. If you need any assistance or accommodations due to a disability, please let us know at accommodations-ext@fb.com.

#J-18808-Ljbffr