Meta
Software Engineer, Systems ML - Frameworks / Compilers / Kernels
Meta, Saint Paul, Minnesota, United States, 55199
Overview
The candidate will be a member of the MTIA (Meta Training & Inference Accelerator) Software team within the PyTorch AI framework organization. The MTIA Software team develops a comprehensive AI compiler strategy to deliver a flexible platform to train and serve new DL/ML model architectures, with auto-tuned high performance for production across specialized hardware. The compiler stack, DL graph optimizations, and kernel authoring for specific hardware impact performance and deployment velocity of AI training and inference platforms at Meta. You will work on core areas such as PyTorch framework components, AI compiler and runtime, high-performance kernels and tooling to accelerate machine learning workloads on MTIA hardware. You will collaborate with AI researchers to analyze models and lower them efficiently on MTIA hardware and partner with hardware design teams to develop compiler optimizations for high performance. You will apply software development best practices to design features, optimization, and performance tuning techniques. You will gain experience in developing machine learning compiler frameworks and contribute to next-generation hardware-software co-design for AI domain problems.
Responsibilities
Development of SW stack with one of the following core focus areas: AI frameworks, compiler stack, high performance kernel development and acceleration onto next generation of hardware architectures
Contribute to the development of the industry-leading PyTorch AI framework core compilers to support new state of the art inference and training AI hardware accelerators and optimize their performance
Analyze deep learning networks, develop & implement compiler optimization algorithms
Collaborate with AI research scientists to accelerate the next generation of deep learning models, such as recommendation systems, generative AI, computer vision, and NLP
Performance tuning and optimizations of deep learning framework & software components
Minimum Qualifications
Proven C/C++ programming skills
Experience in AI framework development or accelerating deep learning models on hardware architectures
Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
Preferred Qualifications
AI Compiler: Experience with compiler optimizations such as loop optimizations, vectorization, parallelization, hardware specific optimizations such as SIMD. Experience with MLIR, LLVM, IREE, XLA, TVM, Halide is a plus.
OR AI frameworks: Experience in developing training and inference framework components. Experience in system performance optimizations such as runtime analysis of latency, memory bandwidth, I/O access, compute utilization analysis and associated tooling development.
OR AI high performance kernels: Experience with CUDA programming, OpenMP / OpenCL programming or AI hardware accelerator kernel programming. Experience in accelerating libraries on AI hardware, similar to cuBLAS, cuDNN, CUTLASS, HIP, ROCm etc.
A Bachelor's degree in Computer Science, Computer Engineering, or relevant technical field and 7+ years of experience in AI framework development or accelerating deep learning models on hardware architectures, OR a Master's degree in Computer Science, Computer Engineering, or relevant technical field and 4+ years of experience in AI framework development or accelerating deep learning models on hardware architectures, OR a PhD in Computer Science, Computer Engineering, or relevant technical field and 3+ years of experience in AI framework development or accelerating deep learning models on hardware architectures.
Experience working with frameworks like PyTorch, Caffe2, TensorFlow, ONNX, TensorRT
Knowledge of GPU, CPU, or AI hardware accelerator architectures.
Compensation
Public Compensation:
$70.67/hour to $208,000/year + bonus + equity + benefits
Industry
Internet
Equal Opportunity
Meta is proud to be an Equal Employment Opportunity and Affirmative Action employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, transgender status, sexual stereotypes, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics. We also consider qualified applicants with criminal histories, consistent with applicable federal, state and local law. Meta participates in the E-Verify program in certain locations, as required by law. Please note that Meta may leverage artificial intelligence and machine learning technologies in connection with applications for employment. Meta is committed to providing reasonable accommodations for candidates with disabilities in our recruiting process. If you need any assistance or accommodations due to a disability, please let us know at accommodations-ext@fb.com.