Discord
Staff Software Engineer, Machine Learning Platform
Discord, San Francisco, California, United States, 94199
Overview
Discord is used by over 200 million people every month for many different reasons, but there’s one thing that nearly everyone does on our platform: play video games. Over 90% of our users play games, spending a combined 1.5 billion hours playing thousands of unique titles on Discord each month. The Machine Learning Platform team at Discord is responsible for the end to end model development lifecycle across all ML applications. We work closely with machine learning engineers, core infrastructure, and ML consumers to deliver the tools, services, and frameworks that make machine learning training and deployments easy, safe, scalable, and accessible. In this role, you’ll work on everything from training and batch inference pipelines, online model serving, feature stores, event stream processing, and LLM serving and evaluation. You’ll lead projects and partner with ML practitioners and other Staff+ engineers to shape the future of Discord’s data infrastructure. This role reports to the Senior Engineering Manager of the ML Platform team. What You'll Be Doing
Design and build the platform that ML engineers and data scientists rely on to deliver delightful user experiences and keep communities safe Evaluate and integrate new ML frameworks and tools to ensure Discord stays at the forefront of a fast-moving field, including LLMs and generative AI Partner with model builders to help them ship high-quality models quickly and often Champion best practices in feature engineering and ML infrastructure Build and maintain shared ML foundations, including datasets and embedding models, that elevate the overall quality of ML applications What You Should Have
8+ years of experience as a software or ML engineer, working with large-scale data or distributed systems 4+ years of experience building platforms and infrastructure primarily used by engineering stakeholders 2+ years of experience building machine learning platforms that support the full model lifecycle, including orchestration systems (e.g. Dagster, Flyte, or Airflow) Proven ability to establish and execute on clear, actionable, and accurate technical direction beyond the next month or quarter Experience leading technical initiatives across teams with diverse resources, skillsets, and priorities Bonus Points
Experience operating, monitoring, and maintaining large-scale production systems, especially on Kubernetes and/or in large data domains Experience with real-time data processing (e.g. Kafka, Spark, or Flink) Experience building ML models using modern frameworks (e.g. PyTorch, TensorFlow, or JAX) Experience deploying GenAI applications, eval frameworks, or agents Experience managing infrastructure costs by designing for efficiency and scalability Compensation
The US base salary range for this full-time position is $272,000 to $306,000 + equity + benefits. Our salary ranges are determined by role and level. Within the range, individual pay is determined by additional factors, including job-related skills, experience, and relevant education or training. Please note that the compensation details listed in US role postings reflect the base salary only, and do not include equity, or benefits. Why Discord?
Discord plays a uniquely important role in the future of gaming. We’re a multiplatform, multigenerational and multiplayer platform that helps people deepen their friendships around games and shared interests. We believe games give us a way to have fun with our favorite people, whether listening to music together or grinding in competitive matches for diamond rank. Join us in our mission! Your future is just a click away! Discord is committed to inclusion and providing reasonable accommodations during the interview process. If you are in need of reasonable accommodations, please let your recruiter know. Please see our Applicant and Candidate Privacy Policy for details regarding Discord’s collection and usage of personal information relating to the application and recruitment process by clicking HERE.
#J-18808-Ljbffr
Discord is used by over 200 million people every month for many different reasons, but there’s one thing that nearly everyone does on our platform: play video games. Over 90% of our users play games, spending a combined 1.5 billion hours playing thousands of unique titles on Discord each month. The Machine Learning Platform team at Discord is responsible for the end to end model development lifecycle across all ML applications. We work closely with machine learning engineers, core infrastructure, and ML consumers to deliver the tools, services, and frameworks that make machine learning training and deployments easy, safe, scalable, and accessible. In this role, you’ll work on everything from training and batch inference pipelines, online model serving, feature stores, event stream processing, and LLM serving and evaluation. You’ll lead projects and partner with ML practitioners and other Staff+ engineers to shape the future of Discord’s data infrastructure. This role reports to the Senior Engineering Manager of the ML Platform team. What You'll Be Doing
Design and build the platform that ML engineers and data scientists rely on to deliver delightful user experiences and keep communities safe Evaluate and integrate new ML frameworks and tools to ensure Discord stays at the forefront of a fast-moving field, including LLMs and generative AI Partner with model builders to help them ship high-quality models quickly and often Champion best practices in feature engineering and ML infrastructure Build and maintain shared ML foundations, including datasets and embedding models, that elevate the overall quality of ML applications What You Should Have
8+ years of experience as a software or ML engineer, working with large-scale data or distributed systems 4+ years of experience building platforms and infrastructure primarily used by engineering stakeholders 2+ years of experience building machine learning platforms that support the full model lifecycle, including orchestration systems (e.g. Dagster, Flyte, or Airflow) Proven ability to establish and execute on clear, actionable, and accurate technical direction beyond the next month or quarter Experience leading technical initiatives across teams with diverse resources, skillsets, and priorities Bonus Points
Experience operating, monitoring, and maintaining large-scale production systems, especially on Kubernetes and/or in large data domains Experience with real-time data processing (e.g. Kafka, Spark, or Flink) Experience building ML models using modern frameworks (e.g. PyTorch, TensorFlow, or JAX) Experience deploying GenAI applications, eval frameworks, or agents Experience managing infrastructure costs by designing for efficiency and scalability Compensation
The US base salary range for this full-time position is $272,000 to $306,000 + equity + benefits. Our salary ranges are determined by role and level. Within the range, individual pay is determined by additional factors, including job-related skills, experience, and relevant education or training. Please note that the compensation details listed in US role postings reflect the base salary only, and do not include equity, or benefits. Why Discord?
Discord plays a uniquely important role in the future of gaming. We’re a multiplatform, multigenerational and multiplayer platform that helps people deepen their friendships around games and shared interests. We believe games give us a way to have fun with our favorite people, whether listening to music together or grinding in competitive matches for diamond rank. Join us in our mission! Your future is just a click away! Discord is committed to inclusion and providing reasonable accommodations during the interview process. If you are in need of reasonable accommodations, please let your recruiter know. Please see our Applicant and Candidate Privacy Policy for details regarding Discord’s collection and usage of personal information relating to the application and recruitment process by clicking HERE.
#J-18808-Ljbffr