Anthropic
Senior Software Engineer, Infrastructure
Anthropic, San Francisco, California, United States, 94199
About Anthropic
Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.
About the role
Anthropic is seeking talented and experienced Infrastructure Engineers to join our team and support the development, scaling, and maintenance of our cutting-edge AI systems. By joining our Infrastructure team, you will have the opportunity to work on groundbreaking AI technologies and contribute to the development of frontier models, supporting Anthropic's mission to create safe and reliable AI systems that benefit humanity.
We have multiple teams that are currently hiring. Team placement occurs after the interview process, taking into account your interests and experience alongside organizational needs. This flexible approach allows us to match talented engineers with the infrastructure teams where they'll have the greatest impact and growth potential:
Data Infrastructure:
We build and maintain the data systems powering Anthropic's AI research and products. You'll design and optimize data pipelines using tools like Spark, Airflow, and dbt across GCP and AWS.
Core Infrastructure:
The systems team is responsible for supporting some of the largest, most sophisticated clusters in the industry, used to train, research, and ultimately serve AI models.
Runtime Platform:
We build and maintain the infrastructure that monitors the health, performance, and efficiency of our AI systems.
Developer Productivity:
The Developer Productivity team enables Anthropic researchers and engineers to be maximally effective in securely developing state-of-the-art models.
Product Infrastructure:
The Product Infrastructure team enables Anthropic's products to achieve best-in-class performance, reliability, and developer velocity.
Cloud Inference:
We scale and optimize Claude to serve the massive audiences of developers and enterprise companies using AWS and GCP.
Responsibilities:
Lead the build-out of industry-leading AI clusters, partnering closely with cloud service providers on cluster build-out and required features.
Consult with different stakeholders to deeply understand infrastructure, data, and compute needs, identifying potential solutions to support frontier research and product development.
Set technical strategy and oversee development of high-scale, reliable infrastructure systems.
Mentor top technical talent.
Design processes that help the team operate effectively.
You may be a good fit if you:
Have 8+ years of relevant industry experience, with 3+ years leading large-scale, complex projects or teams as an engineer or tech lead.
Are obsessed with distributed systems at scale, infrastructure reliability, scalability, security, and continuous improvement.
Have strong proficiency in at least one programming language (e.g., Python, Rust, Go, Java).
Have strong problem-solving skills and the ability to work independently.
Have a passion for supporting internal partners like research to understand their needs.
Have excellent communication skills to build consensus with stakeholders.
Possess deep knowledge of modern cloud infrastructure including Kubernetes, Infrastructure as Code, AWS, and GCP.
Strong candidates may have:
Security and privacy best practice expertise.
Experience with machine learning infrastructure like GPUs, TPUs, or Trainium.
Low-level systems experience, for example Linux kernel tuning and eBPF.
Technical expertise in understanding systems design tradeoffs.
Deadline to apply:
None. Applications will be reviewed on a rolling basis.
The expected salary range for this position is:
$300,000 - $320,000 USD
Logistics
Education requirements:
We require at least a Bachelor's degree in a related field or equivalent experience.
Location-based hybrid policy:
Currently, we expect all staff to be in one of our offices at least 25% of the time.
Visa sponsorship:
We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate.
We encourage you to apply even if you do not believe you meet every single qualification.
How we're different
We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts.
Come work with us!
Anthropic is a public benefit corporation headquartered in San Francisco. We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues.