Anthropic
Staff+ Software Engineer - Infrastructure
Anthropic, San Francisco, California, United States, 94199
Overview
Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for users and society. Our team includes researchers, engineers, policy experts, and business leaders building beneficial AI systems. Anthropic is seeking talented Infrastructure Engineers to join our team and support the development, scaling, and maintenance of our cutting-edge AI systems. You will work on frontier models and contribute to building safe, reliable AI systems that benefit humanity. We have multiple teams hiring; team placement occurs after the interview process based on interests, experience, and organizational needs to match engineers with the most impactful teams. Responsibilities
Lead build out of industry-leading AI clusters (thousands to hundreds of thousands of machines), partnering with cloud service providers on cluster build out and features Consult with stakeholders to understand infrastructure, data and compute needs and identify solutions to support frontier research and product development Set technical strategy and oversee development of high-scale, reliable infrastructure systems Mentor top technical talent Design processes (e.g., postmortem reviews, incident response, on-call rotations) to help the team operate effectively and prevent repeated failures You may be a good fit if you
Have 10+ years of relevant industry experience, with 3+ years leading large-scale, complex projects or teams as an engineer or tech lead Are focused on distributed systems at scale, infrastructure reliability, security, and continuous improvement Have strong proficiency in at least one programming language (e.g., Python, Rust, Go, Java) Have strong problem-solving skills and ability to work independently Have a passion for supporting internal partners like research to understand their needs Have excellent communication skills to build consensus with stakeholders Possess deep knowledge of modern cloud infrastructure including Kubernetes, Infrastructure as Code, AWS, and GCP Strong candidates may have
Security and privacy best-practice expertise Experience with ML infrastructure (GPUs, TPUs, Trainium) and supporting networking infrastructure like NCCL Low-level systems experience (e.g., Linux kernel tuning, eBPF) Technical ability to quickly understand systems design tradeoffs in evolving software *This is a pipeline hiring posting across the infrastructure org. Team-specific postings are also available on our career site. Deadline to apply:
None. Applications are reviewed on a rolling basis. The expected base compensation for this position is below. Our total compensation package includes equity, benefits, and may include incentive compensation. $405,000 - $485,000 USD Logistics
Education requirements:
At least a Bachelor's degree in a related field or equivalent experience. Location-based hybrid policy:
Staff are expected to be in one of our offices at least 25% of the time; some roles may require more time in offices. Visa sponsorship:
We sponsor visas where possible and will make reasonable efforts to assist with visa processes if an offer is made. We encourage you to apply even if you do not meet every qualification. We value diverse perspectives and believe AI work benefits from a range of experiences. How we’re different
We pursue high-impact AI research as a cohesive team, value communication, and aim to advance steerable, trustworthy AI. We emphasize collaboration and seek candidates who communicate effectively across teams.
#J-18808-Ljbffr
Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for users and society. Our team includes researchers, engineers, policy experts, and business leaders building beneficial AI systems. Anthropic is seeking talented Infrastructure Engineers to join our team and support the development, scaling, and maintenance of our cutting-edge AI systems. You will work on frontier models and contribute to building safe, reliable AI systems that benefit humanity. We have multiple teams hiring; team placement occurs after the interview process based on interests, experience, and organizational needs to match engineers with the most impactful teams. Responsibilities
Lead build out of industry-leading AI clusters (thousands to hundreds of thousands of machines), partnering with cloud service providers on cluster build out and features Consult with stakeholders to understand infrastructure, data and compute needs and identify solutions to support frontier research and product development Set technical strategy and oversee development of high-scale, reliable infrastructure systems Mentor top technical talent Design processes (e.g., postmortem reviews, incident response, on-call rotations) to help the team operate effectively and prevent repeated failures You may be a good fit if you
Have 10+ years of relevant industry experience, with 3+ years leading large-scale, complex projects or teams as an engineer or tech lead Are focused on distributed systems at scale, infrastructure reliability, security, and continuous improvement Have strong proficiency in at least one programming language (e.g., Python, Rust, Go, Java) Have strong problem-solving skills and ability to work independently Have a passion for supporting internal partners like research to understand their needs Have excellent communication skills to build consensus with stakeholders Possess deep knowledge of modern cloud infrastructure including Kubernetes, Infrastructure as Code, AWS, and GCP Strong candidates may have
Security and privacy best-practice expertise Experience with ML infrastructure (GPUs, TPUs, Trainium) and supporting networking infrastructure like NCCL Low-level systems experience (e.g., Linux kernel tuning, eBPF) Technical ability to quickly understand systems design tradeoffs in evolving software *This is a pipeline hiring posting across the infrastructure org. Team-specific postings are also available on our career site. Deadline to apply:
None. Applications are reviewed on a rolling basis. The expected base compensation for this position is below. Our total compensation package includes equity, benefits, and may include incentive compensation. $405,000 - $485,000 USD Logistics
Education requirements:
At least a Bachelor's degree in a related field or equivalent experience. Location-based hybrid policy:
Staff are expected to be in one of our offices at least 25% of the time; some roles may require more time in offices. Visa sponsorship:
We sponsor visas where possible and will make reasonable efforts to assist with visa processes if an offer is made. We encourage you to apply even if you do not meet every qualification. We value diverse perspectives and believe AI work benefits from a range of experiences. How we’re different
We pursue high-impact AI research as a cohesive team, value communication, and aim to advance steerable, trustworthy AI. We emphasize collaboration and seek candidates who communicate effectively across teams.
#J-18808-Ljbffr