Apple Inc.
Austin, Texas, United States
Software and Services
Description
As an SRE in Apple Ads, you will own the health, performance, and scalability of ad-serving infrastructure and associated platform tooling. Your focus will be on building automation that eliminates manual processes, improves service resilience, and enables teams to move faster with confidence.

The person in this role will:
* Build and operate distributed systems using AWS managed services such as EKS, MSK, and ElastiCache.
* Develop internal tooling and automation frameworks to improve infrastructure reliability, cost-efficiency, and operational visibility.
* Collaborate with engineering teams to define infrastructure architecture, troubleshoot complex issues, and drive production excellence.
* Design and manage Infrastructure as Code with Terraform, ensuring repeatable, secure, and scalable deployments.
* Lead or participate in incident response, postmortems, and continuous improvement cycles to reduce future risk.

This is not a DevOps-only or CI/CD-focused role. We are looking for engineers who build platform solutions, not just configure pipelines.

Minimum Qualifications
* 5+ years of experience supporting internet-facing production systems and distributed cloud infrastructure.
* Strong programming skills in at least one of: Python, Go, or Java.
* Proven expertise with AWS-managed infrastructure, especially:
* Hands-on experience with Linux systems and deep knowledge of their internals.
* Demonstrated experience with Infrastructure as Code, especially Terraform.
* Strong foundation in SRE concepts: monitoring, alerting, and observability; incident response and root cause analysis; error budgets, SLAs/SLOs, and system reliability.

Preferred Qualifications
* Built tools or services that automate platform operations, reduce toil, or improve cost efficiency.
* Experience managing Kubernetes clusters at scale in production environments.
* Hands-on experience troubleshooting distributed systems under real-world load.
* Clear communication skills and comfort collaborating across engineering, infrastructure, and product teams.
* AWS certifications or broad experience across multiple AWS services is a plus.

Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant.