Voya Financial
Senior Site Reliability Engineer (SRE)
Voya Financial, Atlanta, Georgia, United States, 30383
Senior Site Reliability Engineer (SRE)
Voya Financial is seeking an experienced Site Reliability Engineer to drive the scalability, reliability, and security of our platforms. The role blends software engineering, infrastructure, and AI systems to support both traditional services and AI‑driven workloads.
Key Responsibilities
Design, build, and maintain scalable infrastructure and automation tools for traditional and AI‑based systems. Develop software solutions to improve system reliability and reduce manual toil. Implement and manage CI/CD pipelines, including model deployment workflows. Monitor system performance, availability, and security using modern observability tools. Collaborate with data science and ML engineering teams to support AI/ML model training, serving, and lifecycle management. Lead incident response, root cause analysis, and post‑mortem processes. Advocate for SRE principles across engineering and AI teams. Qualifications
5+ years of experience in SRE, DevOps, or software engineering roles. Strong programming skills in languages such as Python, Java, etc. Experience supporting AI/ML workloads (e.g., model training, inference, GPU orchestration). Deep understanding of Linux systems, cloud platforms (primarily Azure, AWS), and container orchestration. Experience with infrastructure‑as‑code tools (Terraform, Ansible, GitHub, etc.). Proficiency in monitoring and logging tools (Dynatrace, etc.). Solid grasp of networking, security, and distributed systems. Excellent communication and collaboration skills. Nice to Have
Experience with AI model observability, drift detection, or performance monitoring. Contributions to open‑source SRE, DevOps, or ML infrastructure tools. Certifications in cloud platforms. Benefits
Health, dental, vision, and life insurance plans. 401(k) savings plan with generous company matching contributions (up to 6%). Voya retirement plan – employer‑paid cash balance retirement plan (4%). Tuition reimbursement up to $5,250/year. Paid time off – 20 days paid time off, nine paid company holidays, flexible Diversity Celebration Day. Paid volunteer time – 40 hours per calendar year. Annual base salary range: $94,370 – $117,960 USD (location dependent). Equal Employment Opportunity
Voya Financial is an equal‑opportunity employer. Voya Financial provides equal opportunity to qualified individuals regardless of race, color, sex, national origin, citizenship status, religion, age, disability, veteran status, creed, marital status, sexual orientation, gender identity, genetic information, or any other status protected by state or local law. Reasonable Accommodations
Voya is committed to the inclusion of all qualified individuals. As part of this commitment, Voya will ensure that persons with disabilities are provided reasonable accommodations. If reasonable accommodation is needed to participate in the job application or interview process, to perform essential job functions, and/or to receive other benefits and privileges of employment, please reference resources for applicants with disabilities.
#J-18808-Ljbffr
Design, build, and maintain scalable infrastructure and automation tools for traditional and AI‑based systems. Develop software solutions to improve system reliability and reduce manual toil. Implement and manage CI/CD pipelines, including model deployment workflows. Monitor system performance, availability, and security using modern observability tools. Collaborate with data science and ML engineering teams to support AI/ML model training, serving, and lifecycle management. Lead incident response, root cause analysis, and post‑mortem processes. Advocate for SRE principles across engineering and AI teams. Qualifications
5+ years of experience in SRE, DevOps, or software engineering roles. Strong programming skills in languages such as Python, Java, etc. Experience supporting AI/ML workloads (e.g., model training, inference, GPU orchestration). Deep understanding of Linux systems, cloud platforms (primarily Azure, AWS), and container orchestration. Experience with infrastructure‑as‑code tools (Terraform, Ansible, GitHub, etc.). Proficiency in monitoring and logging tools (Dynatrace, etc.). Solid grasp of networking, security, and distributed systems. Excellent communication and collaboration skills. Nice to Have
Experience with AI model observability, drift detection, or performance monitoring. Contributions to open‑source SRE, DevOps, or ML infrastructure tools. Certifications in cloud platforms. Benefits
Health, dental, vision, and life insurance plans. 401(k) savings plan with generous company matching contributions (up to 6%). Voya retirement plan – employer‑paid cash balance retirement plan (4%). Tuition reimbursement up to $5,250/year. Paid time off – 20 days paid time off, nine paid company holidays, flexible Diversity Celebration Day. Paid volunteer time – 40 hours per calendar year. Annual base salary range: $94,370 – $117,960 USD (location dependent). Equal Employment Opportunity
Voya Financial is an equal‑opportunity employer. Voya Financial provides equal opportunity to qualified individuals regardless of race, color, sex, national origin, citizenship status, religion, age, disability, veteran status, creed, marital status, sexual orientation, gender identity, genetic information, or any other status protected by state or local law. Reasonable Accommodations
Voya is committed to the inclusion of all qualified individuals. As part of this commitment, Voya will ensure that persons with disabilities are provided reasonable accommodations. If reasonable accommodation is needed to participate in the job application or interview process, to perform essential job functions, and/or to receive other benefits and privileges of employment, please reference resources for applicants with disabilities.
#J-18808-Ljbffr