UnitedHealth Group
Software Engineering Lead-DevOps, Cloud, ADB, Splunk and Grafana
UnitedHealth Group, Indiana, Pennsylvania, us, 15705
Optum is a global organization that delivers care, aided by technology to help millions of people live healthier lives. The work you do with our team will directly improve health outcomes by connecting people with the care, pharmacy benefits, data and resources they need to feel their best. Here, you will find a culture guided by inclusion, talented peers, comprehensive benefits and career development opportunities. Come make an impact on the communities we serve as you help us advance health optimization on a global scale. Join us to start
Caring. Connecting. Growing together.
Primary Responsibilities
Drive reliability, scalability, and performance across our systems, with a solid focus on leveraging AI and automation
Implement AI/ML models for predictive alerting, anomaly detection, and capacity planning
Integrate AI tools into incident management workflows to reduce MTTR and improve root cause analysis
Drive adoption of AI-powered observability platforms
Design and implement cloud-native solutions using AWS and GCP services
Architect scalable, resilient, and secure infrastructure using Infrastructure as Code (IaC) tools like Terraform or CloudFormation
Collaborate with development, DevOps, and security teams to integrate cloud solutions into CI/CD pipelines
Architectural experience on performing SRE activities on their own
Develop and enforce security policies, standards, and procedures
Monitor cloud environments and optimize performance, cost, and reliability
Triage and RCA of production incidents and management
Provide technical leadership and mentorship to junior engineers
Stay current with cloud trends and recommend best practices and new technologies
Required Qualifications
Undergraduate degree or equivalent experience
8+ years of experience in SRE, DevOps, or infrastructure engineering
2+ years in a leadership role managing SRE or platform teams
4+ years of solid understanding of cloud security (AWS, GCP, Azure), network security, and application security
Experience with monitoring and alerting tools, especially those with AI capabilities
Experience implementing AI/ML models for operational intelligence, observability and automation
Hands-on experience with security tools and platforms
Good experience on infrastructure architecture
Knowledge of AIOps platforms and frameworks
Proven excellent communication and stakeholder management skills
Preferred Qualifications
Knowledge of cloud principles
Knowledge of software security principles
At UnitedHealth Group, our mission is to help people live healthier lives and make the health system work better for everyone. We believe everyone-of every race, gender, sexuality, age, location and income-deserves the opportunity to live their healthiest life. Today, however, there are still far too many barriers to good health which are disproportionately experienced by people of color, historically marginalized groups and those with lower incomes. We are committed to mitigating our impact on the environment and enabling and delivering equitable care that addresses health disparities and improves health outcomes - an enterprise priority reflected in our mission.
#J-18808-Ljbffr
Caring. Connecting. Growing together.
Primary Responsibilities
Drive reliability, scalability, and performance across our systems, with a solid focus on leveraging AI and automation
Implement AI/ML models for predictive alerting, anomaly detection, and capacity planning
Integrate AI tools into incident management workflows to reduce MTTR and improve root cause analysis
Drive adoption of AI-powered observability platforms
Design and implement cloud-native solutions using AWS and GCP services
Architect scalable, resilient, and secure infrastructure using Infrastructure as Code (IaC) tools like Terraform or CloudFormation
Collaborate with development, DevOps, and security teams to integrate cloud solutions into CI/CD pipelines
Architectural experience on performing SRE activities on their own
Develop and enforce security policies, standards, and procedures
Monitor cloud environments and optimize performance, cost, and reliability
Triage and RCA of production incidents and management
Provide technical leadership and mentorship to junior engineers
Stay current with cloud trends and recommend best practices and new technologies
Required Qualifications
Undergraduate degree or equivalent experience
8+ years of experience in SRE, DevOps, or infrastructure engineering
2+ years in a leadership role managing SRE or platform teams
4+ years of solid understanding of cloud security (AWS, GCP, Azure), network security, and application security
Experience with monitoring and alerting tools, especially those with AI capabilities
Experience implementing AI/ML models for operational intelligence, observability and automation
Hands-on experience with security tools and platforms
Good experience on infrastructure architecture
Knowledge of AIOps platforms and frameworks
Proven excellent communication and stakeholder management skills
Preferred Qualifications
Knowledge of cloud principles
Knowledge of software security principles
At UnitedHealth Group, our mission is to help people live healthier lives and make the health system work better for everyone. We believe everyone-of every race, gender, sexuality, age, location and income-deserves the opportunity to live their healthiest life. Today, however, there are still far too many barriers to good health which are disproportionately experienced by people of color, historically marginalized groups and those with lower incomes. We are committed to mitigating our impact on the environment and enabling and delivering equitable care that addresses health disparities and improves health outcomes - an enterprise priority reflected in our mission.
#J-18808-Ljbffr