Hartford
Principal Reliability Engineering – IE06JE
The Hartford’s Corporate / HIMCO IT team is seeking an experienced and highly motivated Principal Engineer to drive Reliability Engineering for multiple applications and implement Gen AI and AI platform capabilities. The principal engineer will build, optimize, and maintain cloud automation to enable infrastructure provisioning, application availability, testing, quality, deployment, resiliency, recovery, and efficiency. The role will also ensure IT security and service hardening implementation. Key success measures include service reliability, technical debt reduction, and cost efficiency.
Responsibilities
Set strategy and advance best-in-class standards, tools, and design practices to enable highly available, high-performance customer-facing applications. Lead adoption of metrics for overall application health – availability, performance, monitoring, alerting, quality, currency, and resiliency.
Act as a technical expert for supported applications and infrastructure, requiring depth and breadth of knowledge in technologies, applications, integration, interfaces, and business domain.
Drive development and implementation of Gen AI and AI platform capabilities, evaluating and selecting AI/ML frameworks, platforms, and tools. Leverage cutting-edge technologies and methodologies to optimize business operations, enhance customer experience, and drive competitive advantage.
Develop strategy to ensure effective tooling, alerts, and response mechanisms to identify and address reliability and security risks, leveraging automation to support problem prevention, detection, mitigation, and resolution.
Develop strategy to enhance SDLC velocity by engineering appropriate solutions to increase delivery speed while adhering to technology standards for sustained reliability.
Identify, define, and implement preventative controls and drive increased automation and self-healing capabilities. Continue to improve cost efficiency baselines.
Lead migration of applications to open source platforms, PaaS, containers, serverless, event-based designs, and other cloud technology standards for cloud-enablement and platform agility.
Set strategy to drive simplification across the stack, ensuring technical designs can be effectively operated in a cost-efficient manner without adding operational complexity.
Lead inner- and open-sourcing practices to accelerate the development of self-service enterprise capabilities.
Design scalable SDLC environments using COTS, PaaS, SaaS products catering to data, application, and infrastructure-based pipeline needs.
Partner with infrastructure teams on strategy to design and implement intelligent automation and orchestration systems, enhanced monitoring/alerting capabilities, and rapid service restoration processes. Take proactive measures to prevent high-impact incidents.
Qualifications
Bachelor’s Degree in Computer Science or related discipline.
10+ years of experience in IT systems analysis, design, application development, IT Operations, and tech leadership.
5+ years in a Reliability Engineer, Multi Stack Engineer or Data Engineer role with managerial accountabilities.
Proven experience with FinOps, FMOps & LLMOps principles.
End-to-end systems thinking with a broad understanding of enterprise architectures and complex distributed systems.
2+ years leading AI/ML engineering organizations with expertise in building and managing large-scale AI, data, and analytics platforms.
Experience with solution architecture oriented toward expedited troubleshooting, issue resolution, and root-cause removal in a hybrid cloud environment.
Proven experience with CI/CD methodologies and tools: GitHub, Jenkins, Nexus, Rally, SonarQube, Jira, Azure DevOps, AWS CodePipeline.
Experience with performance and observability tools: Dynatrace, CloudWatch, CloudTrail, AWS X-Ray, and related tools.
Hybrid cloud experience across IaaS, PaaS, SaaS.
Experience with IaC tools: Terraform, CloudFormation, etc.
Highly collaborative: partners with peers and stakeholders and is passionate about delighting customers.
Strong communicator at all levels in the enterprise, with influence and negotiation skills.
Certifications (choose one or more):
AWS Certified Developer
AWS Certified Solutions Architect
AWS Certified DevOps Engineer
Certified Kubernetes Administrator (CKA)
Certified Kubernetes Application Developer (CKAD)
Compensation
Annualized base pay range: $151,280 – $226,920. Total compensation may include short-term or annual bonuses, long-term incentives, and on-the-spot recognition.
Hybrid Work Schedule
3 days a week (Tuesday through Thursday) in one of the following offices: Columbus, OH; Chicago, IL; Hartford, CT; Charlotte, NC. Candidates must be authorized to work in the US without company sponsorship. The company will not support STEM OPT I-983 training endorsement.
EEO Statement
The Hartford is an Equal Opportunity Employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. The Hartford is an Equal Opportunity Employer/Sex/Race/Color/Veterans/Disability/Sexual Orientation/Gender Identity or Expression/Religion/Age.