Truist

Lead Infrastructure Engineer - Observability

Truist, Atlanta, Georgia, United States, 30383

Overview

Lead Infrastructure Engineer - Observability at Truist. This role focuses on architecting, implementing, and evolving enterprise-grade observability capabilities across the Truist technology landscape, rooted in OpenTelemetry (OTel) and complemented by Prometheus, Grafana, Jaeger, and commercial APM solutions. The position is office-centric, four days a week, in either the Atlanta or Charlotte office.

Responsibilities

Lead the strategy for metrics, traces, and synthetic monitoring to enable end-to-end visibility and accelerated incident response.

Design and adopt a modern, scalable observability platform; standardize telemetry pipelines; embed observability into CI/CD workflows.

Integrate signal-based insights into reliability, performance, and business outcomes; reduce mean-time-to-detect (MTTD) and accelerate root cause analysis.

Collaborate with engineering teams to create a resilient, insight-rich environment that enables confident delivery.

Perform problem tracking, diagnosis, root-cause analysis, replication, troubleshooting, and resolution for complex issues; engage in programming and debugging as needed.

Respond to issues in a timely manner; analyze trends in technical issues and recommend long-term improvements.

Document end-user interactions and steps taken to resolve incidents; communicate status to internal customers.

Engage and manage outside vendors when applicable; provide guidance to teammates and potentially supervise a small team.

Qualifications

Required Qualifications:

Bachelor’s degree and five years of experience in development or application support, or an equivalent combination of education and work experience.

In-depth knowledge of information systems and ability to apply and implement best practices.

Understanding of key business processes and competitive IT strategies.

Ability to plan and manage projects; solve complex problems using best practices.

Ability to provide direction and mentor less experienced teammates; interpret and convey complex information.

Preferred Qualifications:

Bachelor’s degree and six years of experience or equivalent.

Expertise with OpenTelemetry (OTel), including custom instrumentation, collector configuration, and pipeline design for traces, metrics, and logs.

Hands-on experience with observability tooling (Prometheus, Grafana, Jaeger, Loki, Elastic, Splunk, and/or Dynatrace) in enterprise environments.

Strong background in distributed systems, cloud-native architectures, and Kubernetes, with the ability to identify observability gaps across service meshes, APIs, and event-driven platforms.

Proficiency in scripting or development languages (Python, Go, Bash, Java) to automate telemetry integration and create custom exporters.

Proven track record of driving enterprise adoption of observability standards across engineering, SRE, and platform teams.

Benefits

General Description of Available Benefits for Eligible Employees of Truist Financial Corporation: All regular teammates (not temporary or contingent workers) working 20 hours or more per week are eligible for benefits. Benefits include medical, dental, vision, life insurance, disability, accidental death and dismemberment, tax-preferred savings accounts, and a 401k plan. Paid time off includes at least 10 vacation days and 10 sick days (prorated), plus paid holidays. Details are available on Truist’s Benefits site.

Depending on position and division, the role may be eligible for additional plans such as a defined benefit pension, restricted stock units, and/or deferred compensation. Through the hiring process, you’ll learn more about benefits specific to the position and division.

Truist is an Equal Opportunity Employer that does not discriminate on the basis of race, gender, color, religion, citizenship or national origin, age, sexual orientation, gender identity, disability, veteran status, or other protected classifications. Truist is a Drug Free Workplace. EEO is the Law; Pay Transparency Nondiscrimination Provision; E-Verify.

Other Job Details

Seniority level: Not Applicable

Employment type: Full-time

Job function: Information Technology

Referral notices: Referrals increase your chances of interviewing at Truist.
