Truist
Overview
Lead Infrastructure Engineer at Truist. The position description describes a lead observability engineer responsible for architecting, implementing, and evolving enterprise-grade observability capabilities across the Truist technology landscape. The role leads the design and adoption of a modern, scalable observability platform rooted in OpenTelemetry (Otel) and enriched by Prometheus, Grafana, Jaeger, and commercial APM solutions. The goal is to drive metrics, traces, and synthetic monitoring to enable end-to-end visibility, faster incident response, and a better developer experience. You will promote proactive, intelligence-driven observability and standardize telemetry pipelines, embed observability into CI/CD workflows, and integrate insights into reliability, performance, and business outcomes, aiming to reduce MTTR and MTTD and empower engineering teams." Responsibilities
Performs problem tracking, diagnosis and root-cause analysis, replication, troubleshooting, and resolution for complex issues. In this capacity, performs programming and debugging activities. Responds to issues in a timely manner by receiving and investigating incidents or service tickets. Analyzes and observes trends with technical issues and develops recommendations for long-term improvements. Documents all relevant end-user interactions and steps taken to resolve incidents. Has occasional contact with end-users. Communicates status of issue resolution to internal customers. May engage and manage outside vendors. Applies in-depth knowledge of application support and an understanding of best practices. Typically leads moderately complex projects and participates in larger, more complex initiatives. Solves complex technical and operational problems. Acts as a resource for teammates with less experience. May have people management responsibilities for a small team. Qualifications
Required Qualifications: Bachelor’s degree and five years of experience in development or application support or an equivalent combination of education and work experience. In-depth knowledge in information systems and ability to identify, apply, and implement best practices. Understanding of key business processes and competitive strategies related to the IT function. Ability to plan and manage projects. Ability to solve complex problems by applying best practices. Ability to provide direction and mentor less experienced teammates. Ability to interpret and convey complex, difficult, or sensitive information. Preferred Qualifications: Bachelor’s degree and six years of experience or an equivalent combination of education and work experience. Expertise with OpenTelemetry (Otel), including custom instrumentation, collector configuration, and pipeline design for traces, metrics, and logs. Hands-on experience with observability tooling, such as Prometheus, Grafana, Jaeger, Loki, Elastic, Splunk, and/or Dynatrace in enterprise-grade environments. Strong background in distributed systems, cloud-native architectures, and Kubernetes, with the ability to identify observability gaps across service meshes, APIs, and event-driven platforms. Proficiency in scripting or development languages (e.g., Python, Go, Bash, or Java) to automate telemetry integration, create custom exporters, and contribute to platform tooling. Proven track record of driving enterprise adoption of observability standards and practices, including influencing telemetry strategies across engineering, SRE, and platform teams. Other Job Requirements / Working Conditions
Sitting:
Frequently (25% - 50% of the time) Lifting:
Up to 25 lbs. Visual / Audio / Speaking:
Able to access and interpret client information received from the computer and able to hear and speak with individuals in person and on the phone. Manual Dexterity / Keyboarding:
Able to work standard office equipment, including PC keyboard and mouse, copy/fax machines, and printers. Availability:
Able to work all hours scheduled, including overtime as directed by manager/supervisor and required by business need. Travel:
Up to 25% Benefits
General description of available benefits for eligible employees of Truist Financial Corporation: All regular teammates (not temporary or contingent workers) working 20 hours or more per week are eligible for benefits. Truist offers medical, dental, vision, life insurance, disability, accidental death and dismemberment, tax-preferred savings accounts, and a 401k plan. Teammates also receive no less than 10 days of vacation (prorated based on date of hire and status) during their first year, along with 10 sick days (also prorated), and paid holidays. For more details, please visit Truist’s Benefits site. Depending on the position and division, this role may be eligible for Truist’s defined benefit pension plan, restricted stock units, and/or a deferred compensation plan. As you advance through the hiring process, you will learn more about the specific benefits available for any non-temporary position based on status, position, and division of work. Equal Employment Opportunity
statement: Truist is an Equal Opportunity Employer that does not discriminate on the basis of race, gender, color, religion, citizenship or national origin, age, sexual orientation, gender identity, disability, veteran status, or other classification protected by law. Truist is a Drug Free Workplace. Notes:
EEO is the Law; Pay Transparency Nondiscrimination Provision; E-Verify.
#J-18808-Ljbffr
Lead Infrastructure Engineer at Truist. The position description describes a lead observability engineer responsible for architecting, implementing, and evolving enterprise-grade observability capabilities across the Truist technology landscape. The role leads the design and adoption of a modern, scalable observability platform rooted in OpenTelemetry (Otel) and enriched by Prometheus, Grafana, Jaeger, and commercial APM solutions. The goal is to drive metrics, traces, and synthetic monitoring to enable end-to-end visibility, faster incident response, and a better developer experience. You will promote proactive, intelligence-driven observability and standardize telemetry pipelines, embed observability into CI/CD workflows, and integrate insights into reliability, performance, and business outcomes, aiming to reduce MTTR and MTTD and empower engineering teams." Responsibilities
Performs problem tracking, diagnosis and root-cause analysis, replication, troubleshooting, and resolution for complex issues. In this capacity, performs programming and debugging activities. Responds to issues in a timely manner by receiving and investigating incidents or service tickets. Analyzes and observes trends with technical issues and develops recommendations for long-term improvements. Documents all relevant end-user interactions and steps taken to resolve incidents. Has occasional contact with end-users. Communicates status of issue resolution to internal customers. May engage and manage outside vendors. Applies in-depth knowledge of application support and an understanding of best practices. Typically leads moderately complex projects and participates in larger, more complex initiatives. Solves complex technical and operational problems. Acts as a resource for teammates with less experience. May have people management responsibilities for a small team. Qualifications
Required Qualifications: Bachelor’s degree and five years of experience in development or application support or an equivalent combination of education and work experience. In-depth knowledge in information systems and ability to identify, apply, and implement best practices. Understanding of key business processes and competitive strategies related to the IT function. Ability to plan and manage projects. Ability to solve complex problems by applying best practices. Ability to provide direction and mentor less experienced teammates. Ability to interpret and convey complex, difficult, or sensitive information. Preferred Qualifications: Bachelor’s degree and six years of experience or an equivalent combination of education and work experience. Expertise with OpenTelemetry (Otel), including custom instrumentation, collector configuration, and pipeline design for traces, metrics, and logs. Hands-on experience with observability tooling, such as Prometheus, Grafana, Jaeger, Loki, Elastic, Splunk, and/or Dynatrace in enterprise-grade environments. Strong background in distributed systems, cloud-native architectures, and Kubernetes, with the ability to identify observability gaps across service meshes, APIs, and event-driven platforms. Proficiency in scripting or development languages (e.g., Python, Go, Bash, or Java) to automate telemetry integration, create custom exporters, and contribute to platform tooling. Proven track record of driving enterprise adoption of observability standards and practices, including influencing telemetry strategies across engineering, SRE, and platform teams. Other Job Requirements / Working Conditions
Sitting:
Frequently (25% - 50% of the time) Lifting:
Up to 25 lbs. Visual / Audio / Speaking:
Able to access and interpret client information received from the computer and able to hear and speak with individuals in person and on the phone. Manual Dexterity / Keyboarding:
Able to work standard office equipment, including PC keyboard and mouse, copy/fax machines, and printers. Availability:
Able to work all hours scheduled, including overtime as directed by manager/supervisor and required by business need. Travel:
Up to 25% Benefits
General description of available benefits for eligible employees of Truist Financial Corporation: All regular teammates (not temporary or contingent workers) working 20 hours or more per week are eligible for benefits. Truist offers medical, dental, vision, life insurance, disability, accidental death and dismemberment, tax-preferred savings accounts, and a 401k plan. Teammates also receive no less than 10 days of vacation (prorated based on date of hire and status) during their first year, along with 10 sick days (also prorated), and paid holidays. For more details, please visit Truist’s Benefits site. Depending on the position and division, this role may be eligible for Truist’s defined benefit pension plan, restricted stock units, and/or a deferred compensation plan. As you advance through the hiring process, you will learn more about the specific benefits available for any non-temporary position based on status, position, and division of work. Equal Employment Opportunity
statement: Truist is an Equal Opportunity Employer that does not discriminate on the basis of race, gender, color, religion, citizenship or national origin, age, sexual orientation, gender identity, disability, veteran status, or other classification protected by law. Truist is a Drug Free Workplace. Notes:
EEO is the Law; Pay Transparency Nondiscrimination Provision; E-Verify.
#J-18808-Ljbffr