TekLeaders, Inc
SRE Lead (Only W2 Resources)
Location: Cincinnati, OH (Onsite)
Contract role.
Role Description As a Site Reliability Engineer – Lead, you will drive the reliability, scalability, and performance of mission‑critical systems and services while leading a team of SREs. This role combines deep technical expertise with leadership, mentoring, and strategic planning. You will set standards for operational excellence, guide incident response, and foster a culture of automation and continuous improvement. Collaboration with engineering, operations, and product teams is essential to align reliability initiatives with business objectives and ensure seamless service delivery.
Required Skills
Proven experience in site reliability, DevOps, or systems engineering, with prior leadership or team lead responsibilities
Strong programming/scripting skills (e.g., Python, Go, Bash, or similar)
Deep expertise in Linux/Unix system administration and networking
Experience architecting and operating cloud platforms (AWS, Azure, Google Cloud Platform)
Proficiency with infrastructure‑as‑code and automation tools (e.g., Terraform, Ansible, CloudFormation)
Advanced knowledge of monitoring, logging, and alerting solutions (e.g., Prometheus, Grafana, ELK, Datadog)
Nice‑to‑have Skills
Reliability Strategy & Architecture: Technical expertise, hands‑on experience, and ability to lead the development team.
Incident & Problem Management: Oversee incident response, root cause analysis, and post‑mortem processes
Contact Raja Borra
5151 Headquarters Dr, Suite # 105, Plano, TX 75024.
#J-18808-Ljbffr
Contract role.
Role Description As a Site Reliability Engineer – Lead, you will drive the reliability, scalability, and performance of mission‑critical systems and services while leading a team of SREs. This role combines deep technical expertise with leadership, mentoring, and strategic planning. You will set standards for operational excellence, guide incident response, and foster a culture of automation and continuous improvement. Collaboration with engineering, operations, and product teams is essential to align reliability initiatives with business objectives and ensure seamless service delivery.
Required Skills
Proven experience in site reliability, DevOps, or systems engineering, with prior leadership or team lead responsibilities
Strong programming/scripting skills (e.g., Python, Go, Bash, or similar)
Deep expertise in Linux/Unix system administration and networking
Experience architecting and operating cloud platforms (AWS, Azure, Google Cloud Platform)
Proficiency with infrastructure‑as‑code and automation tools (e.g., Terraform, Ansible, CloudFormation)
Advanced knowledge of monitoring, logging, and alerting solutions (e.g., Prometheus, Grafana, ELK, Datadog)
Nice‑to‑have Skills
Reliability Strategy & Architecture: Technical expertise, hands‑on experience, and ability to lead the development team.
Incident & Problem Management: Oversee incident response, root cause analysis, and post‑mortem processes
Contact Raja Borra
5151 Headquarters Dr, Suite # 105, Plano, TX 75024.
#J-18808-Ljbffr