Logo
Tata Consultancy Services

Site Reliability Engineer

Tata Consultancy Services, Irving, Texas, United States, 75084

Save Job

Overview

Join to apply for the

Site Reliability Engineer

role at

Tata Consultancy Services . Be among the first applicants in Irving, TX. Responsibilities

Ensure high availability, scalability, and reliability of OpenShift clusters across production and non-production environments. Monitor system performance, resource utilization, and proactively address bottlenecks or failures. Implement and maintain observability tools (Prometheus, Grafana, ELK, etc.) for real-time monitoring and alerting. Automation & Infrastructure as Code

Develop and maintain automation scripts using Ansible, Terraform, or Helm for provisioning and managing OpenShift resources. Automate routine operational tasks such as deployments, scaling, patching, and backups. Incident Management & Root Cause Analysis

Respond to incidents, perform impact analysis, and drive resolution within defined SLAs. Conduct post-incident reviews and implement corrective actions to prevent recurrence. Security & Compliance

Enforce security best practices across OpenShift clusters including RBAC, network policies, and vulnerability patching. Ensure compliance with internal and external regulatory requirements (e.g., PCI-DSS, GDPR). CI/CD Pipeline Support

Collaborate with DevOps and application teams to integrate CI/CD pipelines with OpenShift. Support containerized application deployments using Jenkins, GitLab CI, or Tekton. Capacity Planning & Cost Optimization

Analyze usage trends and forecast capacity needs to support business growth. Optimize resource allocation and cloud spend through efficient cluster management. Upgrades & Maintenance

Plan and execute OpenShift platform upgrades, including cluster versioning and operator lifecycle management. Maintain compatibility with underlying infrastructure (VMs, storage, networking). Documentation & Knowledge Sharing

Maintain detailed documentation of architecture, configurations, runbooks, and troubleshooting guides. Mentor junior engineers and contribute to internal knowledge bases. Required Skills & Experience

5+ years of experience in SRE, DevOps, or Cloud Infrastructure roles. 2+ years of hands-on experience with Red Hat OpenShift (v4.x preferred). Strong knowledge of Kubernetes, container orchestration, and Linux systems. Experience with cloud platforms (AWS, Azure, GCP) and hybrid cloud setups. Proficiency in scripting languages (Bash, Python, Go) and automation tools. Familiarity with GitOps practices and tools like ArgoCD or Flux. Preferred Qualifications

Red Hat Certified Specialist in OpenShift Administration. Experience in BFSI or regulated industries with high availability and compliance requirements. Exposure to service mesh (Istio), API gateways, and ingress controllers. Understanding of MongoDB, Oracle, or other enterprise databases in containerized environments. Salary Range: $110,000 to $150,000 per year Qualifications:

BACHELOR OF COMPUTER SCIENCE Seniority level

Mid-Senior level Employment type

Full-time Job function

Engineering and Information Technology Industries

IT Services and IT Consulting Referrals increase your chances of interviewing at Tata Consultancy Services by 2x Get notified about new Site Reliability Engineer jobs in

Irving, TX .

#J-18808-Ljbffr