Tata Consultancy Services
SRE DevOps Engineer
We are seeking experienced DevOps Engineers with strong Site Reliability Engineering (SRE) capabilities who can work independently, think critically, and contribute immediately to our technical operations. This role requires professionals who can troubleshoot complex issues, write code, and collaborate effectively with development teams to solve problems proactively.
Required Technical Skills Minimum Competency Level: E2 (Medium) across ALL listed technologies
Core DevOps & Infrastructure • Azure DevOps (E2 minimum) - Pipeline creation, management, and optimization • CI/CD (E2 minimum) - End-to-end pipeline design and implementation • AWS (E2 minimum) - EC2, S3, Lambda, RDS, CloudWatch, IAM • Docker (E2 minimum) - Container creation, optimization, and troubleshooting • Kubernetes (E2 minimum) - Cluster management, troubleshooting, and optimization
Development & Automation • Core Java (E2 minimum) - Code review, debugging, performance optimization • Python (E2 minimum) - Automation scripting, tool development, API integration • PowerShell (E2 minimum) - Windows automation and system management • Ansible (E2 minimum) - Configuration management and automation
Quality & Security Tools • JFrog Artifactory (E2 minimum) - Artifact management and repository operations • SonarQube (E2 minimum) - Code quality analysis and remediation
Observability & Monitoring • Application Performance Monitoring: Experience with AppDynamics, Grafana, Zabbix • Modern Observability Platforms: Datadog or Dynatrace experience STRONGLY PREFERRED • Log Analysis: ELK Stack, Splunk, or equivalent • Infrastructure Monitoring: Prometheus, CloudWatch, or similar
Essential Professional Competencies: Problem-Solving & Critical Thinking • Root Cause Analysis: Ability to systematically identify and resolve complex technical issues • Incident Response: Experience with incident management, escalation procedures, and post-mortem analysis • Performance Optimization: Proactive identification and resolution of performance bottlenecks • Capacity Planning: Understanding of resource utilization and scaling strategies Development Collaboration • Code Review & Debugging: Ability to review application code and identify issues • Application Troubleshooting: Work directly with developers to resolve application-level problems • Performance Profiling: Use A PM tools to identify and resolve application performance issues • Security Implementation: Implement and maintain security best practices across the pipeline Self-Direction & Initiative • Independent Problem Solving: Ability to research, analyze, and resolve issues with minimal guidance • Proactive Communication: Regularly communicate status, blockers, and recommendations • Continuous Learning: Stay current with technology trends and best practices • Documentation: Create and maintain clear technical documentation
Behavioral Expectations Work Ethic & Resourcefulness • Self-Motivated: Takes ownership of tasks and sees them through to completion • Resourceful: Uses available resources (documentation, community, colleagues) to find solutions • Proactive: Identifies potential issues before they become problems • Quality-Focused: Delivers work that meets high standards without extensive rework Communication & Collaboration • Clear Communication: Articulates technical concepts clearly to both technical and non-technical stakeholders • Question When Needed: Asks clarifying questions to ensure understanding • Knowledge Sharing: Contributes to team knowledge through documentation and mentoring • Cultural Fit: Works well in a collaborative, fast-paced environment Experience Requirements • Minimum 3-5 years in DevOps/SRE roles • Hands-on experience with modern cloud-native architectures • Proven track record of incident resolution and system optimization • Experience working in Agile environments with rapid deployment cycles Success Criteria (90-day evaluation) 1. Technical Proficiency: Demonstrates competency across all required tools 2. Problem Resolution: Successfully resolves incidents and technical issues independently 3. Code Contributions: Makes meaningful contributions to automation and tooling 4. Team Integration: Effectively collaborates with existing team members 5. Process Improvement: Identifies and implements improvements to existing processes Preferred Certifications • AWS Certified Solutions Architect or DevOps Engineer • Microsoft Azure DevOps Engineer Expert • Kubernetes certifications (CKA, CKAD) • Datadog or Dynatrace certifications
Salary Range- $90,000-$100,000 a year
#LI-SP3 #LI-VX1
We are seeking experienced DevOps Engineers with strong Site Reliability Engineering (SRE) capabilities who can work independently, think critically, and contribute immediately to our technical operations. This role requires professionals who can troubleshoot complex issues, write code, and collaborate effectively with development teams to solve problems proactively.
Required Technical Skills Minimum Competency Level: E2 (Medium) across ALL listed technologies
Core DevOps & Infrastructure • Azure DevOps (E2 minimum) - Pipeline creation, management, and optimization • CI/CD (E2 minimum) - End-to-end pipeline design and implementation • AWS (E2 minimum) - EC2, S3, Lambda, RDS, CloudWatch, IAM • Docker (E2 minimum) - Container creation, optimization, and troubleshooting • Kubernetes (E2 minimum) - Cluster management, troubleshooting, and optimization
Development & Automation • Core Java (E2 minimum) - Code review, debugging, performance optimization • Python (E2 minimum) - Automation scripting, tool development, API integration • PowerShell (E2 minimum) - Windows automation and system management • Ansible (E2 minimum) - Configuration management and automation
Quality & Security Tools • JFrog Artifactory (E2 minimum) - Artifact management and repository operations • SonarQube (E2 minimum) - Code quality analysis and remediation
Observability & Monitoring • Application Performance Monitoring: Experience with AppDynamics, Grafana, Zabbix • Modern Observability Platforms: Datadog or Dynatrace experience STRONGLY PREFERRED • Log Analysis: ELK Stack, Splunk, or equivalent • Infrastructure Monitoring: Prometheus, CloudWatch, or similar
Essential Professional Competencies: Problem-Solving & Critical Thinking • Root Cause Analysis: Ability to systematically identify and resolve complex technical issues • Incident Response: Experience with incident management, escalation procedures, and post-mortem analysis • Performance Optimization: Proactive identification and resolution of performance bottlenecks • Capacity Planning: Understanding of resource utilization and scaling strategies Development Collaboration • Code Review & Debugging: Ability to review application code and identify issues • Application Troubleshooting: Work directly with developers to resolve application-level problems • Performance Profiling: Use A PM tools to identify and resolve application performance issues • Security Implementation: Implement and maintain security best practices across the pipeline Self-Direction & Initiative • Independent Problem Solving: Ability to research, analyze, and resolve issues with minimal guidance • Proactive Communication: Regularly communicate status, blockers, and recommendations • Continuous Learning: Stay current with technology trends and best practices • Documentation: Create and maintain clear technical documentation
Behavioral Expectations Work Ethic & Resourcefulness • Self-Motivated: Takes ownership of tasks and sees them through to completion • Resourceful: Uses available resources (documentation, community, colleagues) to find solutions • Proactive: Identifies potential issues before they become problems • Quality-Focused: Delivers work that meets high standards without extensive rework Communication & Collaboration • Clear Communication: Articulates technical concepts clearly to both technical and non-technical stakeholders • Question When Needed: Asks clarifying questions to ensure understanding • Knowledge Sharing: Contributes to team knowledge through documentation and mentoring • Cultural Fit: Works well in a collaborative, fast-paced environment Experience Requirements • Minimum 3-5 years in DevOps/SRE roles • Hands-on experience with modern cloud-native architectures • Proven track record of incident resolution and system optimization • Experience working in Agile environments with rapid deployment cycles Success Criteria (90-day evaluation) 1. Technical Proficiency: Demonstrates competency across all required tools 2. Problem Resolution: Successfully resolves incidents and technical issues independently 3. Code Contributions: Makes meaningful contributions to automation and tooling 4. Team Integration: Effectively collaborates with existing team members 5. Process Improvement: Identifies and implements improvements to existing processes Preferred Certifications • AWS Certified Solutions Architect or DevOps Engineer • Microsoft Azure DevOps Engineer Expert • Kubernetes certifications (CKA, CKAD) • Datadog or Dynatrace certifications
Salary Range- $90,000-$100,000 a year
#LI-SP3 #LI-VX1