Tata Consultancy Services
Cloud and Datacenter Engineer
Tata Consultancy Services, Culver City, California, United States, 90232
Infrastructure, DevOps & Reliability Engineer
Salary Range: $100,000-$140,000 a year
Seniority level: Mid-Senior level
Employment type: Full-time
Job function: Information Technology
Industries: IT Services and IT Consulting
Experience: 10+ Years in IT Infrastructure and Cloud
Roles & Responsibilities
Highly experienced professional with over 10 years in leadership roles, specializing as a Subject Matter Expert (SME) in cloud and data center technologies. The role is responsible for defining and overseeing cloud infrastructure strategies, managing data center operations, and providing expert technical guidance to ensure optimized, scalable, and secure solutions.
This individual will act as the technical authority, driving innovation, operational excellence, and strategic alignment across cloud and data center initiatives.
Must Have Technical/Functional Skills
Extensive experience with cloud platforms (AWS, Azure, GCP) and hybrid environments.
Deep knowledge of data center technologies, including networking, storage, virtualization, and server infrastructure.
Hands-on experience with automation tools, orchestration, and monitoring solutions.
Strong understanding of security, compliance, and disaster recovery frameworks.
Deep knowledge of cloud platforms (AWS, Azure, GCP) including IaaS, PaaS, and hybrid cloud architectures.
Strong understanding of data center technologies: networking, storage, virtualization, compute, and server infrastructure.
Hands-on experience with cloud migration, integration, and hybrid cloud strategies.
Expertise in disaster recovery, high availability, and business continuity planning.
Proven experience in leading technical teams and mentoring engineers.
Ability to define, implement, and execute infrastructure strategies aligned with business goals.
Capability to influence and guide senior stakeholders on technical decisions.
Experience managing large-scale data center operations and cloud infrastructure.
Knowledge of ITIL, operational best practices, and incident/problem management.
Awareness of compliance, regulatory, and security standards (ISO, SOC, GDPR).
Excellent verbal and written communication skills to interact with technical and non-technical stakeholders.
Strong collaboration skills for working with cross-functional teams including architects, project managers, and vendors.
Windows Server Administration
Installation, configuration, and administration of Windows Server (2016 / 2019 / 2022).
Managing Active Directory, DNS, DHCP, Group Policies, and local security policies.
System performance monitoring, service management, event log analysis, and OS troubleshooting.
Hands-on experience in user and permission management, drive mapping, and file share setup.
Linux Server Administration
Installation, configuration, and patching of Linux servers (RHEL / CentOS / Ubuntu).
Proficiency with Linux commands: top, df, du, free, ps, systemctl, journalctl, etc.
User/group management, file system management, and disk partitioning.
Managing network configurations, service restarts, and OS-level performance optimization.
Perform day-to-day administration, operation, and maintenance of Windows Server (2016/2019/2022) and Linux (RHEL/CentOS/Ubuntu) platforms.
Handle user, group, and permission management, service configuration, and routine health checks.
Manage Active Directory, DNS, DHCP, GPOs, file shares, and print servers in Windows environments.
For Linux systems, manage system services, process monitoring, log review, and cron jobs.
Implement OS-level security baselines and hardening guidelines.
Plan, schedule, and execute Windows and Linux patching cycles in alignment with change management processes.
Troubleshoot issues related to VM performance, host connectivity, or storage utilization.
Coordinate with storage and network teams for VM migrations, expansions, and backup integrations.
Monitor and manage server backups using tools such as Commvault or Pure.
Verify backup completion and perform periodic restore tests to validate recoverability.
Respond to and resolve server-related incidents raised via ITSM tools (ServiceNow, Remedy, etc.).
Experience with Infrastructure-as-Code tools (Terraform, Ansible, CloudFormation).
Scripting skills in Python, PowerShell, or Bash for automating operational tasks.
Exposure to AI/ML workloads in cloud environments.
Familiarity with edge computing or hyperconverged infrastructure solutions.
Experience in managing large-scale infrastructure projects, including budgeting and resource planning.
Knowledge of Agile/DevOps methodologies applied to infrastructure teams.