EPAM Systems
Senior Cloud Engineer (Gcp) (Los Angeles)
EPAM Systems, Los Angeles, California, United States, 90079
Senior Cloud EngineerWe are seeking a highly skilled Senior Cloud Engineer to join EPAM's team and help optimize and manage cloud infrastructure environments.
This role involves providing technical leadership, designing scalable and secure solutions, and driving strategic cloud initiatives.
The idóneo candidate will have deep expertise in Google Cloud Platform (GCP), advanced automation skills using Infrastructure as Code (IaC), and the ability to mentor team members while leading operational improvements.ResponsibilitiesArchitect, provision, and manage GCP cloud resources, ensuring scalability, security, and best practicesDesign reusable infrastructure solutions with Terraform and implement automation for streamlined resource provisioningOversee GKE node operating system upgrades, container OS version management, and patching processesDrive enhancements in infrastructure performance, automation, and resource optimizationLead 24/7 monitoring efforts to proactively detect and resolve performance and availability issuesDevelop operational procedures and automation to prevent recurring problems while addressing complex incidentsImplement and validate backup schedules, retention policies, and restoration processesEnsure compliance with backup governance standards and troubleshoot backup-related failuresEnforce cloud governance policies, including tagging frameworks, access management, and utilization best practicesBuild automated workflows to improve governance and simplify infrastructure managementProvide architectural guidance and recommendations to enhance application resiliency and infrastructure performanceCollaborate with customer teams to address cloud infrastructure challenges and deliver scalable solutionsManage escalations to GCP and oversee the resolution of critical technical issuesAnalyze infrastructure utilization, incident patterns, and service requests to drive cost optimization and efficiency initiativesSupport strategic cloud initiatives, including onboarding processes and alignment with customer change management requirementsRequirementsAt least 3 years of experience in cloud engineering or related rolesExtensive expertise in Google Cloud Platform (GCP), including provisioning, automation, and infrastructure managementAdvanced knowledge of Infrastructure as Code (IaC) tools like Terraform and experience creating reusable automation solutionsStrong proficiency in cloud monitoring, incident management, and performance optimization techniquesIn-depth understanding of governance processes, including access controls, tagging frameworks, compliance standards, and backup/recovery strategiesExperience managing containerized environments such as GKE and Docker, including patching and maintenanceProven leadership skills with the ability to mentor junior engineers and facilitate cross-team collaborationFamiliarity with cost optimization principles for cloud environments, including analyzing utilization trends and driving efficiencyFluent English skills, both written and spoken, at a B2+ level or higherBenefitsInternational projects with top brandsWork with global teams of highly skilled, diverse peersEmployee financial programsPaid time off and sick leaveUpskilling, reskilling and certification coursesUnlimited access to the LinkedIn Learning library and 22,000+ coursesGlobal career opportunitiesVolunteer and community involvement opportunitiesEPAM Employee GroupsAward-winning culture recognized by Glassdoor, Newsweek and LinkedIn
#J-*****-Ljbffr #J-18808-Ljbffr
This role involves providing technical leadership, designing scalable and secure solutions, and driving strategic cloud initiatives.
The idóneo candidate will have deep expertise in Google Cloud Platform (GCP), advanced automation skills using Infrastructure as Code (IaC), and the ability to mentor team members while leading operational improvements.ResponsibilitiesArchitect, provision, and manage GCP cloud resources, ensuring scalability, security, and best practicesDesign reusable infrastructure solutions with Terraform and implement automation for streamlined resource provisioningOversee GKE node operating system upgrades, container OS version management, and patching processesDrive enhancements in infrastructure performance, automation, and resource optimizationLead 24/7 monitoring efforts to proactively detect and resolve performance and availability issuesDevelop operational procedures and automation to prevent recurring problems while addressing complex incidentsImplement and validate backup schedules, retention policies, and restoration processesEnsure compliance with backup governance standards and troubleshoot backup-related failuresEnforce cloud governance policies, including tagging frameworks, access management, and utilization best practicesBuild automated workflows to improve governance and simplify infrastructure managementProvide architectural guidance and recommendations to enhance application resiliency and infrastructure performanceCollaborate with customer teams to address cloud infrastructure challenges and deliver scalable solutionsManage escalations to GCP and oversee the resolution of critical technical issuesAnalyze infrastructure utilization, incident patterns, and service requests to drive cost optimization and efficiency initiativesSupport strategic cloud initiatives, including onboarding processes and alignment with customer change management requirementsRequirementsAt least 3 years of experience in cloud engineering or related rolesExtensive expertise in Google Cloud Platform (GCP), including provisioning, automation, and infrastructure managementAdvanced knowledge of Infrastructure as Code (IaC) tools like Terraform and experience creating reusable automation solutionsStrong proficiency in cloud monitoring, incident management, and performance optimization techniquesIn-depth understanding of governance processes, including access controls, tagging frameworks, compliance standards, and backup/recovery strategiesExperience managing containerized environments such as GKE and Docker, including patching and maintenanceProven leadership skills with the ability to mentor junior engineers and facilitate cross-team collaborationFamiliarity with cost optimization principles for cloud environments, including analyzing utilization trends and driving efficiencyFluent English skills, both written and spoken, at a B2+ level or higherBenefitsInternational projects with top brandsWork with global teams of highly skilled, diverse peersEmployee financial programsPaid time off and sick leaveUpskilling, reskilling and certification coursesUnlimited access to the LinkedIn Learning library and 22,000+ coursesGlobal career opportunitiesVolunteer and community involvement opportunitiesEPAM Employee GroupsAward-winning culture recognized by Glassdoor, Newsweek and LinkedIn
#J-*****-Ljbffr #J-18808-Ljbffr