Logo
Advanced Micro Devices, Inc

Senior Cloud Administrator

Advanced Micro Devices, Inc, San Jose, California, United States, 95199

Save Job

Overview WHAT YOU DO AT AMD CHANGES EVERYTHING We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences – the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world’s most important challenges. We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives.

The Role The Cloud Administrator will be responsible for providing technical support to Engineering and Corporate organizations at AMD. This position will be required to support AMD’s global cloud infrastructure in a dynamic, fast-paced environment. Furthermore, this person will be collaborating globally on efforts for various IT activities related to the AMD Engineering AI / GPU - Compute Environment, in accordance with AMD Worldwide IT strategies and objectives.

The Person You're a highly motivated team player with a strong development background, problem solving mentality, excellent communication skills, ability to prioritize tasks along with willingness to learn and adapt. Excellent teamwork skills and capable of working independently.

Responsibilities

Design, develop, deploy, monitor, maintain, and evolve cloud-native resources, tools, services, reusable modules (infrastructure-as-code-practices) and frameworks to secure and automate provisioning of cloud infrastructure that empowers our users across Azure, AWS, GCP.

Provide customers with standards and best practices on how to deploy and consume cloud-based services.

Proactively seek opportunities to improve operational efficiency of teams and usage of cloud services.

Contribute to a strong team-culture and an atmosphere of cross-functional teamwork.

Work with internal customers in managing incident tickets to achieve operational excellence.

Work with global team to provide support and complete IT projects.

Create secure hybrid deployments of virtual machines, and PaaS solutions in Azure, AWS, GCP.

Work with Project teams to understand and accommodate application architecture and the App’s specific requirements for Azure, AWS, and GCP.

Collaborate with other engineers and stakeholders to share knowledge and build expertise for IaaS, PaaS, and SaaS deployment.

Collaborate with onshore and offshore resources.

Implement and automate security controls, governance processes, and compliance validation by closely partnering with the Security Team to incorporate respective requirements and best practices to keep our Cloud Env safe and secure.

Apply experience in migrating on-premises applications and workloads to Azure, AWS, GCP using cloud technologies and provide support.

Drive identity (IAM), access, and configuration management for cloud native tools.

Responsible for the Recovery and Continuity process for cloud environments.

Preferred Experience

Cloud Systems Engineer general experience of various CSPs fundamentals with experiences in Azure, AWS, GCP.

Terraform, YAML, Jenkins, GitHub actions, HashiCorp, CI / CD buildout.

Programming and scripting: Python, Golang, Shell, Java / J2EE, NodeJS, ReactJS, HTML5.

AI/ML: PyTorch, TensorFlow, REST API, GraphQL, Design Patterns, NOSQL, RDBMS, Elasticsearch, Redis Cache.

Able to build and support a full CI / CD pipeline to support consistent code deployment.

Understanding of AI frameworks to model large datasets, build, and test AI software for model performance.

Experience developing and implementing machine learning models and algorithms.

Managing GPU clusters and GPU-based services/tools/software.

Container technologies (GKE, EKS, ECS, Docker, Kubernetes).

Change Management / Release Process.

Strong analytical, problem-solving, and communication skills; Agile / Scrum experience.

Infrastructure automation (Ansible, Terraform, CloudFormation), Deployment Management, and Resource Management.

Designing and implementing solutions to improve efficiency and reduce costs via Kubernetes/containers, virtualization, functions, and automation.

Building and managing complex cloud environments in Azure, AWS, GCP with security measures for encryption, authorization, and protocols.

Monitoring, capacity planning, trend analysis, and automation-driven service improvements.

Collaboration with software development teams to troubleshoot and resolve issues.

Cloud networking (VPCs), Load balancers, WAFs, and CDNs; experience in Hybrid deployments.

Knowledge of HPC environments including cloud providers and partners.

Cloud native monitoring tools; Nagios, ELK stack, Kibana/Prometheus.

Proactive and empathetic mindset; strong stakeholder engagement and ability to juggle multiple projects.

Strong organizational ability.

Academic Credentials Bachelor's degree in Computer Science, Engineering, or a related field.

Location San Jose, CA

Benefits Benefits offered are described: AMD benefits at a glance.

Equal Opportunity AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout the recruitment and selection process.

#J-18808-Ljbffr