Logo
Hewlett Packard Enterprise

Senior Principal Cloud Developer (GEN AI)

Hewlett Packard Enterprise, San Jose, California, United States, 95199

Save Job

Senior Principal Cloud Developer (Gen Ai)

Joining our Hybrid Cloud BU and working as part of our HPE OpsRamp team is a chance to make history and be a driving force in the industry. We are revolutionizing cloud computing by building a large-scale, enterprise-ready platform that powers a hybrid edge-to-cloud world. Our platform enables the world's largest and most diverse enterprise IT teams and managed service providers to control the chaos of modern digital infrastructure, and to deliver quickly, efficiently, and at scale while keeping their data secure and meeting sustainability goals. We do this through hybrid discovery and monitoring, event and incident management, remediation, and automation, powered by AI. We help our enterprise and MSP customers avoid costly outages and performance issues that result in lost revenue and productivity. With over 100,000 dedicated customers and 1 million devices in production, we are committed to accelerating transformation across data, connectivity, cloud, and security, providing essential solutions for businesses of all sizes. Together, we make the impossible possible, and we are confident in our ability to lead the way in shaping the future of cloud computing. What you'll do: We're solving the world's most complex challenges, and our people are at the forefront of progress. We are seeking a highly skilled hands-on senior architect with a passion to innovate and create Agentic AI applications for OpsRamp's observability SaaS product. This individual must demonstrate strong hands-on knowledge of designing and building large scale distributed systems. Understand cloud-native architecture concepts and have knowledge of best practices for high availability, scalability, resilience, performance, and security requirements in the cloud and apply them towards developing next-generation distributed multi-agent GenAI applications. Help transition proof-of-concept implementations into R&D teams to accelerate new product delivery. Creates technical content such as designs, specifications, and initial software implementations. Guides and mentors less-experienced staff members to set an example of software systems design and development innovation and excellence, helping to grow engineers into more senior technical roles. Collect product feedback from field interactions to provide input into Engineering and Product Management to influence product roadmap direction. Maintain a high level of knowledge of OpsRamp SaaS product and product road maps, as well as that of the competition and prospective strategic partners. What you'll need: Must have 10-15 years of experience in developing highly scalable cloud and cloud-native applications using a diverse range of technology stacks, architecture, design, development, and support. Experience developing products with deep Linux Kernel knowledge and hands-on multi-year experience in eBpf technologies. At least one year of recent multi-agent Agentic and RAG GenAI Software Development experience/exposure applied to Networking and/or Observability domains. Deep understanding and experience in developing infrastructure and applications stacks with multiple frameworks. Proven skills and programming experience in Golang, scalable concurrent processing, REST, Data Caching Services, DB schema design and data access technologies. Experience in building, orchestrating, and deploying highly scalable REST based stateless APIs/web services for web applications in Kubernetes environment. Facilitate continuous integration/continuous deployment (CI/CD) by integrating various parts of the development process. Familiarity with developing with AI-assisted code development frameworks such as GitHub Copilot. Lead a team of software developers, nurturing their skills, overseeing their development work, and ensuring adherence to best practices and standards. Collaborate closely with the product team to translate functional requirements into technical solutions. Drive the adoption of cloud-native principles, systems and technologies within the team. Develop comprehensive monitoring solutions to provide full visibility to the different platform components using tools and services that are part of the cloud infrastructure. Monitor and manage performance, cost, security and availability of the applications. Collaborate with other business units to understand and address their needs and translate them into application and operational requirements. Ability to communicate with senior Executives and with customers. Familiarity with code versioning tools - such as Git. Preferred: Expertise in Linux Kernel and eBpf (User and Kernel space) (Any 2 or more domains: Security, Networking, Observability, Auto Instrumentation for APM) Deep knowledge in the Kubernetes ecosystem w.r.t Kubernetes object hierarchy and controller architectures. Strong knowledge of container architectures including Docker, Cri-o, Container etc. Strong knowledge developing Kubernetes applications with Observability principles. Rich experience and/or exposure to highly scalable and compute intensive Kubernetes environments. Deep knowledge of Cloud-based Kubernetes environments (EKS, GKS etc.) Knowledge of the principles of CNI, CSI etc. Familiarity with a few CNIs and CSIs. Understanding of microservice design and architectural patterns Knowledge of hybrid cloud environments Comfortable collaborating with team-mates working from around the globe Working knowledge of developing ReAct systems on top of latest Thinking Models. GenAI applications for Network Observability or any domain is highly desirable. Desired (as many as possible): Prompt engineering, Context Management and Agentic Workflows. Knowledge of Agentic Protocols such as MCP, A2A Knowledge of Kubernetes Orchestration tools such as Rancher Desktop etc. Knowledge of Kubernetes and Application observability. Knowledge of Linux Networking eBPF based observability (sockets, netdevices, namespaces, routing, bridges, iptables, conntrack, networking device drivers, storage etc.), Container Networking technologies (Docker), Linux Storage architecture etc. Knowledge of complex event processing and event-driven architecture At HPE, we're: Ranked 19th on Fortune's 2022 list of 100 Best Companies to Work For. Ranked 7th on Newsweek's list of America's Most Responsible Companies 2022 Named a Best Place to Work for Disability Inclusion for the sixth year in row Named one of the 100 Best Large Workplaces for Millennials in 2021 Recognized as one of the Best Companies for Multicultural Women by Seramount Ready to take the next step? Open up opportunities with HPE.