Hewlett Packard Enterprise
Senior Software Engineer – HPC & AI Advanced Development
Hewlett Packard Enterprise, Ammon, Idaho, United States
Apply for the Senior Software Engineer – HPC & AI Advanced Development role at Hewlett Packard Enterprise. This position is remote within the United States and requires valid U.S. work authorization.
Who We Are
Hewlett Packard Enterprise is a global edge‑to‑cloud company that helps companies connect, protect, analyze, and act on their data and applications wherever they live. Our culture thrives on new ideas, values diverse backgrounds, and supports flexible work and personal needs.
Job Description
HPE is seeking a motivated and skilled Senior Software Engineer to join the Advanced Programming Team within the HPC & AI Advanced Development organization. In this role, you will collaborate to solve challenges in scaling high‑fidelity, discrete‑event simulations on HPE supercomputers, using distributed memory and resilient execution techniques such as checkpointing. You will also develop workflows for distributed, large‑scale data analysis of traces, logs, and telemetry from simulations and HPC systems.
Key Responsibilities
Develop, experiment with, and test distributed HPC/AI workflows for interactive processing of large‑scale telemetry datasets (terabytes to petabytes).
Build solutions by composing existing open‑source tools and using distributed and parallel programming approaches to scale data and simulation size.
Participate actively in a collaborative, consensus‑driven design process.
Work in an Agile development environment.
Create documentation, collaborate with users, and present progress through writing, slides, and verbal communication.
Required Skills And Qualifications
6‑8 years of industry or comparable experience in software engineering.
Proficiency in one or more of C, C++, or Python.
Exposure to high‑performance computing (HPC) or scientific computing.
Experience designing, building, or operating distributed large‑scale systems in production environments.
Experience with software engineering workflows, including version control, code reviews, automated testing, and CI/CD pipelines.
Excellent communication skills for conveying technical concepts through documentation, presentations, and design discussions.
Strong analytical and problem‑solving skills.
Nice to Haves
Experience collaborating with scientists or engineers on data science, data analytics, simulations, or modeling.
Experience with distributed‑memory parallel programming on supercomputers or large‑scale clusters.
Background in digital twin software development and integration with visualization tools and AI/ML workflows.
Experience with containerization and orchestration (Docker, Podman, Apptainer, Slurm, Kubernetes).
Experience developing or supporting workflows for HPC system design and operation.
Experience developing AI surrogates for real‑time HPC system error detection.
Experience incorporating and fine‑tuning large language models for a chat interface.
Knowledge of parallel and discrete‑event simulation, especially with SST.
Familiarity with checkpointing techniques, performance debugging, and optimization at scale.
Familiarity with Pandas, NumPy, Dask, Spark, or other data science technologies.
Familiarity with developer operations (DevOps) and AIOps.
Additional Skills
Artificial Intelligence Technologies, Cross Domain Knowledge, Data Engineering, Data Science, Design Thinking, Development Fundamentals, Full Stack Development, IT Performance, Machine Learning Operations, Scalability Testing, Security‑First Mindset.
What We Can Offer You
Health & Wellbeing
We provide a comprehensive suite of benefits that supports physical, financial, and emotional wellbeing for employees and their loved ones.
Personal & Professional Development
We invest in your career through programs that help you reach any goal, whether becoming a subject‑matter expert or applying your skills to another division.
Unconditional Inclusion
We celebrate diverse backgrounds, provide flexible work, and foster a supportive environment where all employees can thrive.
Let’s Stay Connected
Follow @HPECareers on Instagram for updates on people, culture, and tech at HPE.
Job Level TCP_04
Employee Compensation
The expected salary for this position varies by location:
California & New York: $155,500 – $315,000
Arizona, Idaho, Texas, Washington: $137,000 – $315,000
Variable incentives may also be offered.
EEO Statement
HPE is an Equal Employment Opportunity/Veteran/Disabled/LGBT employer. We do not discriminate on the basis of race, gender, or any other protected category. All decisions are made on qualifications, merit, and business need. HPE complies with all applicable laws related to employer use of arrest and conviction records.
No Fees Notice & Recruitment Fraud Disclaimer
Hewlett Packard Enterprise will never charge any candidate a registration fee, hiring fee, or any other fee in connection with its recruitment and hiring process. Any agency claiming to work with HPE must be verified by candidates; any reliance on fraudulent representations is at the candidate’s own risk.