Texas A&M University
Job Title : Senior HPC Engineer
Agency : Texas A&M University
Department : Technology Services - IT Enterprise Operations
Proposed Minimum Salary : Commensurate
Job Location : College Station, Texas
Job Type : Staff
Job Description
We are making a bold leap into the future of artificial intelligence with a $45 million investment in an NVIDIA DGX SuperPOD.
This investment underscores our commitment to meeting the cutting-edge research and supercomputing needs of faculty and staff across all Texas A&M System members.
As a Senior High Performance Computing (HPC) Engineer, you will provide technical expertise and consultation for the design and deployment of HPC systems.
Get in on the ground floor with a team that is shaping the next generation of innovation.
This position is security-sensitive and requires U.S. citizenship.
Opportunities to Contribute
Manage large-scale HPC cluster operations, including OS upgrades, firmware patching, and performance tuning.
Oversee networking, security, and infrastructure for HPC systems.
Lead the development of specialized HPC computing clouds and scalable storage systems.
Collaborate with stakeholders to develop service-based solutions.
Serve as a strategic technical resource across departments.
Lead enterprise-wide HPC projects using established project management protocols.
Mentor junior system administrators and enforce performance standards.
What you need to know
Salary: $125-136K
Location: In-person role in College Station, Texas.
Schedule: This role may require working outside of standard office hours, including evenings, weekends, and holidays, to support the demands of technology services and ensure the seamless operation of essential systems.
Citizenship: Must be a United States citizen, permanent resident, or a person granted asylum or refugee status in accordance with 15 CFR, Part 762; 22 CFR §§122.5, 123.22 and 123.26; and 31 CFR § 501.601.
Qualifications
Bachelor’s degree in applicable field or equivalent combination of education and experience.
12 years of related experience.
A well-qualified candidate should possess one or more of the following:
Experience with High Performance Computing (HPC) environments.
Advanced Linux system administration skills.
Familiarity with computer networking concepts and protocols.
Experience with container orchestration tools such as Kubernetes.
Knowledge of Run:ai for AI workload management.
Proficiency with Slurm workload manager.
Experience working with NVIDIA DGX systems.
Understanding of virtualization technologies.
Familiarity with Infrastructure as a Service (IaaS) platforms.
Experience with DDN storage solutions.
Knowledge of network-attached storage systems.
What to do
Apply! A cover letter and resume will assist in the review of your application materials. You can upload them in the CV/Resume section during the application process.
Why Texas A&M University?
Texas A&M University is committed to enriching the learning and working environment by promoting a culture that respects all perspectives, talents & lived experiences. Embracing varying opinions and perspectives strengthens our core values: Respect, Excellence, Leadership, Loyalty, Integrity, and Selfless Service.
We are a prestigious university with strong traditions, Core Values, and a community of caring and collaboration. Amenities associated with a major university, such as sporting and cultural events, state‑of‑the‑art recreation facilities, the Bush Library and Museum, and much more await you. Experience all that a big city has to offer but with a reasonable cost‑of‑living and no long commutes.
Benefits and Perks
Health, dental, vision, life and long‑term disability insurance with Texas A&M contributing to employee health and basic life premiums.
12‑15 days of annual paid holidays.
Up to eight hours of paid sick leave and at least eight hours of paid vacation each month.
Automatic enrollment in the Teacher Retirement System of Texas.
Health and Wellness: Free exercise programs and release time.
Professional Development: All employees have access to free LinkedIn Learning training, webinars, and limited financial support to attend conferences, workshops, and more.
Educational release time and tuition assistance for completing a degree while a Texas A&M employee.
All positions are security-sensitive. Applicants are subject to a criminal history investigation, and employment is contingent upon the institution’s verification of credentials and/or other information required by the institution’s procedures, including the completion of the criminal history check.
Equal Opportunity/Veterans/Disability Employer.