Logo
Infobell IT

HPC System Administrator

Infobell IT, Austin, Texas, us, 78716

Save Job

Payroll and Compliance Manager @ Infobell IT | Enhancing Employee Satisfaction We are seeking an experienced HPC System Administrator to manage, maintain, and optimize HPC infrastructure, ensuring reliability, performance, and security.

Key Responsibilities

Administer HPC systems (installation, configuration, patching, tuning).

Support HPC users (job submission, troubleshooting, training).

Monitor system health, performance, and resource utilization.

Diagnose and resolve hardware/software/network issues.

Ensure compliance with security policies and implement data protection.

Maintain documentation and generate performance reports.

Qualifications

Experience with workload managers (SLURM, PBS, LSF, Torque).

Knowledge of parallel filesystems (Lustre, GPFS) & high-speed interconnects (InfiniBand).

Familiarity with monitoring tools (Nagios, Grafana, Prometheus).

Understanding of HPC security best practices.

Strong problem-solving skills, able to work independently and collaboratively.

#J-18808-Ljbffr