Corvid Technologies LLC
HPC Systems Engineer
Corvid Technologies LLC, Mooresville, North Carolina, United States, 28117
Job Description
Corvid Technologies is seeking HPC Systems Engineers with a strong background in and enthusiasm for Linux to support our Linux-based High-Performance Computer consisting of 80,000+ processor cores. If you enjoy learning, playing with hardware, optimizing performance and efficiency, and spending most of your time on the command line, this is the job for you.

Candidates will be responsible for the following:
- Support software installation and configuration of license management servers (e.g., FlexLM or RLM)
- Implement site-to-site VPNs (e.g., IPsec tunnels) to customers on customer HPC clusters
- Troubleshoot slow, hanging, or failing HPC jobs on internal or customer HPC clusters
- Automate repetitive tasks and implement custom solutions using scripting/programming languages such as Bash or Python
- Provide guidance and support on HPC best practices and solutions for internal and external customers
- Troubleshoot hardware and software issues on Linux servers
- Install new hardware into existing compute clusters
- Design, test, and implement an HPC environment consisting of a provisioner (e.g., xCAT, Warewulf), a scheduler (e.g., Slurm, SGE, PBS), RDMA connections (e.g., InfiniBand), a subnet manager, and 5+ compute nodes within the first 180 days of employment
- Obtain a CompTIA Security+ certification within the first year of employment

Requirements:
- Bachelor's degree in Engineering or related STEM field (master's preferred)
- Scripting experience
- Professional/personal experience using command-line Linux (RHEL derivatives preferred)
- Experience in one or more engineering computational codes OR 2+ years of IT-related experience (e.g., user support, basic networking, Linux server administration, a home Linux environment)
- Obtain and maintain a U.S. security clearance

Preferred Skills:
- Past experience as an HPC user on a large-scale cluster
- Past experience managing information systems within a classified environment
- Experience installing, configuring, and maintaining job management tools (such as Slurm, Moab, TORQUE, PBS, etc.)
- Experience configuring, installing, and troubleshooting MPI and OpenMP applications
- Experience with operating system deployment tools (e.g., xCAT, Rocks)
- Hands-on experience with at least one distributed file system (Spectrum Scale-GPFS, Lustre, BeeGFS, Gluster, IMRIX, PVFS, etc.)
- Direct experience working with InfiniBand
- Experience configuring, installing, tuning, and maintaining scientific software on large-scale systems
- Experience supporting HPC compilers and libraries
- Experience with configuration management tools such as Ansible or Puppet
- Familiarity with authentication and access control systems (ADFS, LDAP, Kerberos)
- Active U.S. security clearance
- Current and active CompTIA Security+ certification

About Corvid:
Corvid Technologies is an engineering firm specializing in high-fidelity computational modeling and simulation to analyze, design, and manufacture products for aerospace, DoD, and commercial customers. We offer a fast-paced and flexible work environment, where we tackle difficult, cutting-edge technical challenges using state-of-the-art technologies and resources.

Why Corvid:
We value our employee-owners. In addition to offering challenging work opportunities and competitive pay, Corvid believes in providing a strong benefits package that delivers value to our team members at all stages of their careers.
Our benefits include:
- Employee ownership through our generous 401(k) match in Corvid Stock
- Medical insurance via Blue Cross - PPO and High-Deductible plans (with company HSA contribution)
- Paid Time Off (PTO) starting at 3 weeks - based on years of industry experience not tenure
- Career development and continuing education opportunities
- Company provided life, long-term, and short-term disability insurance
- Incentive opportunities to reward strong performance and corporate growth
- Attractive campus facilities including Lake Norman access, kayaks, paddle boards, basketball and pickleball courts, grills, and more
- Paid gym membership