Covestic
Role Summary
We are seeking a highly skilled and motivated Data Center Operations Technician 3 to join our dynamic infrastructure operations team. In this role, you will be responsible for the diagnosis, troubleshooting, and repair of servers. The ideal candidate will possess a deep understanding of data center environments, repair experience, and a proficiency in Linux-based operating systems. You will play a critical role in ensuring the stability, reliability, and efficiency in data center infrastructure.
Responsibilities & Tasks
Hardware Troubleshooting & Repair:
Diagnose and resolve complex hardware failures on a variety of servers, including motherboards, CPUs, RAM, and storage devices Perform component-level repairs and replacements on servers and other data center hardware Manage and execute the hardware break/fix process, ensuring minimal downtime and adherence to service level agreements (SLAs) Conduct root cause analysis of hardware failures and provide recommendations for preventative measures Linux System Administration:
Utilize Linux command-line interface (CLI) for system monitoring, troubleshooting, and configuration Assist in the deployment and provisioning of new servers running various Linux distributions Troubleshoot boot-level and OS-level issues on Linux servers Work with engineering teams to resolve escalated technical issues related to the interaction between hardware and the Linux OS Data Center Operations:
Manage data center inventory, including spare parts and retired assets Maintain detailed documentation of all hardware repairs, changes, and configurations Participate in on-call rotations to respond to after-hours emergencies (if applicable) Adhere to all safety and security protocols within the data center environment Mentorship & Collaboration:
Provide training and guidance to team members on best practices for hardware repair and troubleshooting Collaborate with network, storage, and other infrastructure teams to resolve complex, cross-functional issues Experience, Skills and Qualifications Required Experience:
Minimum of 3-5 years of experience in a data center environment, with a significant focus on hardware troubleshooting and repair A minimum of 2 years of hands-on experience with Linux operating systems (e.g., RHEL, CentOS, Ubuntu) in a server environment is mandatory Technical Skills:
Expert knowledge of x86 server architecture and components Proficiency in diagnosing and repairing server hardware Strong understanding of network hardware, including switches, routers, and firewalls Solid command of Linux/Unix command-line tools for diagnostics and troubleshooting Familiarity with data center infrastructure management (DCIM) and ticketing systems Experience with structured cabling and fiber optic connectivity Certifications (Preferred but not required)
CompTIA A+ CompTIA Server+ CompTIA Linux+ or LPI certification Vendor-specific hardware certifications Physical Requirements:
Ability to lift and move equipment up to 50 lbs. Ability to work in a temperature-controlled environment with moderate noise levels Must be able to perform physical tasks such as standing, walking, bending, and kneeling for extended periods #J-18808-Ljbffr
Diagnose and resolve complex hardware failures on a variety of servers, including motherboards, CPUs, RAM, and storage devices Perform component-level repairs and replacements on servers and other data center hardware Manage and execute the hardware break/fix process, ensuring minimal downtime and adherence to service level agreements (SLAs) Conduct root cause analysis of hardware failures and provide recommendations for preventative measures Linux System Administration:
Utilize Linux command-line interface (CLI) for system monitoring, troubleshooting, and configuration Assist in the deployment and provisioning of new servers running various Linux distributions Troubleshoot boot-level and OS-level issues on Linux servers Work with engineering teams to resolve escalated technical issues related to the interaction between hardware and the Linux OS Data Center Operations:
Manage data center inventory, including spare parts and retired assets Maintain detailed documentation of all hardware repairs, changes, and configurations Participate in on-call rotations to respond to after-hours emergencies (if applicable) Adhere to all safety and security protocols within the data center environment Mentorship & Collaboration:
Provide training and guidance to team members on best practices for hardware repair and troubleshooting Collaborate with network, storage, and other infrastructure teams to resolve complex, cross-functional issues Experience, Skills and Qualifications Required Experience:
Minimum of 3-5 years of experience in a data center environment, with a significant focus on hardware troubleshooting and repair A minimum of 2 years of hands-on experience with Linux operating systems (e.g., RHEL, CentOS, Ubuntu) in a server environment is mandatory Technical Skills:
Expert knowledge of x86 server architecture and components Proficiency in diagnosing and repairing server hardware Strong understanding of network hardware, including switches, routers, and firewalls Solid command of Linux/Unix command-line tools for diagnostics and troubleshooting Familiarity with data center infrastructure management (DCIM) and ticketing systems Experience with structured cabling and fiber optic connectivity Certifications (Preferred but not required)
CompTIA A+ CompTIA Server+ CompTIA Linux+ or LPI certification Vendor-specific hardware certifications Physical Requirements:
Ability to lift and move equipment up to 50 lbs. Ability to work in a temperature-controlled environment with moderate noise levels Must be able to perform physical tasks such as standing, walking, bending, and kneeling for extended periods #J-18808-Ljbffr