IT Server Operations Manager
The Jupiter Group, Inc - Houston, Texas, United States, 77246
Work at The Jupiter Group, Inc
Overview
- View job
Overview
enterprise server infrastructure reliability , optimizing performance, and ensuring business continuity across both legacy systems and cloud environments. This role is essential for driving operational excellence and technological advancement within the organization's infrastructure landscape. Other responsibilities include, but are not limited to: Manage and lead a high-performing team of server operations specialists through effective mentorship, performance management, and professional development initiatives Establish and oversee comprehensive monitoring solutions across traditional infrastructure and cloud-based environments Ensure operational coverage by implementing comprehensive scheduling for server operations personnel within the Network Operations Center Manage server procurement processes and oversee installation, provisioning, and maintenance activities in both local and remote data center environments As part of the Network Operation Center, collaborate with network operations to define, implement, and continuously enhance monitoring frameworks for both current infrastructure and future environment deployments Define, implement, and maintain service level agreements (SLAs), operational metrics, and executive performance dashboards Spearhead continuous improvement initiatives to enhance operational efficiency and service delivery Facilitate knowledge transfer between legacy monitoring systems and modern infrastructure tooling Partner with IT leadership on infrastructure strategy development and technology roadmap planning Orchestrate incident response protocols and implement robust preventative measures Direct capacity planning efforts and resource optimization across the server infrastructure ecosystem Communicate effectively with technical and non-technical stakeholders, translating complex concepts into business-relevant insights Qualifications Minimum 5 years of progressive experience managing IT support engineers and specialists Minimum 10 years of professional experience in enterprise server operations Proven track record working within a Network Operations Center (NOC) environment, with expertise in server performance monitoring, health assessment, and availability management Demonstrated leadership capabilities with a history of building and developing high-performance technical teams Established record of successful collaboration with security teams on infrastructure protection initiatives Advanced proficiency with enterprise monitoring platforms including Dynatrace, SolarWinds Orion, Prometheus, and Grafana Expertise in server performance management, including proactive monitoring frameworks, capacity planning, and infrastructure optimization Comprehensive knowledge of Windows Server and Linux Server environments Demonstrated experience implementing automated remediation solutions for common server issues Strong background in infrastructure automation and scripting (Ansible, Python, PowerShell, Bash) Experience implementing and managing monitoring solutions for Kubernetes clusters In-depth knowledge of AWS and Azure operations and monitoring toolsets Proven success managing hybrid infrastructure environments Thorough understanding of infrastructure-as-code principles and implementation methodologies Experience developing and implementing cost optimization strategies across cloud environments Knowledge of cloud security best practices and regulatory compliance frameworks Seniority level
Mid-Senior level Employment type
Full-time Job function
Management Industries
Oil and Gas
#J-18808-Ljbffr