Logo
Altice USA

Manager Tools & Observability

Altice USA, Bethpage, New York, United States, 11714

Save Job

Manager Tools & Observability

We are seeking an experienced and forward-thinking Manager, Tools & Observability to lead the strategy, implementation, and life cycle management of all observability tools and platforms across Network and IT domains. This role will own and evolve the monitoring and observability ecosystem to ensure high availability, performance, and visibility across our complex telecommunications infrastructure. The ideal candidate will bring deep experience in operations, automation, and observability within the telecommunications or technology sector. Responsibilities

Observability Strategy: Develop and execute a comprehensive observability roadmap covering all technology domains, including Network, IT, and Cloud environments. Tool Ownership & Management: Lead the deployment, configuration, lifecycle management, and integration of observability tools such as: Network Management Systems (NMS) Grafana, Kafka, Flink Alarm Manager, Splunk, AppDynamics Device42 and additional emerging platforms End-to-End Monitoring: Drive real-time visibility, alerting, telemetry, logging, and metrics collection strategies to enable proactive issue identification and resolution. Automation & Integration: Champion automation of observability workflows and integration with ITSM/DevOps pipelines to improve responsiveness and reduce manual intervention. Incident Intelligence: Enable intelligent alerting and anomaly detection to support incident response and problem management processes. Cross-Functional Collaboration: Partner with Engineering, IT, Security, DevOps, and Network Operations teams to ensure observability tools align with business and operational needs. Lifecycle Governance: Maintain versioning, upgrades, scalability plans, and decommissioning strategies for all tools in the observability stack. Team Leadership: Build, mentor, and lead a team of observability engineers and platform specialists, ensuring knowledge growth and operational excellence. Vendor Management: Manage vendor relationships, evaluate new solutions, and drive platform optimization and cost-efficiency. Qualifications

Experience: 10+ years in the Telecommunications or Technology industry with a strong focus on Network and IT Operations, Automation, and Observability. Education: Bachelor's or Master's degree in Computer Science, Engineering, Telecommunications, or a related discipline. Technical Expertise: In-depth knowledge of observability, monitoring, and alerting architectures across hybrid environments (on-prem/cloud) Proven experience with observability platforms including Grafana, Splunk, AppDynamics, Kafka, Flink, and NMS tools Familiarity with scripting, automation, and integration technologies (e.g., Python, REST APIs, Ansible) Understanding of modern infrastructure (e.g., containers, microservices, cloud-native environments) Leadership Skills: Strong project and people management capabilities with a track record of driving operational excellence and tool adoption. Certifications (Preferred): Relevant certifications in observability tools, cloud (AWS, Azure, GCP), ITIL, or DevOps practices.