Logo
SS&C Technologies

Senior Observability Platform Engineer

SS&C Technologies, Albuquerque, New Mexico, United States

Save Job

Overview

Join to apply for the

Senior Observability Platform Engineer

role at

SS&C Technologies . SS&C is a leading financial services and healthcare technology company headquartered in Windsor, Connecticut, with 27,000+ employees in 35 countries. The role focuses on open source software, Linux, Kubernetes, and Observability, with a monitoring stack spanning system metrics, database performance, network health, and message queues. The team will oversee applications across diverse cloud platforms, including Kubernetes and ESXi, as well as on bare-metal servers, virtual machines, and containers in the SS&C Private Cloud. Responsibilities

Design, develop, implement, and maintain a comprehensive observability stack (tracing, telemetry, logging, health monitoring, visualization, dashboards) to ensure reliability, performance, and operational efficiency of services. Design and implement a robust observability framework using composable open source solutions such as Prometheus, Alertmanager, OpenTelemetry, Grafana, Loki, Promtail, Tempo, Thanos, ELK stack, Zabbix, and similar. Develop and maintain health monitoring and alerting systems for compute platforms, databases, network infrastructure, and Kubernetes-based platforms including GPU-enabled environments. Create and manage visualization dashboards to monitor system performance, resource utilization, and operational health. Implement scalable, distributed logging and tracing solutions to diagnose, troubleshoot, and resolve issues. Collaborate with development and operations teams to integrate observability practices into the development lifecycle. Conduct performance analysis and optimization to improve system reliability and efficiency. Stay updated with the latest trends and technologies in observability and performance monitoring. Collaborate with Cloud Engineering, Network, and DevOps/Solutions Engineering teams to troubleshoot and resolve infrastructure issues. Preferred Qualifications

Experience in observability, system and network monitoring, and performance analysis in cloud or data center environments. Expertise with observability tools and technologies (Prometheus, Alertmanager, OpenTelemetry, Grafana, Loki, Promtail, Tempo, Thanos, ELK stack, Zabbix, and similar solutions). Hands-on experience with Kubernetes. Experience with infrastructure-as-code and configuration management tools (e.g., Terraform, Consul, GitHub, Salt Stack). Scripting and automation skills in Go, Python, Shell. Strong problem-solving abilities and the capacity to work independently or in a team. Excellent communication skills and ability to thrive in a fast-paced environment. Educational Qualifications

Bachelor’s or Master’s degree in Computer Science, Information Technology, or a related field. SS&C Technologies is an Equal Employment Opportunity employer and does not discriminate against any applicant for employment or employee on the basis of race, color, religious creed, gender, age, marital status, sexual orientation, national origin, disability, veteran status or any other classification protected by applicable discrimination laws. Employment details

Seniority level: Mid-Senior level Employment type: Full-time Job function: Engineering and Information Technology Industries: Software Development

#J-18808-Ljbffr