SS&C Technologies
Senior Observability Platform Engineer
SS&C Technologies, Hartford, Connecticut, United States
Overview
Senior Observability Platform Engineer role at SS&C Technologies. The position offers an exciting opportunity for software engineers passionate about open source software, Linux, Kubernetes, and Observability. The monitoring stack will provide comprehensive monitoring across system metrics, database performance, network health, and message queues. It will also oversee applications running on diverse cloud platforms, including Kubernetes and ESXi, as well as on bare-metal servers, virtual machines, and containers in the SS&C Private Cloud. Responsibilities
Design, develop, implement, and maintain the observability stack, including tracing, telemetry, logging, health monitoring, visualization, and dashboards to ensure reliability, performance, and operational efficiency of services. Design and implement a robust observability framework using composable open source solutions such as Prometheus, Alertmanager, OpenTelemetry, Grafana, Loki, Promtail, Tempo, Thanos, ELK stack, Zabbix, and similar. Develop and maintain health monitoring and alerting systems for compute platforms, databases, network infrastructure, and Kubernetes-based platforms, including GPU-supported environments. Create and manage visualization dashboards to monitor system performance, resource utilization, and operational health. Implement scalable, distributed logging and tracing solutions to diagnose, troubleshoot, and resolve system issues. Conduct performance analysis and optimization to ensure system reliability and efficiency. Stay updated with the latest trends and technologies in observability and performance monitoring. Collaborate with cross-functional teams (Cloud Engineering, Network, and DevOps/Solutions Engineering) to troubleshoot and resolve infrastructure issues. Preferred Qualifications
Proven experience in observability, system and network monitoring, and system performance analysis, particularly in cloud or data center environments. Expertise in implementing and managing observability tools and technologies such as Prometheus, Alertmanager, OpenTelemetry, Grafana, Loki, Promtail, Tempo, Thanos, ELK stack, Zabbix, and similar solutions. Hands-on experience with Kubernetes. Experience with infrastructure-as-code and configuration management tools such as Consul, GitHub, Salt Stack, Terraform, etc. Proficiency in scripting and automation using Go, Python, Shell. Excellent problem-solving skills and the ability to work independently or as part of a team. Strong communication skills and the ability to work in a fast-paced, dynamic environment. Educational Qualifications
Bachelor’s or Master’s degree in Computer Science, Information Technology, or a related field. Unless explicitly requested or approached by SS&C Technologies, Inc. or any of its affiliated companies, the company will not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. SS&C offers excellent benefits including health, dental, 401k plan, tuition and professional development reimbursement plan. SS&C Technologies is an Equal Employment Opportunity employer and does not discriminate against any applicant for employment or employee on the basis of race, color, religious creed, gender, age, marital status, sexual orientation, national origin, disability, veteran status or any other classification protected by applicable discrimination laws. Job Details
Seniority level: Mid-Senior level Employment type: Full-time Job function: Engineering and Information Technology Industries: Software Development
#J-18808-Ljbffr
Senior Observability Platform Engineer role at SS&C Technologies. The position offers an exciting opportunity for software engineers passionate about open source software, Linux, Kubernetes, and Observability. The monitoring stack will provide comprehensive monitoring across system metrics, database performance, network health, and message queues. It will also oversee applications running on diverse cloud platforms, including Kubernetes and ESXi, as well as on bare-metal servers, virtual machines, and containers in the SS&C Private Cloud. Responsibilities
Design, develop, implement, and maintain the observability stack, including tracing, telemetry, logging, health monitoring, visualization, and dashboards to ensure reliability, performance, and operational efficiency of services. Design and implement a robust observability framework using composable open source solutions such as Prometheus, Alertmanager, OpenTelemetry, Grafana, Loki, Promtail, Tempo, Thanos, ELK stack, Zabbix, and similar. Develop and maintain health monitoring and alerting systems for compute platforms, databases, network infrastructure, and Kubernetes-based platforms, including GPU-supported environments. Create and manage visualization dashboards to monitor system performance, resource utilization, and operational health. Implement scalable, distributed logging and tracing solutions to diagnose, troubleshoot, and resolve system issues. Conduct performance analysis and optimization to ensure system reliability and efficiency. Stay updated with the latest trends and technologies in observability and performance monitoring. Collaborate with cross-functional teams (Cloud Engineering, Network, and DevOps/Solutions Engineering) to troubleshoot and resolve infrastructure issues. Preferred Qualifications
Proven experience in observability, system and network monitoring, and system performance analysis, particularly in cloud or data center environments. Expertise in implementing and managing observability tools and technologies such as Prometheus, Alertmanager, OpenTelemetry, Grafana, Loki, Promtail, Tempo, Thanos, ELK stack, Zabbix, and similar solutions. Hands-on experience with Kubernetes. Experience with infrastructure-as-code and configuration management tools such as Consul, GitHub, Salt Stack, Terraform, etc. Proficiency in scripting and automation using Go, Python, Shell. Excellent problem-solving skills and the ability to work independently or as part of a team. Strong communication skills and the ability to work in a fast-paced, dynamic environment. Educational Qualifications
Bachelor’s or Master’s degree in Computer Science, Information Technology, or a related field. Unless explicitly requested or approached by SS&C Technologies, Inc. or any of its affiliated companies, the company will not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. SS&C offers excellent benefits including health, dental, 401k plan, tuition and professional development reimbursement plan. SS&C Technologies is an Equal Employment Opportunity employer and does not discriminate against any applicant for employment or employee on the basis of race, color, religious creed, gender, age, marital status, sexual orientation, national origin, disability, veteran status or any other classification protected by applicable discrimination laws. Job Details
Seniority level: Mid-Senior level Employment type: Full-time Job function: Engineering and Information Technology Industries: Software Development
#J-18808-Ljbffr