Coforge
AIOps Enterprise Architect
Location: Windsor, CT
Experience: 12-18 Years
Responsibilities
Responsible for designing and governing the enterprise-wide observability and intelligent operations architecture.
Ensure real-time visibility, predictive analytics, and autonomous remediation capabilities across hybrid and cloud-native environments.
Define and maintain the enterprise observability and AIOps reference architecture.
Develop and enforce architecture principles, standards, and best practices for monitoring, logging, tracing, and event management.
Align observability strategies with enterprise IT and business goals, ensuring scalability, security, and compliance.
Architect and integrate unified observability stacks (e.g., OpenTelemetry, Prometheus, Grafana, ELK, Splunk, Dynatrace, Datadog).
Design telemetry pipelines for metrics, logs, traces, and events across distributed systems.
Implement AIOps capabilities such as anomaly detection, event correlation, root cause analysis, and predictive alerting.
Closely work with DevOps, SRE, Cloud, Security, and Application teams to embed observability into the SDLC and CI/CD pipelines.
Lead cross-functional teams in the selection, integration, and optimization of observability and AIOps tools.
Provide architectural guidance to solution architects and engineering teams.
Drive automation of incident response and operational workflows using AI/ML and rule-based systems.
Define and monitor SLOs, SLIs, and KPIs to ensure system reliability and performance.
Support real-time analytics and business-impact insights through enriched telemetry data.
Qualifications
15+ years of experience in IT architecture, infrastructure, or software engineering.
5+ years of experience in observability, monitoring, or AIOps domains.
Deep expertise in observability tools (e.g., Splunk, Prometheus, Grafana, Dynatrace, ELK, OpenTelemetry).
Strong understanding of cloud platforms (AWS, Azure, GCP) and hybrid cloud architectures.
Proficiency in scripting and automation (Python, Bash, Terraform, Ansible).
Experience with CI/CD pipelines and DevOps/DevSecOps practices.
Familiarity with ITSM tools (e.g., ServiceNow) and CMDB integration.
Experience in Finance industries is a plus.
#J-18808-Ljbffr
Experience: 12-18 Years
Responsibilities
Responsible for designing and governing the enterprise-wide observability and intelligent operations architecture.
Ensure real-time visibility, predictive analytics, and autonomous remediation capabilities across hybrid and cloud-native environments.
Define and maintain the enterprise observability and AIOps reference architecture.
Develop and enforce architecture principles, standards, and best practices for monitoring, logging, tracing, and event management.
Align observability strategies with enterprise IT and business goals, ensuring scalability, security, and compliance.
Architect and integrate unified observability stacks (e.g., OpenTelemetry, Prometheus, Grafana, ELK, Splunk, Dynatrace, Datadog).
Design telemetry pipelines for metrics, logs, traces, and events across distributed systems.
Implement AIOps capabilities such as anomaly detection, event correlation, root cause analysis, and predictive alerting.
Closely work with DevOps, SRE, Cloud, Security, and Application teams to embed observability into the SDLC and CI/CD pipelines.
Lead cross-functional teams in the selection, integration, and optimization of observability and AIOps tools.
Provide architectural guidance to solution architects and engineering teams.
Drive automation of incident response and operational workflows using AI/ML and rule-based systems.
Define and monitor SLOs, SLIs, and KPIs to ensure system reliability and performance.
Support real-time analytics and business-impact insights through enriched telemetry data.
Qualifications
15+ years of experience in IT architecture, infrastructure, or software engineering.
5+ years of experience in observability, monitoring, or AIOps domains.
Deep expertise in observability tools (e.g., Splunk, Prometheus, Grafana, Dynatrace, ELK, OpenTelemetry).
Strong understanding of cloud platforms (AWS, Azure, GCP) and hybrid cloud architectures.
Proficiency in scripting and automation (Python, Bash, Terraform, Ansible).
Experience with CI/CD pipelines and DevOps/DevSecOps practices.
Familiarity with ITSM tools (e.g., ServiceNow) and CMDB integration.
Experience in Finance industries is a plus.
#J-18808-Ljbffr