Wellmark Blue Cross and Blue Shield
Platform Engineer - Observability
Wellmark Blue Cross and Blue Shield, Des Moines, Iowa, United States, 50319
Overview
The Observability Platform Engineer is responsible for designing, building, and maintaining observability platform tools and frameworks that enable development and operations teams to monitor and improve the performance, availability, and reliability of systems. The engineer will collaborate with development, site reliability engineering, DevOps, and infrastructure teams to deliver a seamless observability ecosystem. What you will own
Design, build, and maintain observability platforms with reusability across services in mind. Develop scalable, automated pipelines for ingesting, transforming, and visualizing telemetry data. Integrate observability tools (e.g., Dynatrace, Splunk, Prometheus, Grafana, Datadog, New Relic, OpenTelemetry) with existing infrastructure and applications. Enable root cause analysis through correlation of metrics, logs, and traces. Analyze telemetry data to identify performance bottlenecks and optimize resource allocation for improved efficiency. Define SLIs, SLOs, and error budgets with stakeholders for critical services. Improve incident response by enhancing monitoring dashboards, alerts, and automated notifications. Qualifications
Preferred: 3–5 years of experience in Site Reliability Engineering, DevOps, or Observability/Monitoring engineering roles. Proven experience building or administering observability platforms in production environments. Track record of improving system reliability and reducing MTTR. Hands-on experience with one or more observability platforms: Dynatrace, Prometheus, Grafana, OpenTelemetry, Elastic Stack, Splunk, Datadog, New Relic, AppDynamics, Honeycomb. Strong knowledge of observability concepts: metrics, logs, traces, SLOs/SLIs, error budgets. Experience working within an Agile team environment and deploying OpenTelemetry-based observability pipelines. Prior experience in highly regulated environments with compliance observability needs. Contributions to observability open-source projects and familiarity with chaos engineering practices to validate monitoring and resilience. Certifications from AWS, Microsoft Azure, or Google Cloud. Demonstrated experience coaching/mentoring others and providing guidance to strengthen knowledge and skills. Excellent problem-solving, written and verbal communication skills; ability to explain complex topics to engineers and business stakeholders. Proficiency in programming or scripting languages (Python, Go, Java, Bash) for observability automation. Experience with containerization and orchestration platforms (Docker, Kubernetes). Deep knowledge of cloud platforms (AWS, Azure, GCP), observability/monitoring services, operating systems (Windows/Linux), networking, and containerization. Strong understanding of distributed systems, microservices, and cloud-native architectures. Proficiency in CI/CD pipelines and how observability integrates into DevOps workflows. Knowledge of incident management and on-call practices. Experience with supporting observability and monitoring for Artificial Intelligence agents. Required: Bachelor\'s Degree or direct and applicable work experience Minimum 7 years of experience including development, IT infrastructure, architecture design, and operations roles Experience with development technologies such as Angular 2+, NodeJS, TypeScript, C#, .NET, Java, SQL Proven ability to adapt to major changes in work tasks or environment Informal leadership experience, typically gained through leading projects Proven experience with designing technical architecture and keeping abreast of existing and emerging technologies Experience consulting with stakeholders to understand needs and guide action Demonstrated problem solving/troubleshooting skills and ability to communicate clearly to stakeholders Proficiency in designing and implementing observability within complex systems Additional Information
Lead the technical designs for highly integrated complex application platforms to optimize security, information leverage and reuse, integration, performance, and availability, ensuring adherence to architecture standards and SLAs. Consult with Solution Architects and project teams to create and document design deliverables for platform solutions, promoting reuse and improvement initiatives. May oversee planning, development, and estimation of technical solutions; collaborate with other technical teams as needed. Collaborate with Lead Architect and stakeholders to provide direction on process improvements. Ensure architectures align with business needs and provide exceptional customer service and solutions. Develop and apply industry best practices for platform-specific designs; research and recommend emerging technologies. Other duties as assigned. All your information will be kept confidential according to EEO guidelines. This position is an Equal Opportunity Employer. Applicants requiring a reasonable accommodation should contact careers@wellmark.com. Wellmark is not considering applicants who require immigration sponsorship now or in the future. There is no sponsorship for this role at this time.
#J-18808-Ljbffr
The Observability Platform Engineer is responsible for designing, building, and maintaining observability platform tools and frameworks that enable development and operations teams to monitor and improve the performance, availability, and reliability of systems. The engineer will collaborate with development, site reliability engineering, DevOps, and infrastructure teams to deliver a seamless observability ecosystem. What you will own
Design, build, and maintain observability platforms with reusability across services in mind. Develop scalable, automated pipelines for ingesting, transforming, and visualizing telemetry data. Integrate observability tools (e.g., Dynatrace, Splunk, Prometheus, Grafana, Datadog, New Relic, OpenTelemetry) with existing infrastructure and applications. Enable root cause analysis through correlation of metrics, logs, and traces. Analyze telemetry data to identify performance bottlenecks and optimize resource allocation for improved efficiency. Define SLIs, SLOs, and error budgets with stakeholders for critical services. Improve incident response by enhancing monitoring dashboards, alerts, and automated notifications. Qualifications
Preferred: 3–5 years of experience in Site Reliability Engineering, DevOps, or Observability/Monitoring engineering roles. Proven experience building or administering observability platforms in production environments. Track record of improving system reliability and reducing MTTR. Hands-on experience with one or more observability platforms: Dynatrace, Prometheus, Grafana, OpenTelemetry, Elastic Stack, Splunk, Datadog, New Relic, AppDynamics, Honeycomb. Strong knowledge of observability concepts: metrics, logs, traces, SLOs/SLIs, error budgets. Experience working within an Agile team environment and deploying OpenTelemetry-based observability pipelines. Prior experience in highly regulated environments with compliance observability needs. Contributions to observability open-source projects and familiarity with chaos engineering practices to validate monitoring and resilience. Certifications from AWS, Microsoft Azure, or Google Cloud. Demonstrated experience coaching/mentoring others and providing guidance to strengthen knowledge and skills. Excellent problem-solving, written and verbal communication skills; ability to explain complex topics to engineers and business stakeholders. Proficiency in programming or scripting languages (Python, Go, Java, Bash) for observability automation. Experience with containerization and orchestration platforms (Docker, Kubernetes). Deep knowledge of cloud platforms (AWS, Azure, GCP), observability/monitoring services, operating systems (Windows/Linux), networking, and containerization. Strong understanding of distributed systems, microservices, and cloud-native architectures. Proficiency in CI/CD pipelines and how observability integrates into DevOps workflows. Knowledge of incident management and on-call practices. Experience with supporting observability and monitoring for Artificial Intelligence agents. Required: Bachelor\'s Degree or direct and applicable work experience Minimum 7 years of experience including development, IT infrastructure, architecture design, and operations roles Experience with development technologies such as Angular 2+, NodeJS, TypeScript, C#, .NET, Java, SQL Proven ability to adapt to major changes in work tasks or environment Informal leadership experience, typically gained through leading projects Proven experience with designing technical architecture and keeping abreast of existing and emerging technologies Experience consulting with stakeholders to understand needs and guide action Demonstrated problem solving/troubleshooting skills and ability to communicate clearly to stakeholders Proficiency in designing and implementing observability within complex systems Additional Information
Lead the technical designs for highly integrated complex application platforms to optimize security, information leverage and reuse, integration, performance, and availability, ensuring adherence to architecture standards and SLAs. Consult with Solution Architects and project teams to create and document design deliverables for platform solutions, promoting reuse and improvement initiatives. May oversee planning, development, and estimation of technical solutions; collaborate with other technical teams as needed. Collaborate with Lead Architect and stakeholders to provide direction on process improvements. Ensure architectures align with business needs and provide exceptional customer service and solutions. Develop and apply industry best practices for platform-specific designs; research and recommend emerging technologies. Other duties as assigned. All your information will be kept confidential according to EEO guidelines. This position is an Equal Opportunity Employer. Applicants requiring a reasonable accommodation should contact careers@wellmark.com. Wellmark is not considering applicants who require immigration sponsorship now or in the future. There is no sponsorship for this role at this time.
#J-18808-Ljbffr