Vivun
Lead Observability Engineer (Remote, North America)
This role is part of Vivun, an AI sales platform that builds observable and reliable systems across agentic and SaaS infrastructure. As Lead Observability Engineer you will design and implement end‑to‑end observability patterns that span infrastructure, applications, and agentic workloads.
Own the end‑to‑end observability strategy for Ava, defining standards, tools, and patterns that ensure reliable visibility across infrastructure and agentic components.
Design and implement correlation models that link agent behavior, LLM interactions, and SaaS telemetry into actionable insights.
Unify observability tooling across teams, ensuring metrics, logs, and traces flow into a central platform (e.g., Observe, Datadog, or equivalent).
Collaborate with engineering, QA, and product to embed observability best practices into development workflows, CI/CD, and quality gates.
Establish enablement frameworks—documentation, dashboards, and templates—that make observability self‑serve for all engineering teams.
Partner with teammates to align observability with infrastructure reliability, alerting, and incident response patterns.
Contribute to performance and reliability strategy, helping define how we measure agent quality, responsiveness, and system scalability.
Desired Skills & Experience
6+ years in SRE, DevOps, or Observability Engineering, with at least 2 years leading or designing observability initiatives.
Deep knowledge of observability tooling (OpenTelemetry, Prometheus, Grafana, Datadog, Honeycomb, Observe, etc.) and distributed tracing practices.
Experience with agentic / LLM‑based systems, including LangChain, Celery, OpenAI APIs, or similar orchestration frameworks.
Strong understanding of how to instrument, trace, and correlate AI/LLM workflows with infrastructure‑level telemetry.
Proven ability to define cross‑team standards, influence engineering culture, and establish scalable monitoring patterns.
Strong collaboration and communication skills: you enable rather than dictate.
Nice to Have
Experience building observability into hybrid SaaS + agent architectures.
Background in data pipelines or analytics observability (tracing data lineage, monitoring model drift).
Familiarity with Python‑ or Node.js‑based observability SDKs.
Prior experience scaling observability in a startup or rapid‑growth environment.
You Are
A believer in Vivun’s core values: Set the Standard. Take Ownership. Stay Curious. Fast & Focused.
Builder at heart: you want to build the observability foundations for a next‑generation agentic platform.
Innovative problem solver: you are eager to tackle cutting‑edge monitoring challenges at the intersection of SaaS and AI.
Collaborative by nature: you thrive in a high‑impact engineering culture that values enablement, empowerment, and shared ownership.
Experienced in high‑growth startup environments: you move fast, adapt, and derive priorities, requirements, and goals from company context.
What You Will Have at Vivun
Competitive salary and full health benefits.
Stock options at a well‑funded, pre‑IPO company on a fast‑growth track.
Flexible work schedule and fully remote work.
Unlimited PTO with two weeks designated as a “quiet period” each year.
An experienced team that will fight beside you in the trenches to accomplish your goals.
Compensation Range: $185,000 – $205,000 per year.