Traversal Inc.
Overview
Traversal is the AI Site Reliability Engineer (SRE) for the enterprise, already trusted by some of the largest companies in the world to troubleshoot, remediate, and prevent the most complex production incidents. Our mission is to free engineers from endless firefighting and enable them to focus on creative, high-impact work. Our roots are embedded in AI research, and we’re building the premier AI agent lab for the enterprise with researchers from MIT, Harvard, and Berkeley and engineers from Citadel Securities, Cockroach Labs, Datadog, DE Shaw, Meta, Hebbia, Perplexity, Glean, Pinecone, and more.
The Role
As an AI Platform Engineer at Traversal, you’ll work on the core foundations that make Traversal’s AI possible, spanning both agent infrastructure and evaluation systems.
Responsibilities
Agent Infrastructure: Build the frameworks, orchestration layers, and developer tooling that power Traversal’s AI agents for root cause analysis, alert triage, and “chat with your infrastructure/telemetry.” This involves designing scalable distributed systems and abstractions (e.g., MCP servers, multi-agent orchestration, toolkits) that balance research flexibility with production reliability.
Evaluation: Define what “good” looks like for AI performance in the incident management domain. You’ll build live evaluation pipelines, automated scoring systems, and benchmarks; integrate evaluation into the developer lifecycle; and surface these insights to customers as a value-add.
This work combines research (agentic architectures, benchmarking, calibration, finetuning) with engineering (production-scale infra, APIs, distributed systems) to accelerate the entire AI loop: build → evaluate → improve → ship.
Requirements
Proven production-scale software engineering experience.
Experience with LLM-based applications and/or multi-agent systems.
Strong data modeling skills and a track record of writing clean, maintainable code.
Collaborative, impact-driven mindset and ability to work across research and engineering teams.
Nice to Have
Knowledge of software incidents and production SRE workflows.
Prior experience with AI benchmarking or evaluation systems.
Experience creating quantitative scoring systems or benchmarks in new problem domains.
Familiarity with observability stacks (logs, metrics, traces) and telemetry systems.
Background in agentic architectures, orchestration frameworks, or applied AI research.
Compensation
We offer competitive compensation, startup equity, health insurance, and additional benefits. The U.S. base salary range for this full-time, in-person role in New York is $150,000–$300,000, plus equity and benefits. Salary ranges are based on location, level, and role; individual compensation is determined by experience, skills, and job-related knowledge.
Why You Should Join Us
We’ll make sure you’re fully supported with health insurance, a great tech setup, flexible time off, and plenty of in-office snacks. We offer competitive salary and equity packages, and we give thoughtful consideration to every hire on our small, high-impact team. Traversal is fully in-office, five days a week, based in New York near Madison Square Park. We have a collaborative, hard-working culture and are energized by building the future of AI-powered software maintenance.
Location
In-person at our New York City office (Chelsea/Flatiron, near Madison Square Park). Onsite Monday through Friday.
Note
Equal Employment Opportunity: Traversal does not discriminate on the basis of any protected status. This role is based in our NYC office and requires being onsite five days a week.