Software Engineer, Site Reliability Engineering (SRE)
About The Company
LMArena is an engineering-first startup redefining how the world evaluates large language models. Created in 2023 by UC Berkeley researchers, our neutral, community-driven benchmarking platform attracts over one million monthly users, who compare leading models from OpenAI, Google, Anthropic, and more in pairwise matchups that deliver real-time insights into the rapidly evolving LLM landscape. LMArena is scaling fast to build the next generation of AI testing infrastructure and set the industry standard for model evaluation.
Position Overview
We are seeking an experienced, security‑minded Site Reliability Engineer to own and elevate our infrastructure, processes, and operational security. We recognize that much of the security domain now falls within the SRE’s remit, and we are building our team accordingly. You will:
- Take end‑to‑end ownership of infrastructure operations across Cloudflare, Vercel, and our CI/CD pipelines.
- Embed security best practices into every layer of the stack, ensuring resilience against emerging threats.
- Establish processes and procedures for onboarding and ramping up new team members efficiently, and mentor incoming and junior members of the team.
Key Responsibilities
- Infrastructure as Code – Manage Terraform modules and secrets pipelines; champion immutable, auditable infrastructure.
- Cloudflare Operations – Configure, monitor, and harden WAF, DDoS protections, bot management, and CDN caching strategies.
- Vercel & Edge Runtime – Own deployment architecture, performance tuning, and incident response for our Next.js‑based front end and Edge Functions.
- CI/CD & Release Engineering – Design, implement, and maintain secure pipelines (GitHub Actions, Vercel integrations) with automated testing and vulnerability scanning.
- Change Management & Documentation – Establish and enforce a lightweight but disciplined RFC/change‑control process; maintain comprehensive runbooks and architecture diagrams.
- Observability & Incident Response – Expand monitoring, logging, and alerting; lead post‑incident reviews and drive continual improvement.
- Mentorship – Provide day‑to‑day guidance to engineers and junior SREs, fostering a culture of ownership and learning.
- Compliance Support – Partner with ProdSec and GRC teams on SOC2, ISO27001, and customer security questionnaires.
- Manage and maintain internal and external-facing infrastructure.
- Configure and maintain log aggregation, and the infrastructure used to store logs across the business.
Qualifications
- 7+ years in SRE/DevOps roles for high‑traffic SaaS or consumer web products.
- Proven expertise securing and scaling Cloudflare and Vercel (or comparable CDN/edge and serverless platforms).
- Deep understanding of web application security, networking, TLS, and zero‑trust principles.
- Strong proficiency with infrastructure as code (Terraform, Pulumi, or similar) and serverless build pipelines (GitHub Actions or similar).
- Strong programming abilities (Go, Python, TypeScript) and scripting skills.
- Demonstrated success designing and enforcing change‑management workflows.
- Excellent written communication—able to produce clear runbooks and architecture docs.
- Track record mentoring or leading junior engineers.
Preferred Qualifications
- Experience with container orchestration (Kubernetes or Nomad).
- Experience with serverless stacks.
- Certifications such as AWS/GCP Professional, GIAC‑GCSA, CKS, or CISSP.
Why Join Us
- Impact – You’ll set the foundation for reliability and security across a rapidly growing AI benchmarking platform.
- Culture – Engineering‑first, documentation‑driven, and community‑obsessed.
- Compensation – Competitive salary, meaningful equity, comprehensive benefits, and professional‑development budget.
Seniority level
Mid-Senior level
Employment type
Full-time
Job function
Engineering and Information Technology
Industries
Research Services