Logo
InStride

Principal Site Reliability Engineer (SRE)

InStride, California, Missouri, United States, 65018

Save Job

Principal Site Reliability Engineer (SRE)

Join to apply for the Principal Site Reliability Engineer (SRE) role at InStride. At InStride, people are our purpose. We partner with leading employers to unlock opportunities for employees through education programs that align with personal career goals and company business goals. We empower our partners’ employees to advance their careers and achieve meaningful growth. Candidates must be located in one of the following states to be considered: AZ, CA, CO, CT, FL, GA, IL, IN, KS, LA, MD, MA, MI, MO, NV, NH, NJ, NY, PA, OH, OR, TX, VA, WA, WI.

What we're looking for We’re looking for a Principal Site Reliability Engineer (SRE) to join InStride’s growing engineering team. This is a highly technical role for an individual contributor who thrives at the intersection of cloud architecture, automation, and reliability engineering. You will be the go-to AWS expert for complex initiatives, setting technical direction, and raising the bar for operational excellence across our platform. Every system you design, every automation you implement, and every safeguard you put in place will directly support our mission of expanding access to life-changing education for working adults around the globe.

Cloud Architecture & Strategy: Design and optimize AWS environments that balance scalability, resilience, and cost efficiency for enterprise workloads.

Technical Leadership & Mentorship: Guide engineers on Kubernetes, DevSecOps, and AWS-native design patterns.

Infrastructure as Code Mastery: Build reusable IaC libraries with AWS CDK, Terraform, or CloudFormation.

Security & Compliance by Design: Enforce least-privilege IAM, encryption-by-default, and policy-as-code guardrails.

Observability & Reliability Engineering: Define SLIs/SLOs, manage error budgets, and implement monitoring with Prometheus, Grafana, and AWS tools.

CI/CD Excellence: Optimize pipelines with Harness and GitHub for faster, safer deployments.

Networking & Resilience: Architect secure VPCs, load balancing, and multi-region failover with AWS networking services.

Automation & Self-Service Enablement: Deliver automation and internal developer portal capabilities for self-provisioning infrastructure.

Who you are

10+ years of experience in SRE, DevOps, or Platform Engineering with production AWS workloads.

Hands-on expertise with AWS EKS, Kubernetes networking, Helm, autoscaling (Karpenter/Cluster Autoscaler), serverless architectures, and API Gateways.

Proven delivery of service mesh solutions (Istio, Linkerd, or AWS App Mesh).

Proficiency with IaC using AWS CDK, Terraform, or CloudFormation.

Strong programming and automation skills in Go, Python, or TypeScript; Bash proficiency helpful.

Experience with policy-as-code (OPA/Rego) in CI/CD pipelines.

Solid understanding of SLI/SLO/error-budget methodologies and monitoring (Prometheus, Grafana, CloudWatch).

Knowledge of AWS security best practices (IAM, encryption, OS hardening, compliance).

Excellent communication skills to translate reliability metrics into business impact.

Experience mentoring engineers and influencing enterprise AWS/DevOps strategies.

Familiarity with Internal Developer Portals (Backstage, Port, Cortex) and self-service automation is a plus.

How you will create impact

Elevate platform reliability: design and operate multi-region, fault-tolerant systems for constant availability.

Advance automation at scale: deliver IaC libraries, CI/CD pipelines, and self-service capabilities.

Champion security and compliance: implement defense-in-depth and policy-as-code guardrails.

Drive observability maturity: define and enforce SLIs/SLOs and monitoring frameworks.

Enable seamless service connectivity: manage service meshes across Kubernetes workloads.

Influence technical direction: shape InStride’s AWS strategy for scalability and cost efficiency.

Mentor and uplift engineers: lead design reviews and promote modern DevOps and SRE practices.

Compensation At InStride, final offer amounts depend on location, depth of experience, interview performance, and equity with other team members. Compensation range: $165,000—$185,000 USD. We encourage conversations with your recruiter to learn about total compensation and benefits.

Benefits InStride offers benefits including a 401(k) plan with company match, flexible vacation, paid family leave, best-in-class health coverage, and more. The Step Forward program provides access to 2,800+ online certificates and degrees, with tuition coverage starting Day 1.

Diversity and Inclusion InStride fosters a culture of belonging, authenticity, and inclusion. If you have a disability or special need that requires accommodation, please let your recruiter know.

Policies & Disclosure InStride may require COVID vaccination for office entry or events in the future but does not require it at this time. For questions on how we use personal information of job applicants, refer to InStride's Job Applicant Privacy Policy. Beware of recruiting scams. InStride does not require financial transactions to be eligible for employment. If you receive a message purporting to be from InStride asking for financial information, do not respond and notify recruiting@instride.com.

About InStride InStride is a human capital management company that helps organizations upskill employees and fill critical workforce roles through education programs. We partner with companies to drive meaningful social and business outcomes. Visit instride.com or follow InStride on LinkedIn for more information.

#J-18808-Ljbffr