Veza
Staff/Principal Site Reliability Engineer
We are seeking an exceptional Staff/Principal Site Reliability Engineer to lead critical infrastructure initiatives and drive Innovation across our organization. You’ll architect scalable solutions, navigate complex technical challenges independently, and deliver results under tight deadlines in a fast paced environment. You will work cross‑functionally alongside builders who have helped shape the success of companies such all ways as Google, Okta, AWS, and Snowflake.
Strategic Leadership & Technical Execution
Lead enterprise‑wide reliability and infrastructure projects across multiple teams with high autonomy
Navigate ambiguous problem spaces and deliver innovative solutions under tight deadlines
Architect and deploy solutions for Cloud Prem and SaaS customers at scale
Drive technical innovation and establish SRE best practices across the organization
Respond to critical incidents, lead root cause analysis, and implement long‑term resolutions
Develop automation solutions to streamline operations and reduce manual workload
Participate in on‑call rotation and ensure effective incident handoff and documentation
Cross‑Functional Collaboration & Communication
Partner with Engineering, Product, and Customer Success teams to align reliability goals with business objectives
Communicate complex technical concepts effectively to technical and non‑technical audiences, including executives
Influence technical decisions across teams through thought leadership and demonstrated expertise
Build consensus and Drive adoption of new tools, processes, and architectural patterns
Customer‑Facing Technical Leadership
Provide tier 2/3 technical support to enterprise customers for complex troubleshooting
Work directly with customer technical teams to resolve deployment, configuration, and integration challenges
Conduct technical onboarding and provide expert guidance on platform architecture and best practices
Create customer‑facing documentation, troubleshooting guides, and run‑books
Lead customer calls and technical discussions as a trusted advisor
Team Development
Mentor SRE and engineering team members, elevating technical capabilities
Foster a culture of reliability, operational excellence, and continuous improvement
You have: Required Experience
BS degree in Computer Science or related field (or equivalent practical experience)
7+ years in Site Reliability Engineering, DevOps, or Infrastructure Engineering
Proven track record leading large‑scale, cross‑team infrastructure projects from conception to production
Demonstrated ability to work autonomously on ambiguous projects with tight deadlines
Technical Expertise
5+ years with AWS (VPC, EC2, RDS, EKS, CloudFormation) and cloud automation
Expert‑level experience with Kubernetes, Helm, Linux, and Terraform
Strong experience with GitOps model, distributed version control, and CI/CD pipelines
Proficiency with monitoring tools (Prometheus, Grafana, DataDog)
Strong programming/scripting skills (Python, Go, Bash) for automation
Deep understanding of distributed systems, microservices, and reliability patterns
Experience with Bazel and CueLang a plus
Leadership & Communication
Exceptional ability to articulate complex technical concepts to diverse audiences
Track record of Driving technical change across organizational boundaries
Successfully Delivered multiple complex projects under tight deadlines
Strong customer service orientation with patience and empathy
Work Style
Thrives in ambiguous environments and makes progress without perfect information
Hands‑on, "can do" attitude with bias for action
Low ego and high intellectual curiosity
Comfortable working across time zones
Self‑motivated with strong ownership mentality
Compensation Disclosure $184,000—$240,000 USD
Compensation depends on skills, qualifications, experience, and work location. Variable compensation such as commission is not included.
Our Culture
Ownership Mindset
Act with Integrity
Guardians of our Customers
Opinionated Humility
Build Trust, Earn Trust
Veza is proud to be an equal opportunity employer. We are committed to equal employment opportunities regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, or other applicable legally protected characteristics. We also consider qualified applicants according to applicable federal, state, and local laws. If a candidate with a disability requires an accommodation during the recruitment process, please email recruiting@veza.com.
#J-18808-Ljbffr
Strategic Leadership & Technical Execution
Lead enterprise‑wide reliability and infrastructure projects across multiple teams with high autonomy
Navigate ambiguous problem spaces and deliver innovative solutions under tight deadlines
Architect and deploy solutions for Cloud Prem and SaaS customers at scale
Drive technical innovation and establish SRE best practices across the organization
Respond to critical incidents, lead root cause analysis, and implement long‑term resolutions
Develop automation solutions to streamline operations and reduce manual workload
Participate in on‑call rotation and ensure effective incident handoff and documentation
Cross‑Functional Collaboration & Communication
Partner with Engineering, Product, and Customer Success teams to align reliability goals with business objectives
Communicate complex technical concepts effectively to technical and non‑technical audiences, including executives
Influence technical decisions across teams through thought leadership and demonstrated expertise
Build consensus and Drive adoption of new tools, processes, and architectural patterns
Customer‑Facing Technical Leadership
Provide tier 2/3 technical support to enterprise customers for complex troubleshooting
Work directly with customer technical teams to resolve deployment, configuration, and integration challenges
Conduct technical onboarding and provide expert guidance on platform architecture and best practices
Create customer‑facing documentation, troubleshooting guides, and run‑books
Lead customer calls and technical discussions as a trusted advisor
Team Development
Mentor SRE and engineering team members, elevating technical capabilities
Foster a culture of reliability, operational excellence, and continuous improvement
You have: Required Experience
BS degree in Computer Science or related field (or equivalent practical experience)
7+ years in Site Reliability Engineering, DevOps, or Infrastructure Engineering
Proven track record leading large‑scale, cross‑team infrastructure projects from conception to production
Demonstrated ability to work autonomously on ambiguous projects with tight deadlines
Technical Expertise
5+ years with AWS (VPC, EC2, RDS, EKS, CloudFormation) and cloud automation
Expert‑level experience with Kubernetes, Helm, Linux, and Terraform
Strong experience with GitOps model, distributed version control, and CI/CD pipelines
Proficiency with monitoring tools (Prometheus, Grafana, DataDog)
Strong programming/scripting skills (Python, Go, Bash) for automation
Deep understanding of distributed systems, microservices, and reliability patterns
Experience with Bazel and CueLang a plus
Leadership & Communication
Exceptional ability to articulate complex technical concepts to diverse audiences
Track record of Driving technical change across organizational boundaries
Successfully Delivered multiple complex projects under tight deadlines
Strong customer service orientation with patience and empathy
Work Style
Thrives in ambiguous environments and makes progress without perfect information
Hands‑on, "can do" attitude with bias for action
Low ego and high intellectual curiosity
Comfortable working across time zones
Self‑motivated with strong ownership mentality
Compensation Disclosure $184,000—$240,000 USD
Compensation depends on skills, qualifications, experience, and work location. Variable compensation such as commission is not included.
Our Culture
Ownership Mindset
Act with Integrity
Guardians of our Customers
Opinionated Humility
Build Trust, Earn Trust
Veza is proud to be an equal opportunity employer. We are committed to equal employment opportunities regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, or other applicable legally protected characteristics. We also consider qualified applicants according to applicable federal, state, and local laws. If a candidate with a disability requires an accommodation during the recruitment process, please email recruiting@veza.com.
#J-18808-Ljbffr