Logo
Okta, Inc.

Principal Site Reliability Engineer, Observability

Okta, Inc., Bellevue

Save Job

Principal Site Reliability Engineer, Observability

Bellevue, WA

Get to know Okta
Okta is The World’s Identity Company. We free everyone to safely use any technology, anywhere, on any device or app. Our flexible and neutral products, Okta Platform and Auth0 Platform, provide secure access, authentication, and automation, placing identity at the core of business security and growth.
At Okta, we celebrate a variety of perspectives and experiences. We are not looking for someone who checks every single box - we’re looking for lifelong learners and people who can make us better with their unique experiences.
Join our team! We’re building a world where Identity belongs to you.

We're searching for a Principal Site Reliability Engineer (SRE) with a profound passion for observability to join our team. This isn't just a hands-on role; you'll be a thought leader, shaping the strategy and execution of our observability services—logs, metrics, and tracing—both within the Observability team and across the broader organization. We're looking for someone who can help us see clearly when things get cloudy!

Your expertise in Kubernetes will be crucial as we undergo a significant replatforming initiative. You will guide the design, implementation, and operation of our advanced observability capabilities on the new platform.

A cornerstone of this role is your exceptional ability to manage and influence stakeholders, ensuring their needs are met, expectations are managed, and they're delighted with the insights our observability services provide. We believe that our important stakeholders deserve metric-ulous attention.

What You'll Be Doing

  • Becoming deeply familiar with all corners of a critical SaaS platform utilized by millions of customers daily, with an eye towards providing unparalleled observability insights into its behavior and performance.
  • Engaging with stakeholders across the group to not only understand their component boundaries and dependencies but also to drive the adoption of observability best practices as a guide and coach for your teammates and the wider engineering organization.
  • Championing the evolution of our SDLC : defining how we ideate, onboard, operate, and scale microservices and features in a secure, performant, always-on manner, with observability (logs, metrics, tracing) as a foundational element from inception.
  • Identifying, understanding, and automating away manual processes through clever code and smart architecture, particularly focusing on how automation can enhance the collection, analysis, and actionability of observability data.
  • Supporting a 24x7 online environment as part of a global on-call rotation, leveraging your deep observability expertise to rapidly identify, diagnose, and resolve the most complex incidents.
  • Advocating for and establishing best practices for scalable, reliable, and resilient systems and services across all of WIC engineering, with a strong emphasis on fostering an observability-driven culture.

What You'll Bring to the Role

  • 9+ years of experience as a site reliability or platform engineer, preferably in a fast-scaling environment, with a significant and demonstrable track record in leading observability initiatives.
  • 2+ years of experience designing, scaling, and operating observability solutions for applications within a Kubernetes environment. You’ll be adept at leveraging Kubernetes capabilities to gain insights into workload performance and health.
  • Familiarity with large-scale containerized deployments, both microservice and monolithic, coupled with a deep understanding of their unique observability challenges and solutions.
  • A proactive and tenacious mindset: always willing to go the extra mile to identify a problem and drive its resolution, especially when it pertains to improving system visibility and reliability.
  • A strong passion for mentoring and encouraging the development of engineering peers, leading by example in adopting and promoting robust observability practices.
  • Deep knowledge of CI/CD principles, Linux fundamentals, OS hardening, networking concepts, and Internet protocols, applied strategically to build resilient and observable systems.
  • Strong skills in multiple operational tooling languages such as Python, Rust, or Go, for automating sophisticated observability tasks and integrations.
  • Proven ability to effectively manage and influence diverse stakeholders, translating complex technical observability concepts into clear, actionable insights, and ensuring high levels of satisfaction with observability services.
  • Expert proficiency with Splunk or similar for large-scale log management and advanced analysis.
  • Extensive experience with Grafana for designing and implementing sophisticated dashboards and visualizations of critical metrics.

This role requires in-person onboarding and travel to our San Francisco, CA HQ office during the first week of employment.

#LI-LSS1

Below is the annual base salary range for candidates located in California, Colorado, New York and Washington. Your actual base salary will depend on factors such as your skills, qualifications, experience, and work location. In addition, Okta offers equity (where applicable), bonus, and benefits, including health, dental and vision insurance, 401(k), flexible spending account, and paid leave (including PTO and parental leave) in accordance with our applicable plans and policies. To learn more about our Total Rewards program please visit: .

The annual base salary range for this position for candidates located in California (excluding San Francisco Bay Area), Colorado, New York, and Washington is between:

Get to know Okta
Okta is The World’s Identity Company. We free everyone to safely use any technology, anywhere, on any device or app. Our flexible and neutral products, Okta Platform and Auth0 Platform, provide secure access, authentication, and automation, placing identity at the core of business security and growth.
At Okta, we celebrate a variety of perspectives and experiences. We are not looking for someone who checks every single box - we’re looking for lifelong learners and people who can make us better with their unique experiences.
Join our team! We’re building a world where Identity belongs to you.

We're searching for a Principal Site Reliability Engineer (SRE) with a profound passion for observability to join our team. This isn't just a hands-on role; you'll be a thought leader, shaping the strategy and execution of our observability services—logs, metrics, and tracing—both within the Observability team and across the broader organization. We're looking for someone who can help us see clearly when things get cloudy!

Your expertise in Kubernetes will be crucial as we undergo a significant replatforming initiative. You will guide the design, implementation, and operation of our advanced observability capabilities on the new platform.

A cornerstone of this role is your exceptional ability to manage and influence stakeholders, ensuring their needs are met, expectations are managed, and they're delighted with the insights our observability services provide. We believe that our important stakeholders deserve metric-ulous attention.

What You'll Be Doing

  • Becoming deeply familiar with all corners of a critical SaaS platform utilized by millions of customers daily, with an eye towards providing unparalleled observability insights into its behavior and performance.
  • Engaging with stakeholders across the group to not only understand their component boundaries and dependencies but also to drive the adoption of observability best practices as a guide and coach for your teammates and the wider engineering organization.
  • Championing the evolution of our SDLC : defining how we ideate, onboard, operate, and scale microservices and features in a secure, performant, always-on manner, with observability (logs, metrics, tracing) as a foundational element from inception.
  • Identifying, understanding, and automating away manual processes through clever code and smart architecture, particularly focusing on how automation can enhance the collection, analysis, and actionability of observability data.
  • Supporting a 24x7 online environment as part of a global on-call rotation, leveraging your deep observability expertise to rapidly identify, diagnose, and resolve the most complex incidents.
  • Advocating for and establishing best practices for scalable, reliable, and resilient systems and services across all of WIC engineering, with a strong emphasis on fostering an observability-driven culture.

What You'll Bring to the Role

  • 9+ years of experience as a site reliability or platform engineer, preferably in a fast-scaling environment, with a significant and demonstrable track record in leading observability initiatives.
  • 2+ years of experience designing, scaling, and operating observability solutions for applications within a Kubernetes environment. You’ll be adept at leveraging Kubernetes capabilities to gain insights into workload performance and health.
  • Familiarity with large-scale containerized deployments, both microservice and monolithic, coupled with a deep understanding of their unique observability challenges and solutions.
  • A proactive and tenacious mindset: always willing to go the extra mile to identify a problem and drive its resolution, especially when it pertains to improving system visibility and reliability.
  • A strong passion for mentoring and encouraging the development of engineering peers, leading by example in adopting and promoting robust observability practices.
  • Deep knowledge of CI/CD principles, Linux fundamentals, OS hardening, networking concepts, and Internet protocols, applied strategically to build resilient and observable systems.
  • Strong skills in multiple operational tooling languages such as Python, Rust, or Go, for automating sophisticated observability tasks and integrations.
  • Proven ability to effectively manage and influence diverse stakeholders, translating complex technical observability concepts into clear, actionable insights, and ensuring high levels of satisfaction with observability services.
  • Expert proficiency with Splunk or similar for large-scale log management and advanced analysis.
  • Extensive experience with Grafana for designing and implementing sophisticated dashboards and visualizations of critical metrics.

This role requires in-person onboarding and travel to our San Francisco, CA HQ office during the first week of employment.

#LI-LSS1

Below is the annual base salary range for candidates located in California, Colorado, New York and Washington. Your actual base salary will depend on factors such as your skills, qualifications, experience, and work location. In addition, Okta offers equity (where applicable), bonus, and benefits, including health, dental and vision insurance, 401(k), flexible spending account, and paid leave (including PTO and parental leave) in accordance with our applicable plans and policies. To learn more about our Total Rewards program please visit: .

The annual base salary range for this position for candidates located in California (excluding San Francisco Bay Area), Colorado, New York, and Washington is between: $194,000 — $290,000 USD

What you can look forward to as a Full-Time Okta employee!

  • Amazing Benefits
  • Making Social Impact
  • Developing Talent and Fostering Connection + Community at Okta

Okta cultivates a dynamic work environment, providing the best tools, technology and benefits to empower our employees to work productively in a setting that best and uniquely suits their needs. Each organization is unique in the degree of flexibility and mobility in which they work so that all employees are enabled to be their most creative and successful versions of themselves, regardless of where they live. Find your place at Okta today! .
Some roles may require travel to one of our office locations for in-person onboarding.

Okta is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, ancestry, marital status, age, physical or mental disability, or status as a protected veteran. We also consider for employment qualified applicants with arrest and convictions records, consistent with applicable laws.
If reasonable accommodation is needed to complete any part of the job application, interview process, or onboarding please use this Form to request an accommodation.

Okta is committed to complying with applicable data privacy and security laws and regulations. For more information, please see our Privacy Policy at .

U.S. Equal Opportunity Employment Information
Read more

Individuals seeking employment at this company are considered without regards to race, color, religion,national origin, age, sex, marital status, ancestry, physical or mental disability, veteran status, genderidentity, or sexual orientation. When submitting your application above, you are being given theopportunity to provide information about your race/ethnicity, gender, and veteran status.

Completion of the form is entirely voluntary . Whatever your decision, it will not beconsidered in the hiring process or thereafter. Any information that you do provide will be recorded andmaintained in a confidential file.

If you believe you belong to any of the categories of protected veterans listed below, please indicate bymaking the appropriate selection. As a government contractor subject to Vietnam Era Veterans ReadjustmentAssistance Act (VEVRAA), we request this information in order to measure the effectiveness of the outreachand positive recruitment efforts we undertake pursuant to VEVRAA. Classification of protected categoriesis as follows:

A "disabled veteran" is one of the following: a veteran of the U.S. military, ground, naval or air servicewho is entitled to compensation (or who but for the receipt of military retired pay would be entitled tocompensation) under laws administered by the Secretary of Veterans Affairs; or a person who was dischargedor released from active duty because of a service-connected disability.

A "recently separated veteran" means any veteran during the three-year period beginning on the date ofsuch veteran's discharge or release from active duty in the U.S. military, ground, naval, or air service.

An "active duty wartime or campaign badge veteran" means a veteran who served on active duty in the U.S.military, ground, naval or air service during a war, or in a campaign or expedition for which a campaignbadge has been authorized under the laws administered by the Department of Defense.

An "Armed forces service medal veteran" means a veteran who, while serving on active duty in the U.S.military, ground, naval or air service, participated in a United States military operation for which anArmed Forces service medal was awarded pursuant to Executive Order 12985.

Individuals seeking employment at this company are considered without regards to race, color, religion,national origin, age, sex, marital status, ancestry, physical or mental disability, veteran status, genderidentity, or sexual orientation. When submitting your application above, you are being given theopportunity to provide information about your race/ethnicity, gender, and veteran status.

Completion of the form is entirely voluntary . Whatever your decision, it will not beconsidered in the hiring process or thereafter. Any information that you do provide will be recorded andmaintained in a confidential file.

If you believe you belong to any of the categories of protected veterans listed below, please indicate bymaking the appropriate selection. As a government contractor subject to Vietnam Era Veterans ReadjustmentAssistance Act (VEVRAA), we request this information in order to measure the effectiveness of the outreachand positive recruitment efforts we undertake pursuant to VEVRAA. Classification of protected categoriesis as follows:

A "disabled veteran" is one of the following: a veteran of the U.S. military, ground, naval or air servicewho is entitled to compensation (or who but for the receipt of military retired pay would be entitled tocompensation) under laws administered by the Secretary of Veterans Affairs; or a person who was dischargedor released from active duty because of a service-connected disability.

A "recently separated veteran" means any veteran during the three-year period beginning on the date ofsuch veteran's discharge or release from active duty in the U.S. military, ground, naval, or air service.

An "active duty wartime or campaign badge veteran" means a veteran who served on active duty in the U.S.military, ground, naval or air service during a war, or in a campaign or expedition for which a campaignbadge has been authorized under the laws administered by the Department of Defense.

An "Armed forces service medal veteran" means a veteran who, while serving on active duty in the U.S.military, ground, naval or air service, participated in a United States military operation for which anArmed Forces service medal was awarded pursuant to Executive Order 12985.

Pay Transparency

Okta complies with all applicable federal, state, and local pay transparency rules. For additionalinformation about the federal requirements, click here .

Voluntary Self-Identification of Disability
Form CC-305
Page 1 of 1
OMB Control Number 1250-0005
Expires 04/30/2026

Why are you being asked to complete this form?

We are a federal contractor or subcontractor. The law requires us to provide equal employment opportunity to qualified people with disabilities. We have a goal of having at least 7% of our workers as people with disabilities. The law says we must measure our progress towards this goal. To do this, we must ask applicants and employees if they have a disability or have ever had one. People can become disabled, so we need to ask this question at least every five years. Completing this form is voluntary, and we hope that you will choose to do so. Your answer is confidential. No one who makes hiring decisions will see it. Your decision to complete the form and your answer will not harm you in any way. If you want to learn more about the law or this form, visit the U.S. Department of Labor's Office of Federal Contract Compliance Programs (OFCCP) website at

Completing this form is voluntary, and we hope that you will choose to do so. Your answer is confidential. No one who makes hiring decisions will see it. Your decision to complete the form and your answer will not harm you in any way. If you want to learn more about the law or this form, visit the U.S. Department of Labor’s Office of Federal Contract Compliance Programs (OFCCP) website at .

How do you know if you have a disability?

A disability is a condition that substantially limits one or more of your “major life activities.” If you have or have ever had such a condition, you are a person with a disability. Disabilities include, but are not limited to:

  • Alcohol or other substance use disorder (not currently using drugs illegally)
  • Autoimmune disorder, for example, lupus, fibromyalgia, rheumatoid arthritis, HIV/AIDS
  • Blind or low vision
  • Cancer (past or present)
  • Cardiovascular or heart disease
  • Celiac disease
  • Cerebral palsy
  • Deaf or serious difficulty hearing
  • Diabetes
  • Disfigurement, for example, disfigurement caused by burns, wounds, accidents, or congenital disorders
  • Epilepsy or other seizure disorder
  • Gastrointestinal disorders, for example, Crohn's Disease, irritable bowel syndrome
  • Intellectual or developmental disability
  • Mental health conditions, for example, depression, bipolar disorder, anxiety disorder, schizophrenia, PTSD
  • Missing limbs or partially missing limbs
  • Mobility impairment, benefiting from the use of a wheelchair, scooter, walker, leg brace(s) and/or other supports
  • Nervous system condition, for example, migraine headaches, Parkinson’s disease, multiple sclerosis (MS)
  • Neurodivergence, for example, attention-deficit/hyperactivity disorder (ADHD), autism spectrum disorder, dyslexia, dyspraxia, other learning disabilities
  • Partial or complete paralysis (any cause)
  • Pulmonary or respiratory conditions, for example, tuberculosis, asthma, emphysema
  • Short stature (dwarfism)
  • Traumatic brain injury

PUBLIC BURDEN STATEMENT: According to the Paperwork Reduction Act of 1995 no persons are required to respond to a collection of information unless such collection displays a valid OMB control number. This survey should take about 5 minutes to complete.

The foundation for secure connections between people and technology

Okta is the leading independent provider of identity for the enterprise. The Okta Identity Cloud enables organizations to securely connect the right people to the right technologies at the right time. With over 7,000 pre-built integrations to applications and infrastructure providers, Okta customers can easily and securely use the best technologies for their business. More than 19,300 organizations, including JetBlue, Nordstrom, Slack, T-Mobile, Takeda, Teach for America, and Twilio, trust Okta to help protect the identities of their workforces and customers.

Follow Okta

First Name

Last Name

Email

Phone

Resume

Upload PDF

Paste

Resume/CV Upload Resume/CV (PDF must be less than 8 MB )

Resume/CV

Upload PDF

Paste

Upload Cover Letter (PDF must be less than 8 MB )

LinkedIn Profile

Website

Are you a U.S. Person (U.S. Citizen, National, Lawful Permanent Resident, Refugee, or Asylee), and can you provide documentation establishing U.S. Person status upon hire?Please note:Candidates who require visa sponsorship (e.g. H-1B, F-1, O-1, L-1, etc.) are not U.S. Persons.

To the best of your knowledge, do you have any family members / relatives or personal relationships at Okta or at any suppliers, partners, or vendors that have a business relationship with Okta?(For purposes of this question, a “family member / relative or personal relationship” is defined as close personal friends (including sexual and/or romantic relationships), close relatives (spouse, partner, children, cousins, aunts, uncles, nieces, nephews, grandparents or grandchildren), someone who lives in your household, or anyone else with whom you have a close enough personal relationship or connection that it could improperly bias your conduct or decision making or be perceived to be capable of impacting your conduct or decision making.

If yes, please identify name of person / vendor and describe relationship / association:

Do you have any outside business activity(ies) (advisory, consulting, or board roles, or side businesses) that you would continue engaging in or plan to engage in if you joined Okta in this role?

If yes, please describe:

Have you worked for Okta in the past?

I acknowledge and agree to the processing of my personal data in accordance with Okta's Privacy Policy.

I would like to be considered for future positions at Okta.

Yes

Are you legally eligible to work in the United States without sponsorship now or in the future?

Do you live within 50 miles of Bellevue?

Where do you currently reside?

Do you have a minimum of 9+ years of experience as a site reliability or platform engineer?

Do you have a minimum of 2+ years of experience designing, scaling, and operating observability solutions for applications within a Kubernetes environment

#J-18808-Ljbffr