TikTok
Site Reliability Engineer, Infrastructure and Assurance Services - USDS
TikTok, San Jose, California, United States, 95199
Responsibilities
The Infra SRE‑Infrastructure‑Assurance team extends TikTok’s infrastructure operability, observability, visibility, and automation. We aim to provide holistic insights and solutions with minimal manual interventions, solving large‑scale complex issues in a hyper‑growth team.
To facilitate collaboration, the organization follows a hybrid work schedule that requires employees to work in the office three days a week or as directed by their manager. We regularly review this model, and specific requirements may change at any time.
Perform SRE duties and operations on supported services in production, including on‑call rotations, maintenance, change management, monitoring, incident response, capacity planning, and disaster recovery.
Maximize system uptime, availability, and stability to meet functional and performance SLAs.
Build and contribute to effective documentation, such as operational runbooks, SOPs, and SLAs/SLOs.
Initiate and lead scripting, tooling, and automation to streamline processes and minimize manual effort.
Work cross‑functionally and regionally with SRE, Dev, QA, and PM teams to handle incidents and improve processes.
Manage and prioritize tasks and projects for high productivity and accurate, on-time delivery.
Qualifications
Minimum Qualifications:
Bachelor’s degree in Computer Science, a related field, or equivalent practical experience.
Demonstrated experience in software development with one or more programming languages.
Experience with Linux operating systems, networking, database concepts, monitoring, and shell scripting.
Superb analytical, problem‑solving, and critical‑thinking skills.
Excellent communication skills; a team player, self‑starter, and fast learner.
Preferred Qualifications:
Master’s degree in Computer Science, Engineering or a related field.
Proficiency in any of the following languages: Python, Go, C++.
Expertise in SRE philosophy, AIOps, APM, or disaster recovery.
Experience with Kubernetes, Elasticsearch, ClickHouse, message queues, OpenTSDB, or service mesh.
Candidates for this position must be legally authorized to work in the United States. This position is not eligible for visa sponsorship or support.
About USDS
TikTok is the leading destination for short‑form mobile video. Our mission is to inspire creativity and bring joy. U.S. Data Security (USDS) is a subsidiary of TikTok in the U.S. This new, security‑first division focuses on oversight and protection of the TikTok platform and U.S. user data, ensuring millions of Americans can safely use our service.
Why Join Us
Inspiring creativity is at the core of TikTok’s mission. Our innovative product is built to help people authentically express themselves, discover, and connect. We lead with curiosity, humility, and a desire to make an impact in a rapidly growing tech company. By constantly iterating and fostering an "Always Day 1" mindset, we achieve meaningful breakthroughs for ourselves, our company, and our users.
Diversity & Inclusion
TikTok is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. We celebrate diverse voices and aim to build an environment that reflects the many communities we reach.
USDS Reasonable Accommodation
USDS is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs, or other reasons protected by applicable laws. If you need assistance or a reasonable accommodation, please reach out to us at https://tinyurl.com/USDS-RA.
Job Information
Compensation:
$118,657 – $259,200 annually (may vary based on qualifications, skills, competencies, and location). Base pay is part of a total package that may include discretionary bonuses, incentives, and restricted stock units.
Benefits:
Medical, dental, and vision insurance; 401(k) savings plan with company match; paid parental leave; short‑term and long‑term disability coverage; life insurance; wellbeing benefits; 10 paid holidays per year; 10 paid sick days per year; 17 days of paid personal time (prorated upon hire with increasing accruals by tenure).
Seniority level: Associate
Employment type: Full‑time
Job function: Engineering and Information Technology