TikTok

Senior Site Reliability Engineer, Cloud Infrastructure - USDS

TikTok, Seattle, Washington, US, 98127

Responsibilities

Drive infrastructure automation and tooling: Design, develop, and maintain solutions for efficient operation, optimization, and comprehensive monitoring of global infrastructure, minimizing manual intervention.

Collaborate on service lifecycle management: Partner with engineering teams to design, deploy, operate, and continuously improve robust and scalable systems and services, from inception to refinement.

Ensure service reliability and performance: Proactively monitor system health, conduct performance testing, and manage incidents to maximize uptime, availability, and adherence to defined SLAs/SLOs.

Execute core SRE practices: Perform on-call duties and production operations, including change management, capacity planning, and disaster recovery, while contributing to documentation and process improvements across teams.

Support cross-functional collaboration: Enable internal platforms that serve business units such as Product, e-Commerce, and Ads/Monetization, in compliance with standards.

Qualifications

Minimum Qualifications

Proficient in one or more programming languages (e.g., Python, Go, Java, C++).

Strong understanding of Linux operating systems and open-source technologies.

Experience in network architecture and troubleshooting, database modeling, cloud systems, and large-scale distributed systems.

Knowledge of monitoring tools and methodologies (such as Prometheus and Grafana), AIOps, APM, and disaster recovery.

Experience in designing, analyzing, and building automation and tools for large-scale systems.

Experience in building solutions with AWS, GCP, Azure, and other cloud services.

Preferred Qualifications

Expertise in technologies such as Kubernetes, Elasticsearch, ClickHouse, message queues, OpenTSDB, service mesh, MySQL, and Redis.

Master's degree in Computer Science, Engineering, or a related field.

As a condition of employment, all successful candidates must be able to establish authorization to work in the United States. The Company does not provide sponsorship for immigration-related benefits.

About USDS

TikTok is the leading destination for short-form mobile video. U.S. Data Security ("USDS") is a subsidiary of TikTok in the U.S. focused on data protection policies and content assurance protocols to safeguard U.S. user data.

Data Security Statement: This role requires the ability to work with and support systems designed to protect sensitive data and information and may be subject to security screenings.

USDS is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities or other protected reasons. If you need assistance or a reasonable accommodation, please reach out to us via our accommodations contact.

USDS is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.
