Roku
Senior Software Engineer, DevOps - Data Platform
Roku is the #1 TV streaming platform in the U.S., Canada, and Mexico, and we are building toward powering every television in the world. We connect consumers to the content they love, enable publishers to build and monetize large audiences, and provide advertisers unique capabilities to engage consumers.
From your first day at Roku, you'll make a valuable contribution. We're a fast-growing public company where no one is a bystander. We offer you the opportunity to delight millions of TV streamers around the world while gaining meaningful experience across a variety of disciplines.
About the Team
Roku runs one of the largest data lakes in the world. We store over 70 petabytes of data, run more than 10 million queries per month, and scan over 100 petabytes of data per month. The Big Data team builds, runs, and supports the platform that makes this possible. We provide tooling to acquire, generate, process, monitor, validate, and access data in the lake for both streaming and batch data, and we generate the foundational data. Systems include Scribe, Kafka, Hive, Presto, Spark, Flink, Pinot, and others. The team is actively involved in Open Source, with plans to increase engagement over time.
About the Role
We are seeking a skilled engineer with exceptional DevOps skills to join our team. Responsibilities include automating and scaling Big Data and Analytics technology stacks on cloud infrastructure, building CI/CD pipelines, setting up monitoring and alerting for production infrastructure, and keeping our technology stacks up to date.
What you’ll be doing:
Develop best practices for cloud infrastructure provisioning and disaster recovery, and guide developers on their adoption
Scale Big Data and distributed systems
Collaborate on system architecture with developers for optimal scaling, resource utilization, fault tolerance, reliability, and availability
Conduct low-level systems debugging, performance measurement & optimization on large production clusters and low-latency services
Create scripts and automation that can react quickly to infrastructure issues and take corrective actions
Participate in architecture discussions, influence the product roadmap, and take ownership of new projects
Collaborate and communicate with a geographically distributed team
We’re excited if you have:
Bachelor’s degree, or equivalent work experience
8+ years of experience in DevOps or Site Reliability Engineering
Experience with cloud infrastructure such as Amazon Web Services (AWS), Google Cloud Platform (GCP), Microsoft Azure, or other public cloud platforms. GCP is preferred.
Experience with at least 3 of the technologies/tools mentioned here: Big Data / Hadoop, Kafka, Spark, Airflow, Presto, Druid, OpenSearch, HAProxy, or Hive
Experience with Kubernetes and Docker
Experience with Terraform
Strong background in Linux/Unix
Experience with system engineering around edge cases, failure modes, and disaster recovery
Experience with shell scripting, or equivalent programming skills in Python
Experience working with monitoring and alerting tools such as Grafana and PagerDuty, and participating in on-call rotations
Experience with Chef, Puppet, or Ansible
Experience with Networking, Network Security, and Data Security
AI literacy and curiosity. You either 1) have tried Gen AI in your previous work or outside of work, or 2) are curious about Gen AI and have explored it.
Benefits
Roku offers a diverse range of benefits as part of our compensation package to support employees and their families. Benefits include global access to mental health and financial wellness resources, as well as location-based healthcare (medical, dental, vision), life, disability, retirement options (401(k)/pension), and paid time off. Not all benefits are available in every location or for every role; consult with your recruiter for location-specific details.
The Roku Culture
Roku is a fast-paced environment where the company’s success takes priority. We value independent thinkers who act boldly, move fast, and excel through collaboration and trust. We’re proud of our problem-solving culture and focus on delivering customer value. To learn more about Roku, visit our factsheet.
By providing your information, you acknowledge Roku may contact you about job roles and have read Roku’s Applicant Privacy Notice. You may unsubscribe from future communications at any time.
Location: Austin, TX