Monks
Site Reliability Engineer (SRE) - Media Production Infrastructure
Monks, Cupertino, California, United States, 95014
Site Reliability Engineer (SRE) - Media Production Infrastructure
Please note that we will never request payment or bank account information at any stage of the recruitment process. As we continue to grow our teams, we urge you to be cautious of fraudulent job postings or recruitment activities that misuse our company name and information. Please protect your personal information during any recruitment process. While Monks may contact potential candidates via LinkedIn, all applications must be submitted through our official website (monks.com/careers).
About The Role We are seeking a highly skilled and experienced Site Reliability Engineer (SRE) to join our Platform Engineering team, supporting a world‑class media production environment for a leading global technology company. This is a crucial role within a Managed Services model, focused on ensuring the high availability, performance, and resilience of critical server, storage, and media workflow systems. You will be one of two dedicated on‑site SREs who will partner with remote and consulting staff to provide around‑the‑clock operational support and continuous infrastructure improvement.
Key Responsibilities
Infrastructure Management: Maintain and troubleshoot all production hardware, servers, and storage infrastructure, with a specialized focus on the Storage Area Network (SAN).
Storage Expertise: Execute key maintenance and support for the SAN environment, including firmware/software updates for fiber switches, RAIDs, and ape systems.
Networking and System Administration: Manage Directory services, network services (DNS, static IPs, subnet masks), and configure shares and permissions on the SAN.
Monitoring and Observability: Manage and improve custom dashboards for 24/7 monitoring of systems, RAIDs, temperature sensors, and backup/archive processes.
Custom Application Support: Contribute to the development and maintenance of custom applications and dashboards that support media workflows, including tools for project deployment, directory services integration, and ticketing.
Remote/On‑Demand Support: Provide active on‑site support and participate in a 24/7 on‑call rotation for critical interventions (e.g., power/cooling issues).
Backup and Archive: Manage the Backup and Archive environment, maintain tape systems, and prepare projects for archiving to the cloud.
Qualifications & Experience
Experience: 14+ years of experience working with macOS and SAN environments, preferably Xsan.
Experience working with Stornext and Jamf.
Technical Depth:
Deep expertise in Fibre Channel networking.
Demonstrated experience with hardware RAIDs, block storage, and LUN creation.
Thorough knowledge of macOS ACLs, POSIX permissions, and Directory Services.
Expertise in installing and configuring Prometheus and Grafana, including creating Prometheus exporters.
Software & Scripting:
Experience with Shell Scripting.
Experience with remote connection technologies.
Thorough knowledge of data management for media and entertainment.
Please note: This position requires on‑site presence in
SCV/Cupertino
three days per week , including
Saturday and Sunday . The third on‑site day may be scheduled on
Tuesday, Wednesday, or Thursday . The remaining
two days
may be worked
remotely .
This role is subject to our Return to Office (RTO) policy. If you reside within a commutable distance of one of our office locations, you will be expected to work from the office a set number of days per week. The specific details, including the number of required office days, will be in accordance with the company’s then-current RTO policy, which is subject to change from time to time.
What We Offer
Monks has provided a compensation range that represents its good faith estimate of what Media.Monks may pay for the position at the time of posting. Monks may ultimately pay more or less than the posted compensation range. The salary offered to the selected candidate will be determined based on job‑related factors, but not based on a candidate’s sex or any other protected status.
Salary Range $133,298.00 - $150,925.00 USD
Benefits
Excellent, full coverage medical, dental, and vision insurance
Generous PTO and 15 company‑wide holidays
401(k) with company contribution
Paid parental leave
Work‑life balance with an emphasis on personal wellbeing
Career growth in a disruptor space & entrepreneurial opportunities within the Monks network
A globally diverse & inclusive culture with employee resource groups such as S4 Melanin, Pride.Monks, Cultura.Monks, and more!
Authentic commitment to DEI efforts and sustainable growth.
We are an equal‑opportunity employer committed to building a respectful and empowering work environment for all people to freely express themselves amongst colleagues who embrace diversity in all respects. Including fresh voices and unique points of view in all aspects of our business not only creates an environment where we can all grow and thrive but also increases our potential to produce work that better represents—and resonates with—the world around us.
#J-18808-Ljbffr
About The Role We are seeking a highly skilled and experienced Site Reliability Engineer (SRE) to join our Platform Engineering team, supporting a world‑class media production environment for a leading global technology company. This is a crucial role within a Managed Services model, focused on ensuring the high availability, performance, and resilience of critical server, storage, and media workflow systems. You will be one of two dedicated on‑site SREs who will partner with remote and consulting staff to provide around‑the‑clock operational support and continuous infrastructure improvement.
Key Responsibilities
Infrastructure Management: Maintain and troubleshoot all production hardware, servers, and storage infrastructure, with a specialized focus on the Storage Area Network (SAN).
Storage Expertise: Execute key maintenance and support for the SAN environment, including firmware/software updates for fiber switches, RAIDs, and ape systems.
Networking and System Administration: Manage Directory services, network services (DNS, static IPs, subnet masks), and configure shares and permissions on the SAN.
Monitoring and Observability: Manage and improve custom dashboards for 24/7 monitoring of systems, RAIDs, temperature sensors, and backup/archive processes.
Custom Application Support: Contribute to the development and maintenance of custom applications and dashboards that support media workflows, including tools for project deployment, directory services integration, and ticketing.
Remote/On‑Demand Support: Provide active on‑site support and participate in a 24/7 on‑call rotation for critical interventions (e.g., power/cooling issues).
Backup and Archive: Manage the Backup and Archive environment, maintain tape systems, and prepare projects for archiving to the cloud.
Qualifications & Experience
Experience: 14+ years of experience working with macOS and SAN environments, preferably Xsan.
Experience working with Stornext and Jamf.
Technical Depth:
Deep expertise in Fibre Channel networking.
Demonstrated experience with hardware RAIDs, block storage, and LUN creation.
Thorough knowledge of macOS ACLs, POSIX permissions, and Directory Services.
Expertise in installing and configuring Prometheus and Grafana, including creating Prometheus exporters.
Software & Scripting:
Experience with Shell Scripting.
Experience with remote connection technologies.
Thorough knowledge of data management for media and entertainment.
Please note: This position requires on‑site presence in
SCV/Cupertino
three days per week , including
Saturday and Sunday . The third on‑site day may be scheduled on
Tuesday, Wednesday, or Thursday . The remaining
two days
may be worked
remotely .
This role is subject to our Return to Office (RTO) policy. If you reside within a commutable distance of one of our office locations, you will be expected to work from the office a set number of days per week. The specific details, including the number of required office days, will be in accordance with the company’s then-current RTO policy, which is subject to change from time to time.
What We Offer
Monks has provided a compensation range that represents its good faith estimate of what Media.Monks may pay for the position at the time of posting. Monks may ultimately pay more or less than the posted compensation range. The salary offered to the selected candidate will be determined based on job‑related factors, but not based on a candidate’s sex or any other protected status.
Salary Range $133,298.00 - $150,925.00 USD
Benefits
Excellent, full coverage medical, dental, and vision insurance
Generous PTO and 15 company‑wide holidays
401(k) with company contribution
Paid parental leave
Work‑life balance with an emphasis on personal wellbeing
Career growth in a disruptor space & entrepreneurial opportunities within the Monks network
A globally diverse & inclusive culture with employee resource groups such as S4 Melanin, Pride.Monks, Cultura.Monks, and more!
Authentic commitment to DEI efforts and sustainable growth.
We are an equal‑opportunity employer committed to building a respectful and empowering work environment for all people to freely express themselves amongst colleagues who embrace diversity in all respects. Including fresh voices and unique points of view in all aspects of our business not only creates an environment where we can all grow and thrive but also increases our potential to produce work that better represents—and resonates with—the world around us.
#J-18808-Ljbffr