metropolis.io

Staff Reliability Engineer — Platform Resilience & Scale

metropolis.io, New York, New York, US, 10261

A leading AI company in New York City seeks a Staff or Senior Software Engineer focused on reliability to drive the practices that keep its systems available and resilient. You will own the platform's reliability posture, architect failover systems, and improve observability so the platform can scale while maintaining 99.9%+ uptime. The role requires expertise in backend engineering, Java, and distributed systems, among other qualifications. The hybrid work culture fosters innovation and collaboration.