Logo
Diverse Lynx

Site Reliability Engineering (SRE)

Diverse Lynx, Atlanta

Save Job

Role name: Developer Role Description: • 6+ years of experience in systems engineering, platform support, DevOps, or site reliability roles.• Strong familiarity with Google Cloud Platform (GCP) services, especially BigQuery, Cloud Logging, IAM, and Service Accounts.• Provision and monitor staging/production cloud services.• Deploy, maintain, and troubleshoot cloud services.• Proven experience in an architectural role, designing solutions for reliability, scalability, and performance.• Deep understanding and practical application of SRE principles (SLIs/SLOs, error budgets, toil reduction, automation, incident management, postmortems).• Expertise in cloud computing platforms (e.g., GCP) including infrastructure, networking, and security services.• Strong experience with containerization and orchestration technologies (Kubernetes, Docker, serverless computing).• Solid experience designing and implementing observability solutions (e.g., Dynatrace, Prometheus, Grafana, ELK/EFK Stack).• Strong programming/scripting skills (e.g., Python, Go, Bash) for automation and tool development.• Excellent analytical, problem-solving, and strategic thinking skills.• Strong communication, collaboration, and leadership skills with the ability to influence technical direction across teams• Working On-Call as needed. Competencies: Digital : Google Cloud, Digital : Site Reliability Engineering (SRE)
Diverse Lynx LLC is an Equal Employment Opportunity employer. All qualified applicants will receive due consideration for employment without any discrimination. All applicants will be evaluated solely on the basis of their ability, competence and their proven capability to perform the functions outlined in the corresponding role. We promote and support a diverse workforce across all levels in the company.