Job Description
nAn employer is looking for an SRE to join their enterprise level SRE team. They are building a specialized team of Senior Site Reliability Engineers to act as embedded technical experts across their IT organization. This team will be responsible for solving complex production issues, guiding development teams, and building tools that improve system resilience and observability.
nThis is not a traditional SRE role. You will be a technical leader, coach, and hands-on problem solver who thrives in ambiguity and drives results across organizational boundaries.
nResponsibilities
nInvestigate and resolve high-impact production issues across infrastructure and applications.
nEmbed with dev teams to guide them through performance, reliability, and architectural challenges.
nParticipate in incident response bridges as a technical expert.
nBuild tools and scripts to detect vulnerabilities, automate checks, and improve system visibility.
nConduct post-incident audits and ensure follow-through on remediation.
nCollaborate with DBAs, network engineers, and platform teams to unblock and resolve issues.
nProactively identify issues and drive them to resolution without waiting for direction.
nWe are a company committed to creating inclusive environments where people can bring their full, authentic selves to work every day. We are an equal opportunity employer that believes everyone matters. Qualified candidates will receive consideration for employment opportunities without regard to race, religion, sex, age, marital status, national origin, sexual orientation, citizenship status, disability, or any other status or characteristic protected by applicable laws, regulations, and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or recruiting process, please send a request to Human Resources Request Form ( . The EEOC "Know Your Rights" Poster is available here ( .
nTo learn more about how we collect, keep, and process your private information, please review Insight Global's Workforce Privacy Policy: .
nSkills and Requirements
n10+ years of experience in SRE or DevOps roles.
nDeep expertise in Kubernetes (deployment, troubleshooting, performance tuning), Networking (firewalls, routing, connectivity issues), Relational Databases (patching, auditing, performance tuning)
nStrong scripting skills (e.g., Python, Bash) for tooling and automation.
nProven ability to lead through influence and solve problems across teams.
nComfortable navigating organizational blockers and driving issues to resolution.
nExperience with incident response and postmortem processes.
nFamiliarity with monitoring and observability tools.
nAbility to mentor and coach other engineers and development teams.
nStrong communication, and the ability to explain complex technical issues clearly to both technical and non-technical audiences.
nAbility to work cross functionally with DBAs, network engineers, developers, and leadership. null
nWe are a company committed to creating diverse and inclusive environments where people can bring their full, authentic selves to work every day. We are an equal employment opportunity/affirmative action employer that believes everyone matters. Qualified candidates will receive consideration for employment without regard to race, color, ethnicity, religion,sex (including pregnancy), sexual orientation, gender identity and expression, marital status, national origin, ancestry, genetic factors, age, disability, protected veteran status, military oruniformed service member status, or any other status or characteristic protected by applicable laws, regulations, andordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request to