UST

Cloud Infrastructure Site Reliability Engineer (SRE)

UST, Alpharetta, Georgia, United States, 30239

Cloud Infrastructure Site Reliability Engineer (SRE) • UST Location: Atlanta, GA, United States

Join our mission‑driven team that transforms lives through technology. UST empowers teams to innovate, act nimbly, and create lasting impact.

Role Description As an SRE with expertise in multiple public cloud platforms, you will operate infrastructure solutions using Google’s SRE model, ensuring reliability, scalability, and security.

Responsibilities

Design, build, and maintain highly available, scalable, and secure cloud infrastructure on AWS, GCP, or Azure.

Develop and implement automation for provisioning, monitoring, scaling, and incident response using IaC tools such as Terraform, CloudFormation, and Ansible.

Monitor system reliability, capacity, and performance; proactively detect and address issues.

Respond to production incidents, participate in on‑call rotations, and lead post‑incident reviews.

Collaborate with software engineering and security teams to ensure new services are production‑ready.

Build and maintain deployment, monitoring, and operations tools; automate manual processes to reduce toil.

Document operational processes and system architectures.

Continuously evaluate and adopt new technologies to improve reliability, security, and efficiency.

What You Need

Bachelor’s degree in Computer Science, Engineering, or equivalent practical experience.

3+ years of software development experience and proficiency in at least one language (Python, Go, Java, C++).

Experience administering cloud platforms (AWS, GCP, Azure) including networking, security, containerization, storage, data management, and serverless technologies.

Solid understanding of Linux systems, networking fundamentals, virtualized and distributed systems.

Deep understanding of observability tools and ability to set up monitoring dashboards.

Familiarity with CI/CD tools for automated testing, deployments, provisioning, and observability.

Ability to manage incidents, perform root‑cause analysis, and implement post‑mortem reviews.

Understanding of SLOs and SLAs for system reliability.

Exceptionally strong problem‑solving, troubleshooting, and communication skills.

Experience leading technical projects or mentoring junior engineers.

Additional Qualifications

Experience in enterprise‑scale financial services or regulated industries.

5+ years of experience in SRE, DevOps, infrastructure, or cloud engineering roles, preferably for large‑scale distributed systems.

Certifications such as Google Cloud Professional SRE, AWS Certified DevOps Engineer, etc.

Compensation Annual range: $65,000 – $98,000 (U.S. market).

Benefits Full‑time employees receive paid vacation (10 days), paid sick leave (6 days), 10 paid holidays, paid bereavement, jury duty leave, 401(k) with matching, medical, dental, vision, life insurance, disability coverage, HSA, FSA, and optional short‑term disability benefits. Benefits vary by location.

Seniority Level Mid‑Senior level

Employment Type Full‑time

Job Function Engineering and Information Technology

Industries IT Services and IT Consulting

Key Skills SRE, Cloud, Automation

What We Believe We embrace Humility, Humanity, and Integrity – values that shape a people‑first culture fostering diversity and sustainable solutions.

Equal Employment Opportunity Statement UST is an Equal Opportunity Employer. All qualified applicants receive consideration without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, veteran status, or any other protected characteristic. UST complies with applicable laws regarding arrest or conviction records and “fair‑chance” ordinances.

Referral Information Referrals increase your chances of interviewing at UST by 2×.

#J-18808-Ljbffr