Logo
Ursi Technologies Inc

Senior Site Reliability Engineer

Ursi Technologies Inc, Dallas, Texas, United States, 75215

Save Job

Overview

We are seeking an experienced

Site Reliability Engineer (SRE)

with advanced DevOps expertise to help build, scale, and maintain our infrastructure and services. You will play a critical role in ensuring high availability, performance, scalability, and security of our production systems, while enabling continuous deployment and rapid delivery of features to our customers. Key Responsibilities

Design, build, and maintain reliable, scalable, and secure cloud-based infrastructure (AWS, Azure, or GCP). Develop and improve observability using monitoring, alerting, logging, and tracing tools (e.g., Prometheus, Grafana, ELK, Datadog). Automate repetitive tasks and infrastructure using Infrastructure-as-Code (Terraform, CloudFormation, Pulumi). Create and maintain CI/CD pipelines (GitHub Actions, GitLab CI, Jenkins, ArgoCD, etc.) to support fast and safe delivery. Lead incident response, root cause analysis, and postmortems to ensure high uptime and rapid recovery. Optimize system performance, reliability, and cost-effectiveness through proactive monitoring and tuning. Collaborate with software engineering teams to define SLAs/SLOs and improve service reliability. Implement and maintain security best practices across environments (secrets management, IAM, firewalls, etc.). Maintain disaster recovery plans, backups, and high-availability strategies. Required Qualifications

9+ years of experience as an SRE, DevOps Engineer, or similar role. Proficiency in scripting and automation (Bash, Python, Go, etc.). Strong experience with containerization and orchestration (Docker, Kubernetes, Helm). Solid understanding of Linux systems administration and networking fundamentals. Hands-on experience with cloud platforms (AWS, Azure, or GCP). Experience with IaC tools (Terraform, CloudFormation). Familiarity with GitOps and modern deployment practices. Experience with observability tools (Prometheus, Grafana, Datadog). Strong troubleshooting and incident response skills. Experience in a high-traffic, microservices-based architecture. Exposure to service meshes (Istio, Linkerd). Experience with security automation and compliance (SOC2, ISO27001). Seniority level

Mid-Senior level Employment type

Contract Job function

Information Technology Software Development Location

Dallas, TX or Atlanta, GA Duration

12+ Months

#J-18808-Ljbffr