Qualtrix consulting
Location: Chicago, IL
Interview Mode: In-person (Mandatory)
Start: Day One Onsite
Hybrid (3 days/week) Job Description: We are seeking an experienced AWS SRE Architect to lead the design and implementation of reliable, scalable, and secure cloud infrastructure. The ideal candidate will collaborate with cross-functional teams to ensure operational excellence, monitor system health, and drive automation across our AWS environments. Key Responsibilities: Architect Scalable Systems: Design and implement highly available, resilient, and fault-tolerant systems using AWS services. Incident Management: Lead the response to incidents, ensuring root cause analysis, resolution, and preventive measures. Monitoring and Logging: Establish robust monitoring, logging, and alerting systems using tools like Amazon CloudWatch, Prometheus, or Grafana. Automation and Infrastructure as Code (IaC): Develop automation scripts and templates using Terraform Performance Management: Optimize system performance, capacity planning, and load testing - Hands on memory and CPU optimization experience in Java spring boot Microservices Security and Compliance: Ensure cloud solutions meet security best practices and regulatory compliance standards. DevOps Collaboration: Partner with DevOps, Development, and Security teams to ensure seamless CI/CD pipeline management and application deployment. Documentation: Create and maintain architecture diagrams, runbooks, and technical documentation. Required Qualifications: Bachelors or Masters degree in Computer Science, Engineering, or related field. 8+ years of experience in IT operations, system administration, or software engineering. 5+ years of experience with AWS cloud infrastructure. Strong understanding of AWS services including EC2, S3, RDS, Lambda, ECS, EKS, VPC, and CloudFront. Experience in Infrastructure as Code (IaC) using tools like Terraform Proficiency in monitoring and observability tools such as Dynatrace or Datadog Hands-on experience with CI/CD pipelines using Jenkins, GitLab, Harness Expertise in scripting languages (Python, Bash, or PowerShell) for automation and infrastructure management. Experience with container orchestration using Docker and Kubernetes. Solid understanding of SRE principles including SLAs, SLOs, and Error Budgets. Strong problem-solving and analytical skills with experience in large-scale distributed systems.
Preferred Qualifications: AWS Certified Solutions Architect
Professional or AWS Certified DevOps Engineer. Experience with incident management and on-call rotations. Familiarity with ITIL processes and service management frameworks. Knowledge of security best practices (e.g., IAM, KMS, Security Groups).
Hybrid (3 days/week) Job Description: We are seeking an experienced AWS SRE Architect to lead the design and implementation of reliable, scalable, and secure cloud infrastructure. The ideal candidate will collaborate with cross-functional teams to ensure operational excellence, monitor system health, and drive automation across our AWS environments. Key Responsibilities: Architect Scalable Systems: Design and implement highly available, resilient, and fault-tolerant systems using AWS services. Incident Management: Lead the response to incidents, ensuring root cause analysis, resolution, and preventive measures. Monitoring and Logging: Establish robust monitoring, logging, and alerting systems using tools like Amazon CloudWatch, Prometheus, or Grafana. Automation and Infrastructure as Code (IaC): Develop automation scripts and templates using Terraform Performance Management: Optimize system performance, capacity planning, and load testing - Hands on memory and CPU optimization experience in Java spring boot Microservices Security and Compliance: Ensure cloud solutions meet security best practices and regulatory compliance standards. DevOps Collaboration: Partner with DevOps, Development, and Security teams to ensure seamless CI/CD pipeline management and application deployment. Documentation: Create and maintain architecture diagrams, runbooks, and technical documentation. Required Qualifications: Bachelors or Masters degree in Computer Science, Engineering, or related field. 8+ years of experience in IT operations, system administration, or software engineering. 5+ years of experience with AWS cloud infrastructure. Strong understanding of AWS services including EC2, S3, RDS, Lambda, ECS, EKS, VPC, and CloudFront. Experience in Infrastructure as Code (IaC) using tools like Terraform Proficiency in monitoring and observability tools such as Dynatrace or Datadog Hands-on experience with CI/CD pipelines using Jenkins, GitLab, Harness Expertise in scripting languages (Python, Bash, or PowerShell) for automation and infrastructure management. Experience with container orchestration using Docker and Kubernetes. Solid understanding of SRE principles including SLAs, SLOs, and Error Budgets. Strong problem-solving and analytical skills with experience in large-scale distributed systems.
Preferred Qualifications: AWS Certified Solutions Architect
Professional or AWS Certified DevOps Engineer. Experience with incident management and on-call rotations. Familiarity with ITIL processes and service management frameworks. Knowledge of security best practices (e.g., IAM, KMS, Security Groups).