Stellar Consulting Solutions, LLC
Site Reliability Engineer
Stellar Consulting Solutions, LLC, New York, New York, United States
Role: SRE (Site Reliability Engineer)
Contract: W2 Only
Location: San Diego, CA
Duration: 12+ Months
Client: Medical
Summary: The Site Reliability Engineer will ensure the stability, performance, and security of cloud-based systems. This role involves managing AWS/Azure infrastructure, automating processes, and implementing monitoring tools to maintain system reliability and compliance.

Key Responsibilities:

Cloud Infrastructure Management
- Design, deploy, and maintain scalable and secure cloud systems on AWS and Azure.
- Use Infrastructure-as-Code (Terraform, CloudFormation, AWS CDK) and scripting languages like PowerShell, TypeScript, or Go.
- Ensure compliance with SOC 2 and ePHI standards for healthcare data security.

Monitoring & Observability
- Set up and manage Datadog dashboards and alerts to monitor application and infrastructure performance.
- Develop runbooks for incident response and system health checks.
- Provide visibility into key metrics such as uptime, latency, and error rates.

Performance & Reliability
- Identify and fix system performance issues and bottlenecks.
- Conduct root cause analysis and support on-call rotations for incident management.
- Improve architecture, disaster recovery, and overall system scalability.

DevOps Collaboration
- Partner with developers to integrate CI/CD pipelines and promote automation.
- Work with security and compliance teams to meet all data protection requirements.
- Advocate for best practices in monitoring, deployment, and infrastructure design.

Security & Compliance
- Maintain cloud security through IAM controls, encryption, and audits.
- Ensure adherence to healthcare compliance frameworks (SOC 2, ePHI).

Qualifications:
- Bachelor's degree in Computer Science, Engineering, or related field (or equivalent experience).
- 3+ years of experience as a Site Reliability Engineer on AWS and/or Azure.
- Hands-on with Terraform, CloudFormation, AWS CDK, and monitoring tools (Datadog, Prometheus, Grafana).
- Experience with Docker, Kubernetes, and automation tools like Ansible, Chef, or Puppet.
- Familiar with CI/CD tools (Jenkins, GitHub Actions, Azure DevOps).
- Strong problem-solving, collaboration, and communication skills.
- Experience with healthcare data security (SOC 2, ePHI) preferred.

Nice to Have:
- Experience in regulated industries (healthcare, medical devices).
- Certifications: AWS Solutions Architect, Azure Administrator, or CKA.
- Familiarity with AI/ML monitoring or serverless architectures (AWS Lambda, Azure Functions).

Kindly share your resume at gaurav@stellarconsulting.com or call me at 678-935-7045 to discuss further.