Logo
ZipRecruiter

Site Reliability Engineer (SRE) - D365 Infrastructure & Azure DevOps

ZipRecruiter, Atlanta

Save Job

Job Descriptionn

Onsite interview (Atlanta)

n

3 days onsite a week (Atlanta)

n

No H1B candidates & No C2C's

nn

Job description:

n

Summary

nn

We are seeking a highly experienced Site Reliability Engineer (SRE) with a strong background in Microsoft Dynamics 365 (D365) infrastructure, including Power Platform, Dataverse, and Finance & Operations (F&O). This role requires deep expertise in Azure DevOps (ADO), GitHub, and infrastructure automation. You will be responsible for deploying, configuring, monitoring, and maintaining our D365 environments and ensuring our CI/CD and operational practices meet enterprise-grade standards.

nn

Key Responsibilities

n

• Deploy, configure, and manage D365 environments, including Power Platform, Dataverse, and F&O, ensuring reliability, scalability, and security.

n

• Design, build, and maintain advanced Azure DevOps (ADO) pipelines for application deployments, infrastructure provisioning, and environment automation.

n

• Manage and optimize GitHub repositories, branching strategies, code integrations, and related workflows.

n

• Develop and implement comprehensive monitoring, alerting, and logging solutions to ensure high availability and rapid incident response.

n

• Collaborate closely with engineering, infrastructure, and product teams to support end-to-end delivery of D365 solutions.

n

• Drive root cause analysis and continuous improvement initiatives to enhance system resilience.

n

• Enhance infrastructure as code practices using Terraform () and/or Ansible to automate provisioning and configuration.

n

• Create and maintain detailed documentation of deployment processes, infrastructure configurations, and operational runbooks.

nn

Required Skills & Experience

n

• 5+ years of hands-on experience as an SRE, DevOps, or Infrastructure Engineer in enterprise environments.

n

• Strong practical knowledge of the Microsoft Dynamics 365 ecosystem, especially deploying and managing Power Platform, Dataverse, and F&O.

n

• Proven experience deploying, configuring, and maintaining production-grade D365 environments.

n

• Deep expertise in Azure DevOps (ADO) for building robust CI/CD pipelines and managing release processes.

n

• Solid experience using GitHub for source control, pull requests, workflows, and integrations.

n

• Demonstrated ability to design, implement, and maintain monitoring and alerting frameworks for cloud and D365 workloads.

n

• Proficiency with Infrastructure as Code (IaC), ideally using Terraform; Ansible experience is a plus.

n

• Strong knowledge of Azure infrastructure services and best practices.

nn

Qualifications

n

• Microsoft certifications in Azure or Dynamics 365.

n

• Experience designing secure, compliant, and cost-optimized D365 environments.

n

• Familiarity with role-based access, management, and governance in Azure and D365.

n

• Exposure to additional CI/CD or observability tools.

nn

What We’re Looking For

nn

This is not a general DevOps role. We’re looking for someone who:

n

• Has practical, hands-on expertise with the specific D365 components we use, with a proven track record of end-to-end deployments and operations.

n

• Can independently design and implement ADO pipelines, environment automation, and comprehensive monitoring solutions for D365 workloads.

n

• Is comfortable troubleshooting complex issues, driving operational excellence, and mentoring peers on D365 infrastructure practices.

n

• Brings a proactive mindset to continuously improve system reliability, performance, and scalability.

nn

If you’re a seasoned SRE passionate about Dynamics 365 and driving operational excellence across modern cloud environments, we’d love to connect.