Site Reliability Engineer (SRE) - D365 Infrastructure & Azure Dev...
ZipRecruiter - Atlanta
Job Description
Onsite interview (Atlanta)
3 days onsite a week (Atlanta)
No H1B candidates and no C2C

Job description:
Summary

We are seeking a highly experienced Site Reliability Engineer (SRE) with a strong background in Microsoft Dynamics 365 (D365) infrastructure, including Power Platform, Dataverse, and Finance & Operations (F&O). This role requires deep expertise in Azure DevOps (ADO), GitHub, and infrastructure automation. You will be responsible for deploying, configuring, monitoring, and maintaining our D365 environments and ensuring our CI/CD and operational practices meet enterprise-grade standards.

Key Responsibilities
• Deploy, configure, and manage D365 environments, including Power Platform, Dataverse, and F&O, ensuring reliability, scalability, and security.
• Design, build, and maintain advanced Azure DevOps (ADO) pipelines for application deployments, infrastructure provisioning, and environment automation.
• Manage and optimize GitHub repositories, branching strategies, code integrations, and related workflows.
• Develop and implement comprehensive monitoring, alerting, and logging solutions to ensure high availability and rapid incident response.
• Collaborate closely with engineering, infrastructure, and product teams to support end-to-end delivery of D365 solutions.
• Drive root cause analysis and continuous improvement initiatives to enhance system resilience.
• Enhance infrastructure as code practices using Terraform and/or Ansible to automate provisioning and configuration.
• Create and maintain detailed documentation of deployment processes, infrastructure configurations, and operational runbooks.

Required Skills & Experience
• 5+ years of hands-on experience as an SRE, DevOps, or Infrastructure Engineer in enterprise environments.
• Strong practical knowledge of the Microsoft Dynamics 365 ecosystem, especially deploying and managing Power Platform, Dataverse, and F&O.
• Proven experience deploying, configuring, and maintaining production-grade D365 environments.
• Deep expertise in Azure DevOps (ADO) for building robust CI/CD pipelines and managing release processes.
• Solid experience using GitHub for source control, pull requests, workflows, and integrations.
• Demonstrated ability to design, implement, and maintain monitoring and alerting frameworks for cloud and D365 workloads.
• Proficiency with Infrastructure as Code (IaC), ideally using Terraform; Ansible experience is a plus.
• Strong knowledge of Azure infrastructure services and best practices.

Qualifications
• Microsoft certifications in Azure or Dynamics 365.
• Experience designing secure, compliant, and cost-optimized D365 environments.
• Familiarity with role-based access, management, and governance in Azure and D365.
• Exposure to additional CI/CD or observability tools.

What We’re Looking For

This is not a general DevOps role. We’re looking for someone who:
• Has practical, hands-on expertise with the specific D365 components we use, with a proven track record of end-to-end deployments and operations.
• Can independently design and implement ADO pipelines, environment automation, and comprehensive monitoring solutions for D365 workloads.
• Is comfortable troubleshooting complex issues, driving operational excellence, and mentoring peers on D365 infrastructure practices.
• Brings a proactive mindset to continuously improve system reliability, performance, and scalability.

If you’re a seasoned SRE passionate about Dynamics 365 and driving operational excellence across modern cloud environments, we’d love to connect.