Compunnel, Inc.
The Site Reliability Engineer (SRE) will be responsible for ensuring the availability, performance, and security of cloud-based applications and infrastructure. This role requires extensive experience with Azure cloud environments, infrastructure as code (IaC) tools, and DevOps best practices. The ideal candidate will work closely with development and operations teams to improve system reliability, automate processes, and enhance security measures.
Key Responsibilities:
Maintain and optimize Azure cloud infrastructure, ensuring high availability and performance. Develop and manage Infrastructure as Code (IaC) using ARM or Terraform templates. Implement and manage networking components, including vNet, Gateway, and public IP configurations. Create and maintain Azure resource groups and associated resources. Design, implement, and manage CI/CD pipelines for build and release processes. Enhance application security through Azure Frontdoor, DDoS protection, and other security measures. Manage and optimize Azure Storage Accounts for data storage and retrieval. Configure and administer Identity and Access Management (PIM, Entra ID, etc.). Utilize Azure Insights and other monitoring tools to track system performance and provide reporting. Identify opportunities for automation, performance tuning, and system reliability improvements. Provide guidance and recommendations on optimizing the existing cloud environment. Support disaster recovery and business continuity strategies, including multi-region failover and Azure backup services. Collaborate with development teams to integrate reliability and scalability best practices. Troubleshoot and resolve incidents within defined SLAs and document resolution processes. Maintain compliance with ITIL practices, change management, and enterprise governance standards. Required Qualifications:
Minimum 5 years of experience in DevOps/Site Reliability Engineering with a focus on Azure cloud environments. Strong proficiency in Infrastructure as Code (IaC) using ARM templates or Terraform. Hands-on experience with Azure networking components, including vNet, Gateway, and public IP. Expertise in creating and managing Azure resource groups and cloud resources. Experience in designing and maintaining CI/CD pipelines. Proficiency in Azure security best practices, including Azure Frontdoor and DDoS protection. Knowledge of Azure Storage Accounts and data management strategies. Familiarity with Identity and Access Management (PIM, Entra ID). Experience with system monitoring and reporting tools such as Azure Insights. Strong troubleshooting skills and ability to resolve incidents within SLA constraints. Excellent communication skills and ability to work independently. Preferred Qualifications:
Development experience with scripting or programming languages. Experience with disaster recovery planning, including multi-region failover and Azure backup services. Familiarity with ITIL processes and best practices for change management. Experience in automating deployments and system maintenance tasks.
#J-18808-Ljbffr
Maintain and optimize Azure cloud infrastructure, ensuring high availability and performance. Develop and manage Infrastructure as Code (IaC) using ARM or Terraform templates. Implement and manage networking components, including vNet, Gateway, and public IP configurations. Create and maintain Azure resource groups and associated resources. Design, implement, and manage CI/CD pipelines for build and release processes. Enhance application security through Azure Frontdoor, DDoS protection, and other security measures. Manage and optimize Azure Storage Accounts for data storage and retrieval. Configure and administer Identity and Access Management (PIM, Entra ID, etc.). Utilize Azure Insights and other monitoring tools to track system performance and provide reporting. Identify opportunities for automation, performance tuning, and system reliability improvements. Provide guidance and recommendations on optimizing the existing cloud environment. Support disaster recovery and business continuity strategies, including multi-region failover and Azure backup services. Collaborate with development teams to integrate reliability and scalability best practices. Troubleshoot and resolve incidents within defined SLAs and document resolution processes. Maintain compliance with ITIL practices, change management, and enterprise governance standards. Required Qualifications:
Minimum 5 years of experience in DevOps/Site Reliability Engineering with a focus on Azure cloud environments. Strong proficiency in Infrastructure as Code (IaC) using ARM templates or Terraform. Hands-on experience with Azure networking components, including vNet, Gateway, and public IP. Expertise in creating and managing Azure resource groups and cloud resources. Experience in designing and maintaining CI/CD pipelines. Proficiency in Azure security best practices, including Azure Frontdoor and DDoS protection. Knowledge of Azure Storage Accounts and data management strategies. Familiarity with Identity and Access Management (PIM, Entra ID). Experience with system monitoring and reporting tools such as Azure Insights. Strong troubleshooting skills and ability to resolve incidents within SLA constraints. Excellent communication skills and ability to work independently. Preferred Qualifications:
Development experience with scripting or programming languages. Experience with disaster recovery planning, including multi-region failover and Azure backup services. Familiarity with ITIL processes and best practices for change management. Experience in automating deployments and system maintenance tasks.
#J-18808-Ljbffr