Logo
Heath Consultants

Senior Cloud & Systems Administrator

Heath Consultants, Houston, Texas, United States, 77246

Save Job

The

Senior Cloud & Systems Administrator

plays a critical role in ensuring the stability, security, and scalability of Heath Consultants' hybrid IT infrastructure, spanning both on-premises datacenter systems and Microsoft Azure cloud services. This position requires a versatile engineer who can seamlessly bridge traditional systems administration with modern cloud engineering.

The role is accountable for managing

core infrastructure components

such as

Active Directory, Group Policy, VMware clusters, HP Nimble storage, Rubrik backup, and Microsoft SQL Server , while also leading the design, deployment, and optimization of

Azure workloads, Office 365 services, and hybrid integrations . By doing so, the Senior Cloud & Systems Administrator ensures the organization's systems are

highly available, resilient, secure, and aligned with business objectives .

Beyond day-to-day operations (monitoring, patching, performance tuning, and SLA-driven support), this role has a strong

strategic component

- driving cloud adoption, automation, cost optimization, disaster recovery planning, and compliance with SOC 2, ISO, NIST and internal governance requirements.

The Senior Cloud & Systems Administrator also acts as the

primary escalation point

for infrastructure-related issues, collaborating with the Service Desk, Security, and Network teams to resolve incidents, implement change controls, and deliver permanent solutions. The role requires

regular reporting, documentation, and metrics reviews , ensuring that the system's health, performance, and costs are transparent and well-managed.

Ultimately, this position is both

hands-on technical and forward-looking strategic , balancing the need to manage daily operational health while shaping the future of Heath Consultants' IT infrastructure in a secure, efficient, and cost-conscious manner.

Candidates MUST be local to the Houston, TX area for the interview process and work environment.

Key Responsibilities:

Hybrid Infrastructure Architecture & Strategy

Lead the design and implementation of Microsoft Azure and VMware-based systems. Ensure hybrid solutions (cloud + datacenter) are aligned with business requirements and IT strategy. Evaluate new technologies and services that enhance scalability, security, and cost efficiency. Monitoring, Metrics & Reporting

Monitor Azure, VMware, and Nimble environments using Azure Monitor, Log Analytics, vCenter, and storage dashboards. Track KPIs such as availability, performance, capacity, backup success rates, and SLA compliance. Generate weekly and monthly reports covering system health, cost optimization, and compliance metrics. Identity & Access Management

Administer

Active Directory

(on-prem and Azure AD), managing accounts, groups, and Group Policy. Enforce MFA, Conditional Access, and RBAC across hybrid environments. Integrate AAD with SaaS applications for secure Single Sign-On (SSO). Cloud & Datacenter Operations

Manage Azure resources (VMs, App Services, Functions, Storage, Networking, Key Vaults). Maintain VMware clusters, ESXi hosts, and Nimble storage (snapshots, replication, firmware). Troubleshoot incidents across Azure, O365, VMware, and storage systems. Apply patches and updates to cloud and on-prem workloads, validating in staging before rollout. Backup, Recovery & Continuity

Administer

Rubrik backups

for O365, VMware, and storage workloads; validate restore jobs and retention policies. Manage Azure Backup and Site Recovery for cloud resources. Conduct periodic DR testing across primary datacenter, DR site, and Azure. Network & Security

Configure and manage VNets, ExpressRoute, VPNs, and hybrid connectivity. Audit and enforce NSGs, firewalls, and security baselines across both cloud and datacenter. Analyze NSG flow logs, AD sign-ins, and Defender alerts; remediate vulnerabilities. Maintain SSL certificate lifecycle (issuance, renewal, monitoring, expiration alerts). Automation & Scripting

Develop and maintain automation scripts in

PowerShell, Python, ARM/Bicep, or Terraform . Automate patching, monitoring, reporting, and resource scaling. Implement proactive alerting and self-healing mechanisms to reduce downtime. Application & Database Support

Support Epicor ERP and other critical business applications hosted in Azure or on-prem. Maintain Microsoft SQL Server databases, ensuring performance, backup, and patch compliance. Collaborate with development teams to deploy and support application updates. Ticketing, Documentation & Collaboration

Act as escalation point for Alloy ticketing system issues related to cloud and systems. Ensure SLA compliance for incident resolution and change controls. Document infrastructure, SOPs, system diagrams, and operational processes. Provide training, knowledge transfer, and mentoring for service-desk and junior staff. Cost & Vendor Management

Use Azure Cost Management + Billing to monitor and optimize spend. Identify cost savings by eliminating underutilized or orphaned resources. Engage with Microsoft, Rubrik, VMware, and other vendor partners for escalations, renewals, and contracts. Knowledge, Skills, & Experience:

Education & Certification

Bachelor's degree in IT, Computer Science, or related field; or equivalent related work experience. Certifications: Microsoft Azure (AZ-104, AZ-305), VMware VCP, or MCSA preferred. Experience

5+ years in hybrid infrastructure engineering (Azure + VMware). Hands-on with

Azure, Office 365, VMware, Nimble storage, Rubrik backup, SQL Server . Proven record in monitoring, patching, automation, and SLA-driven operations. Technical Skills

PowerShell, Python, Terraform/ARM/Bicep scripting and automation. Strong knowledge of AD, AAD, GPOs, RBAC, and SSO. Proficiency with SSL lifecycle, patch management, and DR planning. Strong troubleshooting across cloud, virtualization, storage, and applications. Soft Skills

Strong documentation, reporting, and communication skills. Ability to manage competing priorities in a 24x7x365 environment. Self-starter, collaborative, and adaptable to changing requirements. Work Environment:

Hybrid work schedule

Dependability & Flexibility:

The role requires flexibility to work weekends, evenings, nights, and holidays as needed to support business operations.

24/7/365 IT Operations:

Heath's IT department provides around-the-clock support to employees.

Assistance is available through the proper channels depending on the situation:

Normal Working Hours:

Standard business hours coverage. Expanded Working Hours:

Extended coverage for evenings, weekends, and holidays. Emergency Support:

Critical support available outside of normal or expanded hours to address urgent issues that impact business continuity.