Kaseya

Director, Cloud Infrastructure / Cloud Engineering

Kaseya, Miami, Florida, US, 33222

Kaseya® is the leading provider of complete, AI‑powered IT infrastructure and security management solutions for Managed Service Providers (MSPs) and internal IT organizations worldwide. Kaseya’s best‑in‑breed technologies allow organizations to efficiently manage and secure IT to drive sustained business success. Founded in 2000, Kaseya currently serves customers in over 20 countries across a wide variety of industries and manages over 15 million endpoints worldwide. To learn more about our company and our award‑winning solutions, visit www.Kaseya.com and learn more about Kaseya’s culture.

Director of Site Reliability Engineering (SRE)

We are seeking a strategic and technically accomplished Director of Site Reliability Engineering (SRE) to lead our global infrastructure, network, and public cloud engineering and operations teams. The ideal candidate will have a strong background in site reliability engineering, network management, infrastructure services, and cloud technologies. This role requires a strategic thinker with excellent leadership skills to ensure the reliability, scalability, and performance of our systems.

Responsibilities

Architect and manage resilient infrastructure across all global office locations

Develop and implement strategies to ensure the reliability, availability, and performance of our systems

Oversee the design, deployment, and maintenance of network infrastructure, ensuring optimal performance and security

Lead public cloud deployments (AWS, Azure, OCI) with a focus on scalability, cost‑efficiency, and compliance

Collaborate with cross‑functional teams to define and implement infrastructure and network standards

Establish observability and monitoring systems to proactively manage performance and availability

Develop and maintain disaster recovery and business continuity plans

Ensure compliance with industry standards and regulations

Mentor and develop team members, fostering a culture of continuous improvement and innovation

Maintain comprehensive infrastructure diagrams, and create processes, SOPs, and other technical documentation

Provide technical leadership and training to engineers on the team

Establish best practices throughout the entire technology lifecycle management framework

Build and mature relations with business partners to identify areas of improvement to support business growth and agility

Skills

12+ years of experience in site reliability engineering, network management, and infrastructure services, with 5+ years in a leadership role

Extensive experience with network technologies such as Palo Alto and Meraki firewalls and Cisco and Meraki switches

Excellent understanding of networking technologies such as BGP, OSPF, STP (RSTP/MSTP), AAA, and layer‑2 switching

Proven experience with global hybrid‑cloud interconnectivity network architecture

Expertise in solutions architecture principles working with public cloud service platforms including Azure, AWS and OCI

Familiarity with network access control principles and enterprise‑scale solutions using tools such as Cisco ISE and Prisma Access

Proven working experience with cloud service platforms such as Azure, AWS and OCI and knowledge of best practices and methods for resolving issues in those settings

Working knowledge of infrastructure and network monitoring systems such as LogicMonitor, SolarWinds, and ThousandEyes

Good knowledge and experience in managing Azure landing zone architectures, server and storage workloads, Entra ID, Active Directory, DNS, and DHCP services

Knowledge of business continuity, disaster recovery, and continuity of operations plans

Experience with automation and orchestration tools such as Ansible, Terraform, or Kubernetes

Skill in assessing security controls based on cybersecurity principles and knowledge of how to use network analysis tools to identify vulnerabilities

Knowledge of network access, identity, and access management (e.g., public key infrastructure, OAuth, OpenID, SAML, SPML)

Proven project management abilities to guide complex projects and the ability to give instructions to a non‑technical audience

Proven experience managing large‑scale projects across cross‑discipline teams, including managing vendor resources

Communications/Leadership

Strong leadership and team management skills

Excellent oral, written, and interpersonal skills

Excellent analytical and problem‑solving skills

Ability to build working relationships across multiple areas, engaging with stakeholders, vendors and suppliers, their teams, and other employees

Ability to motivate, guide, and develop team members

Education/Technology

Bachelor’s degree in Computer Science, Management Information Systems, or a related field

Master’s degree in a related field preferred

CCNA, CCIE, CISSP or other IT/security certifications desired

Certifications in cloud platforms (AWS, Azure, Google Cloud) preferred

Other

Enterprise‑sized company experience a plus

Global experience desired

Proven ability to scale teams and to build and retain the right talent

Skilled in developing new processes and driving user adoption

A documented history of successfully driving projects to completion

Proven experience in translating complex requirements to infrastructure teams

Excellent English and great communication skills

Join the Kaseya growth rocket ship and see how we are #ChangingLives!

Additional Information

Kaseya provides equal employment opportunity to all employees and applicants without regard to race, religion, age, ancestry, gender, sex, sexual orientation, national origin, citizenship status, physical or mental disability, veteran status, marital status, or any other characteristic protected by applicable law.
