Logo
BuildOps

Director of Engineering, DevOps & SRE

BuildOps, Raleigh, North Carolina, United States, 27601

Save Job

At BuildOps, we’re building a groundbreaking software solution, purpose-built to support today’s commercial contractors. From helping our customers manage their service department all the way to project management, we’re breaking the mold and building a team that invests in our mission statement. We love driven, self-motivated folks experienced in tech start-ups and thrive in fast-paced environments. Could you be our next hire?

This candidate will join a well-funded, fast-growing technology startup with the unique opportunity to help build out a critical function for the company.

The Director of Engineering, DevOps & SRE will lead and oversee the DevOps & Site Reliability Engineering (SRE) functions within the organization. This role is responsible for ensuring the reliability, scalability, security, and efficiency of the company’s technical infrastructure and operations. The ideal candidate is a strategic leader with deep technical expertise and a proven track record of managing cross-functional teams in a hybrid environment.

What You Will do: Strategic Leadership:

Develop and implement a cohesive strategy for DevOps, and SRE that aligns with organizational goals

Collaborate with senior leadership to identify technology needs and drive innovation across departments

Establish and enforce policies, standards, and best practices for technical operations

DevOps:

Design and maintain advanced CI/CD pipelines to streamline software delivery processes

Automate infrastructure provisioning and configuration management using tools like Terraform or Ansible

Define development, testing, release, update, and support processes for DevOps operations

Foster a culture of automation to reduce manual intervention wherever possible

Site Reliability Engineering (SRE):

Ensure system reliability, availability, and scalability through proactive monitoring and maintenance.

Define Service Level Objectives (SLOs) and manage error budgets to balance reliability with feature development.

Oversee incident management processes including root cause analysis and post-mortem reviews.

Implement observability tools for real-time monitoring of system health metrics.

What We Look For:

Bachelor’s degree in Computer Science, Information Technology, or related field; Master’s degree preferred

10+ years of experience in technical operations management or related roles

Proven leadership experience managing cross-functional technical teams in hybrid environments

Strong knowledge of IT infrastructure, DevOps practices, SRE principles, and cybersecurity frameworks

Expertise in tools like Terraform, Kubernetes, Jenkins, Prometheus/Grafana for monitoring systems

Excellent communication skills with the ability to interact effectively with stakeholders at all levels

Strategic thinking combined with hands-on problem-solving abilities

Experience implementing automation frameworks across IT operations

Familiarity with cloud platforms (AWS/GCP/Azure) for scalable infrastructure management

Strong understanding of incident response protocols and disaster recovery planning

Ability to work in a fast-paced, dynamic startup environment and adapt to change

Compensation:

Negotiable base salary + annual bonus

#J-18808-Ljbffr