DigitalOcean
Engineering Manager, GradientAI Infrastructure
Join DigitalOcean's GradientAI Infrastructure Team as an Engineering Manager to support our engineers, grow our culture, and lead a team developing AI/ML infrastructure products. You'll guide a 6‑8 person engineering team, facilitate communications, and empower the team to create innovative solutions for our partners and customers. This team will build a new product that brings DigitalOcean’s simplicity to the world of large language model hosting, serving, and optimization.
What You’ll Be Doing
Grow and lead a highly‑collaborative engineering team.
Develop and shepherd complex AI and cloud engineering projects through the entire product development lifecycle (ideation, product definition, experimentation, prototyping, development, testing, release, and operations).
Help the team achieve higher standards of performance and product quality.
Introduce and improve processes for team performance and quality‑of‑life.
Collaborate with product owners and cross‑functional teams to design idiomatic, feature‑rich, and operationally sustainable software solutions.
Oversee the design and implementation of scalable, automated systems for DNS provisioning, monitoring, and failover.
Facilitate transparent, constructive communication and a fair, growth‑oriented distribution of responsibilities between team members.
Provide coaching and counseling through mentoring, one‑on‑one meetings, etc.
What You’ll Add to DigitalOcean
7+ years of software engineering experience, including 4+ years with distributed systems, 2+ years building AI/ML technologies (ideally related to LLM hosting and inference), and 2+ years in a people‑management or team‑lead role.
A passion for leading, coaching, and mentoring software engineers.
Enduring interest in distributed systems design, AI/ML, and implementation at scale in the cloud.
Deep expertise in cloud computing platforms and modern AI/ML technologies.
Experience with modern LLMs, ideally related to hosting, serving, and optimizing such models.
Experience researching, evaluating, and building with open‑source technologies.
Proficiency in programming languages commonly used in cloud development, such as Python and Go.
Experience with infrastructure‑as‑code tools like Terraform or Ansible.
Experience with GPU platforms from AMD and NVIDIA, and with the associated toolsets for tuning, configuring, and accelerating workloads on them, is ideal but not required.
Knowledge of networking concepts (e.g., TCP/IP, VPCs, subnets, routing) and storage systems.
A strong sense of ownership and a drive to resolve any issues preventing delivery of value to customers.
An appreciation for process and for developing cross‑disciplinary collaboration between engineering, operations, support, and product groups.
Strong project‑management skills.
Familiarity with end‑to‑end quality best practices and their implementation.
Enthusiasm for staffing, interviewing, growing, and retaining teams.
Experience coordinating with partner teams across time zones and geographies.
Compensation Range
$176,000 – $220,000
This is a remote role.
Why You’ll Like Working for DigitalOcean
We innovate with purpose. You’ll be part of a cutting‑edge technology company that simplifies cloud and AI so builders can spend more time creating software that changes the world.
We prioritize career development. You’ll work with some of the most interesting people in the industry, and we provide opportunities for growth, conferences, training, and education.
We care about your well‑being. You’ll receive competitive benefits, flexible time off, and an employee assistance program.
We reward our employees. In addition to the salary range, you may qualify for a bonus and equity compensation.
DigitalOcean is an equal‑opportunity employer. We do not discriminate on the basis of race, religion, color, ancestry, national origin, caste, sex, sexual orientation, gender, gender identity or expression, age, disability, medical condition, pregnancy, genetic makeup, marital status, or military service.