C the Signs
Lead CloudOps Engineer
We are looking for a hands‑on Lead CloudOps Engineer to oversee the reliability, scalability, automation, and day‑to‑day operations of our GCP‑based cloud platform. You will drive infrastructure automation, improve developer workflows, enhance observability, and ensure secure, stable platform operations.
While GCP is the primary environment, the role includes operational responsibility for an existing AWS enterprise environment, requiring the ability to troubleshoot issues, maintain existing systems, and support partner teams without owning major AWS architectural redesigns.
This position is ideal for someone who thrives in cloud‑native environments, enjoys automation, and balances engineering rigor with operational excellence. You will be a founding member of the CloudOps team in the U.S., with potential to grow into future leadership and management positions.
Responsibilities
Lead day‑to‑day monitoring and management of GCP infrastructure, focusing on reliability, uptime, security, performance, and compliance.
Manage GKE clusters, including lifecycle, node pools, workload deployment, and operational best practices.
Implement and maintain GCP networking: VPCs, firewall rules, service networking, and private connectivity.
Support data and application teams using BigQuery, Cloud SQL, Pub/Sub, Cloud Storage, and Cloud Run.
Own and maintain Terraform configurations for GCP and AWS, using reusable modules, remote state, policy checks, and automation pipelines.
Automate environment provisioning, scaling, and configuration with CI/CD tools such as Cloud Build, GitHub Actions, ArgoCD, or Jenkins.
Build tooling and workflows that improve developer productivity (automated builds, deployments, secrets management, and ephemeral environments).
Build and enhance observability stacks using Cloud Monitoring, Prometheus/Grafana, ELK/Elastic, or OpenTelemetry.
Lead incident response, troubleshooting, root‑cause analysis, and post‑incident improvement efforts.
Define and manage SLOs, error budgets, and operational runbooks.
Ensure secure configurations across cloud services, Kubernetes workloads, secrets storage, and network boundaries.
Implement guardrails and compliance automation using IAM best practices, GCP Organization Policies, and Terraform checks.
Work with security and compliance teams to meet HIPAA, HITRUST, SOC 2, or internal audit requirements.
Maintain stability of the existing AWS environment by reviewing IAM roles; supporting workloads on EC2, ECS, EKS, RDS, and S3; troubleshooting infrastructure or network issues; and managing configurations, upgrades, and patching.
Make small‑to‑medium improvements or automation updates for AWS infrastructure using Terraform or CI/CD workflows.
Mentor DevOps, CloudOps, and Platform Engineers through pair programming, reviews, and best‑practice sharing.
Partner with development, data, and security teams to build highly reliable, cloud‑native applications and pipelines.
Establish operational standards, documentation, and playbooks for cloud operations.
Qualifications
8+ years of experience in DevOps, CloudOps, or platform engineering.
Deep hands‑on experience with GCP, including GKE, workload identity, cluster networking, VPC design, firewalls, load balancers, BigQuery, Pub/Sub, Cloud SQL, Cloud Storage, Cloud Run, Cloud Functions, IAM, KMS, and Secret Manager.
Strong expertise with Terraform, including modules, workspaces, and governance.
Strong CI/CD experience with Git‑based workflows.
Solid understanding of Linux, networking basics, containerization, and distributed systems.
Experience supporting production workloads in regulated environments (HIPAA, HITRUST, SOC 2).
Practical experience supporting AWS operations (EC2, EKS, ALB/NLB, S3, RDS, CloudWatch, VPC, security groups).
Comfortable maintaining and improving existing AWS infrastructure.
Preferred Qualifications
Experience with GitOps tools such as ArgoCD or Flux.
Familiarity with service mesh (Istio, Anthos) or advanced networking.
Experience with policy‑as‑code (OPA/Gatekeeper, Sentinel).
Background with FinOps or cost optimization.
Experience building internal developer platforms or platform engineering teams.
Benefits & Why Join
Competitive salary and benefits package.
Flexible working arrangements (remote or hybrid).
Opportunity to work on life‑changing AI technology that directly impacts patient outcomes.
Join a team that combines cutting‑edge innovation with a mission to save lives and improve health equity.
Continuous learning opportunities with access to the latest tools and advancements in AI and healthcare.
Medical insurance; vision insurance; 401(k).
Job Details
Location: Boston, MA
Employment type: Full‑time
Seniority level: Mid‑Senior
Work authorization: Must be a U.S. citizen or green‑card holder, or hold a valid H‑1B visa.