Logo
Tandym Group

Sr. Kubernetes Platform Engineer/SME

Tandym Group, Washington, District of Columbia, us, 20022

Save Job

The Senior Kubernetes Platform Engineer will lead an AWS EKS-based multi-tenant platform. This role provides technical leadership, manages Kubernetes platform architecture, and ensures stability, scalability, and security for mission-critical workloads. The Senior Engineer oversees day-to-day platform operations, directs automation efforts, and serves as the escalation point for complex technical issues.

Key Responsibilities • Lead Kubernetes cluster administration (EKS) including provisioning, upgrades, node group management, and patching. • Architect and optimize platform infrastructure, ensuring scalability and adherence to security baselines. • Oversee creation and maintenance of Kubernetes IaC (Helm charts, Terraform modules) for tenant onboarding and environment configuration. • Manage and patch Kubernetes-layer tooling (Cilium, Karpenter) and associated open-source components. • Own platform network configuration (DNS, firewall services/TIC 3.0, VPN/tunnels). • Direct base image automation, distribution, and security incident response. • Implement and maintain load balancer configurations (public & ingress, TIC 3.0 compliance). • Drives development of golden paths and developer self-service capabilities. • Lead contingency planning and disaster recovery strategies, including backup/restore procedures and incident communications. • Produce and maintain platform documentation (POAMs, SOPs, security SOPs, management plans). • Mentor junior platform engineers and coordinate work across Cloud Ops, App Support, and Automation teams.

Minimum Qualifications • BS/BA, 10 years of work experience, with: • 5-7 years in Kubernetes operations with AWS EKS in production. • Must have hands-on experience with Cilium and Karpenter. • Proven leadership in multi-tenant Kubernetes environments. • Expertise in Helm, Terraform, ArgoCD, AWS IAM/RBAC, Cilium, and ECR. • Experience with platform-level security and FedRAMP/NIST compliance. • Strong scripting skills (Bash, Python, Go). • Certified Kubernetes Administrator (CKA) or equivalent experience preferred. • Excellent oral and written communications skills • Ability to work nights and/or weekends for patching or deployments. • Ability to obtain a public trust clearance