Fanatics
Platform Engineer III - (Infrastructure, Compute & Networking)
Fanatics, Washington, District of Columbia, us, 20022
Platform Engineer III (Infrastructure, Compute & Networking)
As Fanatics Betting & Gaming (FBG) accelerates Fanatics' mission to build the ultimate digital sports platform, the Platform Engineer III (Infrastructure, Compute & Networking) role empowers our engineering teams by delivering rock-solid cloud foundations. Joining our OddsFactory division, you'll focus on Azure best practices, networking, cluster/compute operations, and secure connectivity. OddsFactory powers in-play features at massive scale (billions of simulations/month). Today we run Windows services on Service Fabric in Azure; we're accelerating Kubernetes + GitOps (Argo CD) and exploring dual-cloud opportunities. In this Pipelines-focused role, you'll build secure, scalable CI/CD frameworks and developer tooling that streamline deployments and improve code quality. Responsibilities Provision and operate compute foundations: VM Scale Sets, Service Fabric clusters (Windows nodes), and Kubernetes clusters (Windows/Linux node pools), including upgrades, node image/patching, and custom autoscaling solutions Implement secure-by-default and Zero Trust patterns across network and runtime (segmentation, least privilege, workload identity/OIDC, egress controls, policy-as-code). Build reusable IaC modules and guardrails (RBAC, Azure Policy) that standardize landing zones and paved paths for product teams. Collaborate with application/platform teams on connectivity patterns (service endpoints, private access, ingress/egress), documentation, and enablement. Establish SLOs and deep observability for infra/network (Datadog), including metrics, traces, logs, flow logs, dashboards, alerts, and actionable runbooks. Partner with teams to onboard to golden paths; provide enablement, examples, and reference implementations. Contribute to capacity planning, cost optimization (FinOps), and operational readiness. Participate in an on-call rotation, ensuring platform stability and providing critical support for operational incidents. Occasionally travel for essential offsite meetings, special events, or collaborative team sessions. Required Qualifications 5+ years in platform engineering, SRE, or infrastructure roles operating Azure production environments. Hands-on expertise in Azure networking and security services (VNets, routing, NSGs/ASGs, Firewall, WAF/AGW, Private Link/Endpoints, DNS, load balancing). Operations experience with Service Fabric and/or Kubernetes cluster administration in production. Demonstrated proficiency in Infrastructure as Code (Terraform; Terragrunt preferred; ARM/CloudFormation a plus). Proficiency with programming/scripting (PowerShell & .NET preferred; familiarity with Go or Python is a bonus). Experience with observability (e.g., Datadog), chaos testing, and incident management. Strong problem-solving and collaboration skills; comfortable partnering across international teams. Outcome-oriented and data-driven; balance reliability, security, and delivery speed. Effective communication skills, with experience collaborating across diverse teams. Positive, flexible attitude comfortable working in a dynamic, fast-paced environment. Preferred Qualifications Resilience & Continuity: Multi-AZ/region architectures; automated failover; defined RTO/RPO with regular DR/chaos exercises and recovery runbooks. Traffic & Policy Plane: Service mesh awareness (mTLS, policy) and edge/CDN/WAF patterns; Cloudflare integrations (WAF, Workers/Queues, Access/Zero Trust, CDN). Packaging & Delivery: Helm and/or kustomize for cluster add-ons; Argo CD progressive delivery; multi-cluster promotion; app-of-apps patterns. Secrets & Identity: External Secrets patterns; Azure Key Vault/AWS KMS; workload identity/OIDC; least-privilege RBAC; policy-as-code (OPA/Gatekeeper or Kyverno). Familiarity or previous experience within the sports betting industry or strong interest in sports. Experience supporting scalable, real-time, or event-driven systems. Experience in fast-paced or start-up environments. The expected salary range for this role is based on job-related knowledge, skills, and experience. This role is eligible for the Fanatics Betting and Gaming annual bonus program and an equity award. *Salary range is listed in USD; actual salary will vary based on location. *Salary Range: $108,000 - $186,000 per year (actual salary will be determined in part by a successful candidate's geographic location). In addition to base salary, bonus, and equity, full-time employees are eligible for Medical, Dental, Vision, 401K, paid time off, and other benefits like GymPass, Pet Insurance, Family Care Benefits, and more. We'll also give you $700 to set up your home office! Please note that visa sponsorship is not available for this position. This is a remote position; however, candidates must reside in one of the following states: AL, AZ, GA, IA, IN, KY, LA, MI, MN, MO, NE, NH, NC, OH, OK, OR, PA, SC, SD, TN, TX, UT, VT, VA, WA, WI, WV. Alternatively, we are open to a hybrid role based in Denver, CO.
As Fanatics Betting & Gaming (FBG) accelerates Fanatics' mission to build the ultimate digital sports platform, the Platform Engineer III (Infrastructure, Compute & Networking) role empowers our engineering teams by delivering rock-solid cloud foundations. Joining our OddsFactory division, you'll focus on Azure best practices, networking, cluster/compute operations, and secure connectivity. OddsFactory powers in-play features at massive scale (billions of simulations/month). Today we run Windows services on Service Fabric in Azure; we're accelerating Kubernetes + GitOps (Argo CD) and exploring dual-cloud opportunities. In this Pipelines-focused role, you'll build secure, scalable CI/CD frameworks and developer tooling that streamline deployments and improve code quality. Responsibilities Provision and operate compute foundations: VM Scale Sets, Service Fabric clusters (Windows nodes), and Kubernetes clusters (Windows/Linux node pools), including upgrades, node image/patching, and custom autoscaling solutions Implement secure-by-default and Zero Trust patterns across network and runtime (segmentation, least privilege, workload identity/OIDC, egress controls, policy-as-code). Build reusable IaC modules and guardrails (RBAC, Azure Policy) that standardize landing zones and paved paths for product teams. Collaborate with application/platform teams on connectivity patterns (service endpoints, private access, ingress/egress), documentation, and enablement. Establish SLOs and deep observability for infra/network (Datadog), including metrics, traces, logs, flow logs, dashboards, alerts, and actionable runbooks. Partner with teams to onboard to golden paths; provide enablement, examples, and reference implementations. Contribute to capacity planning, cost optimization (FinOps), and operational readiness. Participate in an on-call rotation, ensuring platform stability and providing critical support for operational incidents. Occasionally travel for essential offsite meetings, special events, or collaborative team sessions. Required Qualifications 5+ years in platform engineering, SRE, or infrastructure roles operating Azure production environments. Hands-on expertise in Azure networking and security services (VNets, routing, NSGs/ASGs, Firewall, WAF/AGW, Private Link/Endpoints, DNS, load balancing). Operations experience with Service Fabric and/or Kubernetes cluster administration in production. Demonstrated proficiency in Infrastructure as Code (Terraform; Terragrunt preferred; ARM/CloudFormation a plus). Proficiency with programming/scripting (PowerShell & .NET preferred; familiarity with Go or Python is a bonus). Experience with observability (e.g., Datadog), chaos testing, and incident management. Strong problem-solving and collaboration skills; comfortable partnering across international teams. Outcome-oriented and data-driven; balance reliability, security, and delivery speed. Effective communication skills, with experience collaborating across diverse teams. Positive, flexible attitude comfortable working in a dynamic, fast-paced environment. Preferred Qualifications Resilience & Continuity: Multi-AZ/region architectures; automated failover; defined RTO/RPO with regular DR/chaos exercises and recovery runbooks. Traffic & Policy Plane: Service mesh awareness (mTLS, policy) and edge/CDN/WAF patterns; Cloudflare integrations (WAF, Workers/Queues, Access/Zero Trust, CDN). Packaging & Delivery: Helm and/or kustomize for cluster add-ons; Argo CD progressive delivery; multi-cluster promotion; app-of-apps patterns. Secrets & Identity: External Secrets patterns; Azure Key Vault/AWS KMS; workload identity/OIDC; least-privilege RBAC; policy-as-code (OPA/Gatekeeper or Kyverno). Familiarity or previous experience within the sports betting industry or strong interest in sports. Experience supporting scalable, real-time, or event-driven systems. Experience in fast-paced or start-up environments. The expected salary range for this role is based on job-related knowledge, skills, and experience. This role is eligible for the Fanatics Betting and Gaming annual bonus program and an equity award. *Salary range is listed in USD; actual salary will vary based on location. *Salary Range: $108,000 - $186,000 per year (actual salary will be determined in part by a successful candidate's geographic location). In addition to base salary, bonus, and equity, full-time employees are eligible for Medical, Dental, Vision, 401K, paid time off, and other benefits like GymPass, Pet Insurance, Family Care Benefits, and more. We'll also give you $700 to set up your home office! Please note that visa sponsorship is not available for this position. This is a remote position; however, candidates must reside in one of the following states: AL, AZ, GA, IA, IN, KY, LA, MI, MN, MO, NE, NH, NC, OH, OK, OR, PA, SC, SD, TN, TX, UT, VT, VA, WA, WI, WV. Alternatively, we are open to a hybrid role based in Denver, CO.