Logo
CyberCoders

AI Cloud Network Architect

CyberCoders, Miami, Florida, us, 33222

Save Job

Join to apply for the

AI Cloud Network Architect

role at

CyberCoders .

We are a post‑IPO, publicly traded company focused on AI and high‑performance computing infrastructure, founded in 2017 and experiencing rapid growth.

Role Overview We’re looking for a fully remote Network Engineer to spearhead the development of high‑performance, scalable networking infrastructure tailored for AI‑driven cloud environments. This role centers on architecting and fine‑tuning network solutions across geographically distributed data centers, enabling support for AI workloads across bare metal, virtual machines, and containerized platforms—from initial training to fine‑tuning and inference stages.

Responsibilities

Architect robust, high‑throughput data center network fabrics optimized for AI workloads and multi‑tenant cloud environments

Design and deploy EVPN/VXLAN overlays, L2/L3 switching topologies, and host‑level multi‑networking across bare metal, VMs, and containers; proficient in MLAG switch environments

Implement BGP, VRFs, and ACLs to enforce network segmentation, tenant isolation, and dynamic security policy frameworks

Lead end‑to‑end design of advanced GPU interconnect fabrics (e.g., Infiniband, Spectrum‑X, RDMA/RoCEv2, DPUs/Smart NICs) leveraging CLOS/ECMP architectures

Build and maintain carrier‑grade edge routing systems with BGP‑based peering, automatic failover, and intelligent traffic engineering

Manage inter‑data center connectivity, routing governance, and resilience strategies across geographically distributed clusters

Define and implement SDN architectures via Netconf, gNMI, and controller‑based frameworks for dynamic network orchestration

Qualifications

5+ years of experience in data center or service provider networking

Proven expertise across key domains:

High‑speed Ethernet architectures (100G, 200G, 400G, 800G)

Advanced routing and switching technologies: EVPN/VXLAN, Layer 2/3, BGP, OSPF, VRFs, and QoS

High‑performance fabrics for AI/HPC workloads, including Infiniband and RoCEv2

Multi‑host networking across bare metal, KVM, and Kubernetes environments

Experience designing GPU cluster networks with technologies such as NVIDIA UFM, SHARP, and data center fabric best practices

Proficient with infrastructure automation tools, including Netbox, Netconf, and IaC frameworks like Ansible and Terraform

Practical knowledge of vendor platforms: Arista EOS, Juniper Junos, and Mellanox/NVIDIA Cumulus and Onyx

Strong grasp of network telemetry, packet‑level diagnostics, and performance optimization techniques

Excellent communication and documentation skills, with a collaborative approach to cross‑functional engineering work

Bonus Skills

Familiarity with AI/ML workload patterns, including model training and inference bottlenecks

Background in multi‑tenant cloud networking for platform‑as‑a‑service (PaaS) or infrastructure‑as‑a‑service (IaaS) environments

Certifications such as CCIE, JNCIE, or comparable credentials

$175k - $250k/year + BONUS!

RSUs

401k with match

Comprehensive benefits and more!

#J-18808-Ljbffr