Logo
MulticoreWare, Inc.

Cloud Platform Architect (Network, Storage & Kubernetes)

MulticoreWare, Inc., California, Missouri, United States, 65018

Save Job

MulticoreWare is a global software solutions & products company with its HQ in San Jose, CA, USA. With worldwide offices, it serves its clients and partners in North America, EMEA and APAC regions. Started by a group of researchers, MulticoreWare has grown to serve its clients and partners on HPC & Cloud computing, GPUs, Multicore & Multithread CPUS, DSPs, FPGAs and a variety of AI hardware accelerators. MulticoreWare was founded by a team of researchers that wanted a better way to program for heterogeneous architectures. With the advent of GPUs and the increasing prevalence of multi-core, multi-architecture platforms, our clients were struggling with the difficulties of using these platforms efficiently. We started as a boot-strapped services company and have since expanded our portfolio to span products and services related to compilers, machine learning, video codecs, image processing and augmented/virtual reality. Our hardware expertise has also expanded with our team; we now employ experts on HPC and Cloud Computing, GPUs, DSPs, FPGAs, and mobile and embedded platforms. We specialize in accelerating software and algorithms, so if your code targets a multi-core, heterogeneous platform, we can help.

Job Description

Role Overview We are looking for an experienced

Cloud Platform Architect

with deep expertise in

networking, storage, and Kubernetes

to design and implement a

cloud platform at scale , similar to AWS/GCP/Azure. The ideal candidate will have strong experience in

infrastructure automation, distributed systems, and large-scale platform engineering , with the ability to architect and lead the development of multi-tenant, high-performance cloud services.

Key Responsibilities Design and implement a scalable cloud platform covering compute, storage, and networking layers.

Define architecture for

multi-cluster Kubernetes environments , ensuring high availability, scalability, and security.

Build core services such as

identity & access management, service discovery, observability, and API gateways .

Networking

Architect multi-tenant networking for VPC/VNet equivalents, load balancers, firewalls, and service meshes.

Implement

SDN solutions (Calico, Cilium, OVN, etc.)

and network policy enforcement at scale.

Optimize inter-cluster and inter-datacenter connectivity.

Storage

Design and manage distributed storage solutions (Ceph, Rook, OpenEBS, MinIO, Lustre).

Architect persistent storage for Kubernetes (CSI drivers, snapshots, backup/restore).

Ensure data availability, durability, and compliance with SLAs.

Design

multi-tenant Kubernetes platforms

with advanced scheduling, security, and RBAC.

Automate

provisioning, scaling, and upgrades

using operators, Helm, and GitOps (ArgoCD/Flux).

Integrate with monitoring/logging (Prometheus, Grafana, Loki, ELK).

Automation & Infrastructure-as-Code

Implement full stack automation with

Terraform, Ansible, or Pulumi .

Drive CI/CD pipelines for infrastructure and application delivery.

Build self-service capabilities for internal teams.

Security & Compliance

Design security at all layers (network, storage, workloads).

Implement secrets management (Vault, External Secrets, KMS).

Ensure compliance with data governance and regulatory requirements.

Leadership

Collaborate with product and engineering teams to define roadmap and priorities.

Mentor and guide platform engineers and DevOps teams.

Evaluate new technologies and contribute to open-source where applicable.

Required Skills & Experience Networking : Deep knowledge of TCP/IP, routing, load balancing, DNS, SDN (Calico, Cilium, Istio/Linkerd).

Storage : Hands-on with distributed storage (Ceph, MinIO, Gluster, Rook) and Kubernetes storage orchestration (CSI).

Kubernetes : 5+ years experience, expert in multi-cluster deployments, operators, controllers, service mesh.

Cloud & Infra : Strong background in virtualization (KVM, VMware, OpenStack) and bare-metal automation (MAAS, Ironic, PXE, IPMI/Redfish).

IaC & Automation : Proficiency in Terraform, Ansible, GitOps tools (ArgoCD, Flux).

CI/CD : Experience with Jenkins, GitHub Actions, GitLab CI/CD.

Programming/Scripting : Proficiency in Go, Python, or Bash.

Monitoring/Observability : Prometheus, Grafana, Loki, ELK, Jaeger.

Strong knowledge of

distributed systems, high availability, and fault tolerance .

Preferred Qualifications Experience designing cloud platforms at scale (e.g., internal private cloud, hyperscaler background).

Contributions to open-source Kubernetes ecosystem (CNCF projects).

Familiarity with

service billing, quota management, and multi-tenancy at scale .

Exposure to

bare-metal cloud orchestration (Metal3, Tinkerbell, Equinix Metal, Ironic) .

Strong leadership and architectural decision-making skills.

#J-18808-Ljbffr