Logo
HCLTech

Senior Engineer Cognitive Infrastructure (Santa Clara County)

HCLTech, Santa Clara, California, United States

Save Job

Role: Senior Engineer

Cognitive Infrastructure Location: Santa Clara, CA /Dallas, TX Job Description: This is a key strategic role for working with Nvidia and other key Tech OEMs like Dell, HPE, Cisco etc, internal stakeholders and customers to generate business opportunities in the US and EU region respectively. The person would be working with Sales, delivery and Pre-sales groups to identify, generate and manage opportunities related to AI and AI Factory tracks. This is a quota driven role that spans across on-premises infrastructure, private cloud, platforms and public cloud with reference to AI. This role involves working closely with sales, Pre-sales team, and delivery teams to understand customer needs, create opportunities and position the hybrid cloud AI and AI factory offerings effectively. A strategic professional responsible for executing and implementing AI infra and platform solutions This role requires deep technical and hands-on experience to deliver AI Infra and AI factory offerings

Responsibilities: Senior Engineer

Cognitive Infrastructure (Kubernetes | NVIDIA | MLOps | GenAI) , Total Experience: 12+ Years

What this person will do - Design & Operate Hybrid Kubernetes clusters on AWS/GCP/Azure and onprem (baremetal, DGX, Grace Hopper). Deploy & manage the NVIDIA GPU Operator (drivers, CUDA, MIG, device plugins) and create GPUaware scheduling policies. Build productiongrade MLOps pipelines with Kubeflow Pipelines, GitOps (Argo CD/Flux), MLflow/DVC. Deploy & operate LLMs using NVIDIA Triton, vLLM, TensorRTLLM, or custom FastAPI/GRPC services

include quantization, dynamic batching, safetyfilter integration and pertenant quota enforcement. Integrate vector databases (Milvus, Pinecone, Qdrant, Weaviate, FAISS) for retrievalaugmented generation and similarity search. Implement observability (Prometheus, Grafana, Loki/ELK, OpenTelemetry) and define SLO/SLI dashboards. Enforce security & compliance

RBAC, OPA/Gatekeeper, Vault/KMS, image signing, GDPR/HIPAA guidelines. Optimize cost & capacity

GPU quota controls, spotinstance usage, autoscaling, transparent cost reporting. Enable teams

turn notebooks into reproducible pipelines, run officehours, write docs/tutorials. Drive technology roadmap

evaluate new NVIDIA releases, opensource projects (Kubeflow, LangChain, vLLM, TGI etc.) and lead PoCs.

Required Experience 8+ years building & operating production Kubernetes (cloud + onprem), Deep knowledge of NVIDIA GPU Operator stack (drivers, CUDA, MIG). Strong handson with Kubeflow Pipelines or equivalent MLOps tools, Experience deploying LLMs at scale (quantization, LoRA, inference optimization). Proficiency in Python (PyTorch, TensorFlow, HuggingFace, LangChain) and IaC (Helm, Kustomize, Terraform). Experience with vector search engines (Milvus, Pinecone, etc.), Solid observability/SRE background (Prometheus, Grafana, OpenTelemetry). Securityfirst mindset (RBAC, OPA, Vault, image signing).

NicetoHave : Work with NVIDIA DGX / Grace Hopper hardware, Knowledge of OpenShift, k3s, or edgefocused deployments. Experience with LWS, Kserve, or serverless inference, Opensource contributions (Kubernetes, Kubeflow, Triton, Milvus, vLLM). Certifications

CKA, Any Cloud AI/ML Certification.. Nvididia Certifications

Specifics: Hands on Job Techno-Commercial skills are a must

How Youll Grow At HCLTech, we offer continuous opportunities for you to find your spark and grow with us. We want you to be happy and satisfied with your role and to really learn what type of work sparks your brilliance the best. Throughout your time with us, we offer transparent communication with senior level employees, learning and career development programs at every level, and opportunities to experiment in different roles or even pivot industries. We believe that you should be in control of your career with unlimited opportunities to find the role that fits you best.

Equality & Opportunity for All As a company with employees representing 165 nationalities across the globe, we pride ourselves on being an equal opportunity employer, committed to providing equal employment opportunities to all applicants and employees regardless of race, religion, sex, color, age, national origin, pregnancy, sexual orientation, physical disability or genetic information, military or veteran status, or any other protected classification, in accordance with federal, state, and/or local law.