Logo
NVIDIA

Solutions Architect, Generative AI

NVIDIA, Santa Clara, California, us, 95053

Save Job

Overview

NVIDIA is seeking an outstanding AI Engineer or Solutions Architect to join our Generative AI partner enablement team. In this role, you will act as both a strategic technical expert and a hands-on developer, building proof-of-concept solutions and reference architectures for innovative Generative AI applications. You will provide partners with technical blueprints and guidance to architect and deploy their applications using NVIDIA’s full AI stack, from GPU systems and CUDA to NeMo and Triton. The Generative AI Partners Enablement SA team applies next-generation technologies to solve customer problems. We act as trusted advisors and technical partners to our ecosystem. As a member of the NVIDIA Generative AI Solution Architecture team, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. You’ll help deliver accelerated computing AI and production-grade AI solutions at scale. Responsibilities

Serve as the primary technical domain expert for pre- and post-sale for partners, embedding deeply with them to design and deploy Generative AI solutions. Maintain strong relationships with leadership and technical teams to drive adoption and successful utilization of NVIDIA GenAI platforms. Accelerate partner/customer time to value by providing repeatable reference architecture guidance, building hands-on prototypes, and advising on standard methodologies for scaling solutions to production. Define the scope, success metrics, and evaluation criteria for partner-led customer projects, ensuring they are built on standardized and reproducible GPU-accelerated workflows. Enable strategic partners to launch their own Professional Services and platforms by tailoring NVIDIA agentic AI blueprints for high-impact customer workloads. Proactively drive deeper adoption and utilization of NVIDIA’s Generative AI products. Codify knowledge and operationalize technical success practices to help partners scale impact across industries and workloads. Qualifications

MSc, PhD in Computer Science, Electrical Engineering, Software Engineering, ML Engineering, or related fields (or equivalent experience). 5+ years of relevant work experience in developing and deploying AI models at scale as a Software Engineer or deep learning engineer. Consistent track record of building enterprise-grade agentic AI systems using open-source models with a solid foundation in deep learning, emphasizing generative models. Hands-on experience with LLM and agentic frameworks (NeMo Agent Toolkit, LangChain, Semantic Kernel, Crew.ai, AutoGen) and evaluation/observability platforms. Comfortable building prototypes or proofs of concept. Strong coding skills and proficiency in Python, C++, and deep learning frameworks (PyTorch, TensorFlow). Excellent communication and presentation skills to effectively collaborate with internal executives, partners, and customers. Preferred qualifications

Demonstrated expertise and hands-on experience with NVIDIA AI platforms. Understanding of advanced agent architectures and emerging communication protocols (MCP or Google A2A). Strong practical knowledge of Generative AI and LLM development, including ability to train GPT and Megatron models. Understanding of MLOps lifecycle management and experience with LLMOps workflows. Experience with CUDA programming and benchmarking/analyzing performance of foundation models. Compensation and benefits

Your base salary will be determined based on location, experience, and pay for similar roles. The base salary range is 148,000 USD - 235,750 USD. You will also be eligible for equity and benefits. Application and diversity

Applications for this job will be accepted at least until August 14, 2025. NVIDIA is committed to fostering a diverse work environment and is proud to be an equal opportunity employer. We value diversity in our current and future employees and do not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

#J-18808-Ljbffr