Overview
Distinguished AI Engineer (Generative AI, Agentic Frameworks) role at Capital One. We are creating responsible and reliable AI systems to reimagine banking for customers and businesses. Our AI/ML efforts focus on real-time, personalized experiences, scalable infrastructure, and world-class applied science and engineering.
In this role you will
- Partner with a cross-functional team of engineers, research scientists, technical program managers, and product managers to deliver AI-powered products that change how our associates work and how our customers interact with Capital One.
- Design, develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability.
- Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more.
- Invent and introduce state-of-the-art LLM optimization techniques to improve performance (scalability, cost, latency, throughput) of large-scale production AI systems.
- Contribute to the technical vision and the long-term roadmap of foundational AI systems at Capital One.
The Ideal Candidate
- You love to build systems, take pride in the quality of your work, and want to help change banking for good.
- Passion for staying abreast of the latest research and the ability to apply novel techniques in production.
- You adapt quickly, bring clarity to undefined problems, ask questions, and can articulate findings concisely; you share new ideas even when unproven.
- You are deeply technical with a strong foundation in engineering and mathematics, and can identify optimization opportunities across hardware, software, and AI.
- You are a resilient trail blazer who can forge new paths to achieve business goals when the route is unknown.
Basic Qualifications
- Bachelor's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields with at least 8 years of experience developing AI/ML algorithms or a Master's degree with at least 6 years of experience; or equivalent.
- At least 8 years of experience programming with Python, Go, Scala, or Java.
Preferred Qualifications
- 8+ years of experience deploying scalable and responsible AI solutions on cloud platforms (e.g., AWS, Google Cloud, Azure).
- Experience architecting, designing, developing, integrating, delivering, and supporting complex AI systems.
- Demonstrated ability to lead and mentor multiple engineering teams and influence cross-functional stakeholders up to the VP level.
- Experience developing and applying state-of-the-art techniques for optimizing training and inference software to improve hardware utilization, latency, throughput, and cost.
- 2+ years of experience supporting Agentic Frameworks (LangChain, CrewAI, Semantic Kernel, AutoGen) and 2+ years with LLMOps (Vertex AI, SageMaker, Azure ML).
- Experience developing AI/ML algorithms or technologies (LLM Inference, Similarity Search and VectorDBs, Guardrails, Memory) using Python, C++, C#, Java, or Golang.
Compensation and Benefits
The minimum and maximum full-time annual salaries for this role are listed below, by location. This salary information is for candidates hired to work in listed locations. Salaries for part-time roles will be prorated based on hours worked.
San Francisco, CA: $287,800 - $328,500 for Distinguished AI Engineer
San Jose, CA: $287,800 - $328,500 for Distinguished AI Engineer
Candidates hired to work in other locations will be paid at the applicable location salary range. This role is eligible for performance-based incentives, which may include cash bonuses and/or long-term incentives.
Equal Opportunity
Capital One is an equal opportunity employer (EOE, including disability and veteran status) and maintains a drug-free workplace. We consider qualified applicants with criminal histories in a manner consistent with applicable laws and regulations. For accommodations during the application process, contact
Job Details
Seniority level: Mid-Senior level
Employment type: Full-time
Job function: Engineering and Information Technology
#J-18808-Ljbffr