OAT Orion Advisor Technology, LLC

Senior AI Engineer

OAT Orion Advisor Technology, LLC, San Francisco

About this Opportunity:

At Orion, we’re building the intelligent infrastructure that powers modern financial advisory. As part of our mission to unify planning, investing, and client engagement, we’re looking for a Senior AI Engineer to rapidly deploy cutting-edge large language models (LLMs) and generative AI into production, powering scalable systems and customer experiences.

This is a hands-on, product-facing role focused on shipping working AI features. You’ll work closely with product managers, designers, and engineers to build systems that directly impact thousands of advisors and millions of investor accounts.

Looking for candidates in the San Francisco, CA area.

In this role, you’ll get to:

  • Integrate LLMs and generative AI into advisor-facing products and workflows

  • Design and build RAG systems using internal and external data sources

  • Apply techniques like prompt engineering, fine-tuning (e.g. LoRA), and custom embeddings to optimize domain-specific performance

  • Evaluate and productionize open-source and proprietary models (GPT-4, Claude, LLaMA, Mistral, etc.)

  • Develop APIs and services to deliver AI-powered features at scale

  • Collaborate across product and engineering teams to deliver rapidly and reliably

  • Continuously measure AI feature quality: accuracy, latency, and user impact

We’re looking for talent who:

  • Has proven experience working with large language models (e.g., OpenAI GPT, Claude, LLaMA)

  • Has familiarity with embedding models, tokenization, prompt engineering, and fine-tuning (e.g., LoRA)

  • Has practical experience with at least one vector database: Pinecone, Weaviate, or FAISS

  • Has strong proficiency with orchestration tools like LangChain, Haystack or LlamaIndex for building RAG systems

  • Has the ability to design and implement retrieval-augmented generation (RAG) systems

  • Has experience in data preprocessing, chunking and vectorization pipelines

  • Has 5+ years in software engineering, with 2+ years in applied ML or AI

  • Has deep understanding of LLMs, embeddings, RAG architecture, and vector search

  • Has strong grasp of prompt design, fine-tuning strategies, and model evaluation

  • Has proficiency with tools such as: LangChain, LlamaIndex, Pinecone, Weaviate, OpenAI, Hugging Face, FastAPI, Docker

  • Has strong engineering discipline and communication skills, especially in cross-functional settings

Salary Range:

$125,336.00 - $196,765.00

The pay listed in this posting is the estimated pay at the time of posting; however, it may vary depending on geographic location, job-related knowledge, skills, and experience. In addition, Orion offers a competitive benefits package that includes health, dental, vision, and disability coverage on day one, a 401(k) plan with employer match, paid parental leave, pet benefits including pawternity leave and pet insurance, student loan repayment, and more.

About Us

At Orion, we achieve our best work when we support one another, staying personally accountable to each other and the clients we serve. We create a welcoming environment where everyone is respected, valued, and heard. Our commitment to creating raving fans ensures we consistently exceed client expectations. Thinking differently is in our DNA: we always innovate, push boundaries, and reject the status quo to deliver transformative outcomes. Together, we support one another and see it through to success, driving our collective achievements and those of our clients.
