Logo
Pryon

Senior Product Manager - Accelerated Compute Memory Systems

Pryon, New York, New York, us, 10261

Save Job

Senior Product Manager - Accelerated Compute Memory Systems We are seeking a Product Manager for Large-Scale AI Infrastructure to define and drive the strategy for Pryon’s HPC RAG platform. This platform ingests and indexes petabyte-scale multimodal data and serves low-latency, high-throughput inference across mission-critical knowledge bases.

About Pryon:

We’re a team of AI, technology, and language experts whose DNA lives in Alexa, Siri, Watson, and virtually every human language technology product on the market. We’re building an industry-leading knowledge management and Retrieval-Augmented Generation (RAG) platform with proprietary NLP capabilities that transform unstructured data into meaningful experiences with high accuracy and speed.

In This Role You Will:

Own the end-to-end product vision for Pryon’s HPC platform across data ingestion, retrieval, inference, compliance, and scalability

Translate scale targets (thousands of concurrent requests, billions of documents, sub-second latency) into actionable product requirements and success metrics

Partner with Engineering, Security, Research, and Infrastructure teams to design horizontal scalability, multimodal ingestion, and regulatory compliance (FOIA, OIG, CISA)

Collaborate with federal and enterprise customers to capture requirements for IL5/6 and public-facing deployments

Drive roadmap alignment across ingestion, retrieval, ranking, and inference sub-tracks, ensuring delivery against Beta and GA milestones

Serve as the internal and external voice of the AI platform, presenting to agency stakeholders, strategic partners, and the Pryon executive team

What You'll Need to Be Successful:

10+ years in product management, with at least 5 in AI/ML infrastructure or large-scale distributed systems

Proven track record building state-of-the-art inference and RAG: direct experience delivering high-performance distributed systems at scale

Deep familiarity with RAG architectures, multimodal ingestion, and LLM-backed retrieval

Familiarity with HPC cluster management software such as Slurm

Strong background in high-throughput, low-latency production environments—ideally with petabyte-scale ingestion and real-time inference

Understanding of federal compliance environments (audit logging, IL5/6 certifications)

Ability to balance technical depth with executive-level communication, ensuring teams and stakeholders are aligned

Benefits for Full Time Employees:

Remote first organization

100% Company paid Health/Dental/Vision benefits for you and your dependents

Life Insurance, Short-term and Long-term Disability

401k

Unlimited PTO

We are interested in every qualified candidate who is authorized to work in the United States. However, we are not able to sponsor or take over sponsorship of employment visas at this time.

We will not discriminate on the basis of race, religion, sex, sexual orientation, or national origin in violation of applicable civil rights laws.

Location : New York, NY

Salary: $152,000.00-$212,000.00

#J-18808-Ljbffr