Pryon
Senior Product Manager - Accelerated Compute Memory Systems
Pryon, New York, New York, US, 10261
We are seeking a Product Manager for Large-Scale AI Infrastructure to define and drive the strategy for Pryon’s HPC RAG platform. This platform ingests and indexes petabyte-scale multimodal data and serves low-latency, high-throughput inference across mission-critical knowledge bases.
About Pryon:
We’re a team of AI, technology, and language experts whose DNA lives in Alexa, Siri, Watson, and virtually every human language technology product on the market. We’re building an industry-leading knowledge management and Retrieval-Augmented Generation (RAG) platform with proprietary NLP capabilities that transform unstructured data into meaningful experiences with high accuracy and speed.
In This Role You Will:
Own the end-to-end product vision for Pryon’s HPC platform across data ingestion, retrieval, inference, compliance, and scalability
Translate scale targets (thousands of concurrent requests, billions of documents, sub-second latency) into actionable product requirements and success metrics
Partner with Engineering, Security, Research, and Infrastructure teams to design horizontal scalability, multimodal ingestion, and regulatory compliance (FOIA, OIG, CISA)
Collaborate with federal and enterprise customers to capture requirements for IL5/6 and public-facing deployments
Drive roadmap alignment across ingestion, retrieval, ranking, and inference sub-tracks, ensuring delivery against Beta and GA milestones
Serve as the internal and external voice of the AI platform, presenting to agency stakeholders, strategic partners, and the Pryon executive team
What You'll Need to Be Successful:
10+ years in product management, with at least 5 in AI/ML infrastructure or large-scale distributed systems
Proven track record building state-of-the-art inference and RAG systems, with direct experience delivering high-performance distributed systems at scale
Deep familiarity with RAG architectures, multimodal ingestion, and LLM-backed retrieval
Familiarity with HPC cluster management software such as Slurm
Strong background in high-throughput, low-latency production environments—ideally with petabyte-scale ingestion and real-time inference
Understanding of federal compliance environments (audit logging, IL5/6 certifications)
Ability to balance technical depth with executive-level communication, ensuring teams and stakeholders are aligned
Benefits for Full Time Employees:
Remote-first organization
100% company-paid Health/Dental/Vision benefits for you and your dependents
Life Insurance, Short-term and Long-term Disability
401(k)
Unlimited PTO
We are interested in every qualified candidate who is authorized to work in the United States. However, we are not able to sponsor or take over sponsorship of employment visas at this time.
We will not discriminate on the basis of race, religion, sex, sexual orientation, or national origin in violation of applicable civil rights laws.
Location: New York, NY
Salary: $152,000.00-$212,000.00