Energy Jobline ZR
Senior Engineering Manager - Accelerated Compute Memory Systems in New York
Energy Jobline ZR, New York, New York, us, 10261
Energy Jobline is the largest and fastest growing global Energy Job Board and Energy Hub. We have an audience reach of over 7 million energy professionals, 400,000+ monthly advertised global energy and engineering jobs, and work with the leading energy companies worldwide.
We focus on the Oil & Gas, Renewables, Engineering, Power, and Nuclear markets as well as emerging technologies in EV, Battery, and Fusion. We are committed to ensuring that we offer the most exciting career opportunities from around the world for our jobseekers.
About Pryon: We’re a team of AI, technology, and experts whose DNA lives in Alexa, Siri, Watson, and virtually every human technology product on the market. Now we’re building an industry-leading knowledge management and Retrieval‑Augmented (RAG) platform. Our proprietary, cutting‑edge natural processing capabilities transform unstructured data into meaningful experiences that increase productivity with unmatched accuracy and speed.
Pryon is building one of the industry's most ambitious AI infrastructure platforms: a petabyte‑scale ingestion and inference system powering mission‑critical government and enterprise deployments. We need an Engineering Manager with deep HPC expertise—someone who can teach, not be taught. You’ll lead the technical team building our ingestion, retrieval, and inference layers, ensuring scalability, reliability, and compliance.
In This Role You Will:
Build and lead a team delivering the ingestion, retrieval, and inference layers that will power mission‑critical deployments for commercial and federal entities with millions of public users.
Architect and deliver horizontally scalable, fault‑tolerant systems capable of handling billions of documents and burst loads of 30K+ concurrent users.
Guide implementation of multimodal ingestion pipelines (PDF, HTML, DOCX, JSON, XML, PPTX, TIFF).
Oversee design and optimization of LLM‑driven data ingestion and retrieval workflows.
Own optimization and tuning of high‑throughput, low‑latency production environments via async orchestration frameworks.
Establish performance benchmarking, compliance frameworks, and automated testing for scale.
You will balance technical leadership with people leadership, guiding architecture decisions, while also scaling and mentoring a high‑performing team.
Collaborate cross‑functionally with Product, Executive Leadership, and Customer Success.
What You’ll Need to Be Successful:
10+ years in software engineering, 5+ years in management roles with large‑scale AI/ML systems and infrastructure.
Expert‑level proficiency in Python and Golang, with 5+ years building production distributed systems.
Experience with orchestration frameworks (Kubernetes, Ray, Dask) and proficiency with vector databases (Pinecone, Weaviate, Qdrant, or similar).
Experience with message queuing systems (Kafka, Pulsar, RabbitMQ).
In‑depth knowledge and hands‑on experience building scalable distributed architectures and high‑performance compute systems.
Proven experience in multimodal ingestion pipelines within RAG platforms.
Direct experience in designing, fine‑tuning, and optimizing LLMs for ingestion and retrieval workloads.
Previous success managing engineering teams delivering production‑grade, HPC‑scale RAG systems.
Deep understanding of infra domains: compute, storage, networking, observability, security, disaster recovery, and cost management.
Familiarity with HPC cluster management software such as Slurm.
Familiarity with cloud platforms (AWS, Azure, GCP) and/or on‑prem datacenter operations.
Benefits for Full‑Time Employees: Remote first organization. 100% Company paid Health, Dental, Vision benefits for you and your dependents. Life Insurance, Short‑term and Long‑term. 401k. Unlimited PTO.
We are interested in every qualified candidate who is authorized to work in the United States. However, we are not able to sponsor or take over sponsorship of employment visas at this time.
Pryon will not consider preference or other actions that violate the Nation’s civil rights laws.
We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.
If you are interested in applying for this job please press the Apply button and follow the application process. Energy Jobline wishes you the very best of luck in your next career move.
#J-18808-Ljbffr
We focus on the Oil & Gas, Renewables, Engineering, Power, and Nuclear markets as well as emerging technologies in EV, Battery, and Fusion. We are committed to ensuring that we offer the most exciting career opportunities from around the world for our jobseekers.
About Pryon: We’re a team of AI, technology, and experts whose DNA lives in Alexa, Siri, Watson, and virtually every human technology product on the market. Now we’re building an industry-leading knowledge management and Retrieval‑Augmented (RAG) platform. Our proprietary, cutting‑edge natural processing capabilities transform unstructured data into meaningful experiences that increase productivity with unmatched accuracy and speed.
Pryon is building one of the industry's most ambitious AI infrastructure platforms: a petabyte‑scale ingestion and inference system powering mission‑critical government and enterprise deployments. We need an Engineering Manager with deep HPC expertise—someone who can teach, not be taught. You’ll lead the technical team building our ingestion, retrieval, and inference layers, ensuring scalability, reliability, and compliance.
In This Role You Will:
Build and lead a team delivering the ingestion, retrieval, and inference layers that will power mission‑critical deployments for commercial and federal entities with millions of public users.
Architect and deliver horizontally scalable, fault‑tolerant systems capable of handling billions of documents and burst loads of 30K+ concurrent users.
Guide implementation of multimodal ingestion pipelines (PDF, HTML, DOCX, JSON, XML, PPTX, TIFF).
Oversee design and optimization of LLM‑driven data ingestion and retrieval workflows.
Own optimization and tuning of high‑throughput, low‑latency production environments via async orchestration frameworks.
Establish performance benchmarking, compliance frameworks, and automated testing for scale.
You will balance technical leadership with people leadership, guiding architecture decisions, while also scaling and mentoring a high‑performing team.
Collaborate cross‑functionally with Product, Executive Leadership, and Customer Success.
What You’ll Need to Be Successful:
10+ years in software engineering, 5+ years in management roles with large‑scale AI/ML systems and infrastructure.
Expert‑level proficiency in Python and Golang, with 5+ years building production distributed systems.
Experience with orchestration frameworks (Kubernetes, Ray, Dask) and proficiency with vector databases (Pinecone, Weaviate, Qdrant, or similar).
Experience with message queuing systems (Kafka, Pulsar, RabbitMQ).
In‑depth knowledge and hands‑on experience building scalable distributed architectures and high‑performance compute systems.
Proven experience in multimodal ingestion pipelines within RAG platforms.
Direct experience in designing, fine‑tuning, and optimizing LLMs for ingestion and retrieval workloads.
Previous success managing engineering teams delivering production‑grade, HPC‑scale RAG systems.
Deep understanding of infra domains: compute, storage, networking, observability, security, disaster recovery, and cost management.
Familiarity with HPC cluster management software such as Slurm.
Familiarity with cloud platforms (AWS, Azure, GCP) and/or on‑prem datacenter operations.
Benefits for Full‑Time Employees: Remote first organization. 100% Company paid Health, Dental, Vision benefits for you and your dependents. Life Insurance, Short‑term and Long‑term. 401k. Unlimited PTO.
We are interested in every qualified candidate who is authorized to work in the United States. However, we are not able to sponsor or take over sponsorship of employment visas at this time.
Pryon will not consider preference or other actions that violate the Nation’s civil rights laws.
We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.
If you are interested in applying for this job please press the Apply button and follow the application process. Energy Jobline wishes you the very best of luck in your next career move.
#J-18808-Ljbffr