Quantum Integrators
AI Research Engineer / Data Scientist (LLM)
Quantum Integrators, Trenton, New Jersey, United States
US IT Staffing Expert | Matching top-tier tech talent with leading organizations | Contract | C2C | Full-Time
Job location: Morristown NJ
Tax Term - (Full- Time, W2)
Role Summary:
Own end-to-end delivery of LLM systems and agentic workflows. You$B!G(Bll drive architecture and evaluation strategy, productionize services with reliability and guardrails, and mentor juniors while partnering with product and stakeholders.
What You Do
Lead POC $B"* (B pilot $B"* (B production for LLM/agent solutions (tool/function calling, planning, fallbacks).
Architect retrieval stacks (chunking strategies, hybrid search, metadata, re-ranking) and hallucination controls.
Design offline + online evaluations, golden sets, CI eval gates, and experiment frameworks for correctness, faithfulness, safety, and bias.
Implement confidence scoring and calibration (retrieval/LLM agreement, self-consistency, logprobs/entropy) with abstain/deferral and user-visible citations.
Optimize cost/latency/reliability (caching, batching, routing, distillation/quantization where useful).
Stand up secure APIs/services with observability, tracing, RBAC, and audit logs; uphold privacy and compliance requirements.
Advise on prompting vs. fine-tuning/LoRA; own model and vendor selection and trade-offs.
Mentor teammates; collaborate on roadmaps and stakeholder-facing KPIs.
Must-Have (Core)
4–8+ years in applied ML/data/engineering, with shipped LLM applications (RAG, agent/tool calling, structured extraction, domain QA).
Demonstrated ownership of eval strategy (golden sets, pass/fail thresholds, A/B) and integration of eval gates into CI/CD.
Hands-on confidence & calibration techniques and production abstention/deferral policies.
Deep retrieval experience (hybrid search, filters, re-ranking, chunking) and mitigation of hallucinations.
Strong Python engineering (FastAPI), SQL/data wrangling, containers, and observability (logs/spans/metrics).
Cloud experience (Azure/AWS/GCP) and vector/search tech (e.g., Azure AI Search, Elasticsearch, Pinecone/FAISS).
Track record of taking ambiguous business problems to measurable outcomes and shipping on timelines.
Nice to Have
Fine-tuning/LoRA, prompt routing, multi-agent orchestration (planner/critic/tool catalogs).
Advanced uncertainty: calibration curves, conformal methods, or LLM-as-a-judge with safeguards.
Distillation/quantization and model serving efficiency; GPU/CPU trade-offs.
Document AI (layout-aware models, VLMs), Snowflake/Databricks, Airflow/Prefect.
Safety/compliance leadership (red-teaming playbooks, model cards, DPIA/PIA inputs).
Domain expertise (e.g., insurance/finance/healthcare) and regulated data handling.
O: 609-632-0621 ext.130
QuantumIntegrators.com
Seniority Level Mid-Senior level
Employment Type Full-time
Job Function Other
Industries IT Services and IT Consulting
Referrals increase your chances of interviewing at Quantum Integrators by 2x
We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
#J-18808-Ljbffr
Tax Term - (Full- Time, W2)
Role Summary:
Own end-to-end delivery of LLM systems and agentic workflows. You$B!G(Bll drive architecture and evaluation strategy, productionize services with reliability and guardrails, and mentor juniors while partnering with product and stakeholders.
What You Do
Lead POC $B"* (B pilot $B"* (B production for LLM/agent solutions (tool/function calling, planning, fallbacks).
Architect retrieval stacks (chunking strategies, hybrid search, metadata, re-ranking) and hallucination controls.
Design offline + online evaluations, golden sets, CI eval gates, and experiment frameworks for correctness, faithfulness, safety, and bias.
Implement confidence scoring and calibration (retrieval/LLM agreement, self-consistency, logprobs/entropy) with abstain/deferral and user-visible citations.
Optimize cost/latency/reliability (caching, batching, routing, distillation/quantization where useful).
Stand up secure APIs/services with observability, tracing, RBAC, and audit logs; uphold privacy and compliance requirements.
Advise on prompting vs. fine-tuning/LoRA; own model and vendor selection and trade-offs.
Mentor teammates; collaborate on roadmaps and stakeholder-facing KPIs.
Must-Have (Core)
4–8+ years in applied ML/data/engineering, with shipped LLM applications (RAG, agent/tool calling, structured extraction, domain QA).
Demonstrated ownership of eval strategy (golden sets, pass/fail thresholds, A/B) and integration of eval gates into CI/CD.
Hands-on confidence & calibration techniques and production abstention/deferral policies.
Deep retrieval experience (hybrid search, filters, re-ranking, chunking) and mitigation of hallucinations.
Strong Python engineering (FastAPI), SQL/data wrangling, containers, and observability (logs/spans/metrics).
Cloud experience (Azure/AWS/GCP) and vector/search tech (e.g., Azure AI Search, Elasticsearch, Pinecone/FAISS).
Track record of taking ambiguous business problems to measurable outcomes and shipping on timelines.
Nice to Have
Fine-tuning/LoRA, prompt routing, multi-agent orchestration (planner/critic/tool catalogs).
Advanced uncertainty: calibration curves, conformal methods, or LLM-as-a-judge with safeguards.
Distillation/quantization and model serving efficiency; GPU/CPU trade-offs.
Document AI (layout-aware models, VLMs), Snowflake/Databricks, Airflow/Prefect.
Safety/compliance leadership (red-teaming playbooks, model cards, DPIA/PIA inputs).
Domain expertise (e.g., insurance/finance/healthcare) and regulated data handling.
O: 609-632-0621 ext.130
QuantumIntegrators.com
Seniority Level Mid-Senior level
Employment Type Full-time
Job Function Other
Industries IT Services and IT Consulting
Referrals increase your chances of interviewing at Quantum Integrators by 2x
We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
#J-18808-Ljbffr