Reducto, Inc.
Overview
Reducto helps AI teams ingest real-world enterprise data with state-of-the-art accuracy. The vast majority of enterprise data, from financial statements to health records, is locked in unstructured file formats like PDFs and spreadsheets. We train vision models to read those documents the way a human would, making it possible to build products, train models, and automate processes at scale.

We’ve grown quickly, increasing revenue 7x year over year, and now work with hundreds of companies ranging from leading AI teams (Harvey, Vanta, Scale) to enterprises (FAANG, a top-3 trading firm). We’ve raised over $100M from world-class investors like a16z, Benchmark, and First Round Capital, and are looking for senior engineers for our Platform team.

The Opportunity
As a Senior Software Engineer on our Platform team, you’ll work on our core API that powers document parsing for hundreds of companies. You’ll integrate cutting-edge LLMs, optimize document processing pipelines, and build the platform and infrastructure that make state-of-the-art document understanding accessible at scale. You’ll work closely with our ML engineers to serve in-house models at the frontier of document understanding.

Responsibilities
The core work will include:
- Integrating and optimizing LLM calls for structured extraction, form filling, and document understanding tasks
- Experimenting with new techniques and output structures to improve LLM accuracy and reduce latency
- Making improvements to API design and pre-processing algorithms (chunking, structured extraction, etc.) based on customer feedback
- Building and improving document processing pipelines that handle everything from PDFs to spreadsheets at scale
- Driving backend latency and costs down without sacrificing quality
- Building internal tooling and evals to better understand and analyze failure cases
- Working directly with the founders and customers to shape the product direction and engineering strategy

We would love to meet you if you:
- Have prior experience founding a company or building products at early stages
- Have experience with prompt engineering, fine-tuning, or building AI agents
- Have experience delivering AI-powered products at internet scale
- Are ambitious and driven, and care a lot about doing great work with great people
- Keep up with the latest developments in ML / AI

Qualifications
- You have 5+ years of experience building, hardening, and scaling real-world applications, including 2+ years of experience integrating LLMs into production systems.
- You’re exceptional with Python or have a deep understanding of another language of your choice.
- You build your own tools as needed, like a quick Streamlit app to test hypotheses or create a dataset.

Approach
- You take a quantitative approach to building products.
- You can debug, experiment, and iterate fast.
- You’re comfortable getting hands-on with the full development lifecycle, from ideation to shipping to users.
- You have a high bar for quality that you hold yourself and your team to.

Location and expectations
This is an in-person role at our office in SF. We’re an early-stage company, which means the role requires working hard and moving quickly. Please only apply if that excites you.

About Reducto
Nearly 80% of enterprise data is in unstructured formats like PDFs
PDFs are the status quo for enterprise knowledge in nearly every industry. Insurance claims, financial statements, invoices, and health records are all stored in a structure that’s simply impractical for use in digital workflows. This isn’t an inconvenience; it’s a critical bottleneck that leads to dozens of wasted hours every week.

Traditional approaches fail at reliably extracting information in complex PDFs
OCR and even more sophisticated ML approaches work for simple text documents but are unreliable for anything more complex. Text from different columns is jumbled together, figures are ignored, and tables are a nightmare to get right. Overcoming this usually requires a large engineering effort dedicated to building specialized pipelines for every document type you work with.

Reducto breaks document layouts into subsections and then contextually parses each depending on the type of content. This is made possible by a combination of vision models, LLMs, and a suite of heuristics we built over time.

Put simply, we can help you:
- Accurately extract text and tables even with nonstandard layouts
- Automatically convert graphs to tabular data and summarize images in documents
- Extract important fields from complex forms with simple, natural language instructions
- Build powerful retrieval pipelines using Reducto’s document metadata
- Intelligently chunk information using the document’s layout data

Benefits at Reducto
Lunch: Receive a free lunch to eat with your teammates daily at the office.
Reimbursed Transportation: Provide us with your receipts and we’ll take care of the costs.
Insurance: Generous health insurance covering medical, dental, and vision.
Health and Wellness Budget: We provide up to $150 / mo reimbursement for health and wellness spending, such as gym memberships, fitness classes, or similar.
Parental Leave: Work with us to build a leave schedule that works for you and your family.

Reducto is an Equal Opportunity Employer committed to diversity and inclusion in the workplace. All qualified applicants will receive consideration for employment without regard to sex, race, color, age, national origin, religion, physical and mental disability, genetic information, marital status, sexual orientation, gender identity / assignment, citizenship, pregnancy or maternity, protected veteran status, or any other status prohibited by applicable national, federal, state or local law.