Box
Redwood City, CA, United States
Overview
Box (NYSE: BOX) is the leader in Intelligent Content Management. Our platform enables collaboration, content lifecycle management, and secure, automated workflows with enterprise AI. Box helps organizations transform work with billions of files and information flowing across teams and processes every day. Box is headquartered in Redwood City, CA, with offices worldwide. As a founding ML Engineer on the AI Agents team, you will build foundational agents powering the Box AI ecosystem, including DeepSearch, DeepResearch, Extract, and Compose. You will design techniques for intent detection, ranking, evaluation, retrieval-augmented generation (RAG), and multi-agent orchestration, and establish metrics to measure agent quality. You will collaborate with platform engineers to enable scalable deployment and empower Box teams and customers to configure agents for their workflows. What you’ll do
Build, evaluate, and evolve foundational agents such as DeepSearch, DeepResearch, Extract, and Compose. Develop techniques for intent detection, query understanding, ranking, and retrieval-augmented generation (RAG) to improve accuracy and relevance. Define metrics, evaluation pipelines, and benchmarks for agent quality, including precision/recall, factual grounding, and latency trade-offs. Research and implement best practices in retrieval, orchestration, and evaluation of multi-agent workflows. Collaborate with platform engineers to design core components that enable secure, reliable, and scalable deployment of agents. Partner with product teams to translate enterprise use cases into agentic solutions with measurable user experience improvements. Contribute to technical discussions, share research insights, and help define the roadmap for Box’s agent ecosystem. Participate in on-call rotation to help respond to and triage issues when needed. Who you are
You are passionate about building and evaluating AI agents that solve enterprise problems. You enjoy working at the intersection of machine learning and distributed systems, bridging research with production. You’ve designed or evaluated ML systems for search, ranking, RAG, or conversational AI. You like to own and deliver high-quality work—technically and in your teamwork. You are collaborative, curious, and comfortable mentoring or learning from other engineers and ML practitioners. Required skills
Strong background in machine learning, information retrieval, or natural language processing. Proficiency in Python, Java, or Scala. Experience designing, training, and evaluating ML models in production. Familiarity with retrieval systems, ranking models, RAG pipelines, or intent classification. BS degree in Computer Science, Machine Learning, or related field. 3+ years of industry experience building or evaluating ML-powered systems. Preferred skills
Advanced degree in computer science, machine learning, or related field. Hands-on experience with LangChain, LangGraph, or other agent frameworks. Familiarity with LLMs, embeddings, semantic search, indexing, and relevance optimization. Experience with cloud-based ML platforms (e.g., Vertex AI, AWS Bedrock, SageMaker). Experience with Kubernetes-based systems for deploying and scaling ML workloads. Experience evaluating generative AI systems for factuality, safety, and grounding. Equal Opportunity
Box is an equal opportunity employer. We value diversity and do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, disability, or any other protected characteristic. Box provides reasonable accommodations in the application process and for employees with disabilities as required by law.
#J-18808-Ljbffr
Box (NYSE: BOX) is the leader in Intelligent Content Management. Our platform enables collaboration, content lifecycle management, and secure, automated workflows with enterprise AI. Box helps organizations transform work with billions of files and information flowing across teams and processes every day. Box is headquartered in Redwood City, CA, with offices worldwide. As a founding ML Engineer on the AI Agents team, you will build foundational agents powering the Box AI ecosystem, including DeepSearch, DeepResearch, Extract, and Compose. You will design techniques for intent detection, ranking, evaluation, retrieval-augmented generation (RAG), and multi-agent orchestration, and establish metrics to measure agent quality. You will collaborate with platform engineers to enable scalable deployment and empower Box teams and customers to configure agents for their workflows. What you’ll do
Build, evaluate, and evolve foundational agents such as DeepSearch, DeepResearch, Extract, and Compose. Develop techniques for intent detection, query understanding, ranking, and retrieval-augmented generation (RAG) to improve accuracy and relevance. Define metrics, evaluation pipelines, and benchmarks for agent quality, including precision/recall, factual grounding, and latency trade-offs. Research and implement best practices in retrieval, orchestration, and evaluation of multi-agent workflows. Collaborate with platform engineers to design core components that enable secure, reliable, and scalable deployment of agents. Partner with product teams to translate enterprise use cases into agentic solutions with measurable user experience improvements. Contribute to technical discussions, share research insights, and help define the roadmap for Box’s agent ecosystem. Participate in on-call rotation to help respond to and triage issues when needed. Who you are
You are passionate about building and evaluating AI agents that solve enterprise problems. You enjoy working at the intersection of machine learning and distributed systems, bridging research with production. You’ve designed or evaluated ML systems for search, ranking, RAG, or conversational AI. You like to own and deliver high-quality work—technically and in your teamwork. You are collaborative, curious, and comfortable mentoring or learning from other engineers and ML practitioners. Required skills
Strong background in machine learning, information retrieval, or natural language processing. Proficiency in Python, Java, or Scala. Experience designing, training, and evaluating ML models in production. Familiarity with retrieval systems, ranking models, RAG pipelines, or intent classification. BS degree in Computer Science, Machine Learning, or related field. 3+ years of industry experience building or evaluating ML-powered systems. Preferred skills
Advanced degree in computer science, machine learning, or related field. Hands-on experience with LangChain, LangGraph, or other agent frameworks. Familiarity with LLMs, embeddings, semantic search, indexing, and relevance optimization. Experience with cloud-based ML platforms (e.g., Vertex AI, AWS Bedrock, SageMaker). Experience with Kubernetes-based systems for deploying and scaling ML workloads. Experience evaluating generative AI systems for factuality, safety, and grounding. Equal Opportunity
Box is an equal opportunity employer. We value diversity and do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, disability, or any other protected characteristic. Box provides reasonable accommodations in the application process and for employees with disabilities as required by law.
#J-18808-Ljbffr