Logo
Mistral AI

Applied Scientist/Research Engineer - Palo Alto/NYC

Mistral AI, Palo Alto, California, United States, 94306

Save Job

Applied Scientist/Research Engineer - Palo Alto/NYC

Join to apply for the Applied Scientist/Research Engineer - Palo Alto/NYC role at

Mistral AI . About Mistral

At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life. We democratize AI through high-performance, optimized, open-source and cutting-edge models, products and solutions. Our comprehensive AI platform is designed to meet enterprise needs, whether on-premises or in cloud environments. Our offerings include le Chat, the AI assistant for life and work. We are a dynamic, collaborative team passionate about AI and its potential to transform society. Our diverse workforce thrives in competitive environments and is committed to driving innovation. Our teams are distributed between France, USA, UK, Germany and Singapore. We are creative, low-ego and team-spirited. Join us to be part of a pioneering company shaping the future of AI. About the Job

Mistral AI is seeking Applied Scientists and Research Engineers to drive innovative research and collaborate with clients on complex research projects. You will develop SOTA models across different modalities such as text, image, and speech. By developing novel methods and research ideas you will apply these models across a diverse set of use cases and domains. Working cross-functionally with both external and internal science, engineering, and product teams you will deliver high-impact AI solutions that turn the needle. What you will do

Run pre-training, post-training and deploy state of the art models on clusters with thousands of GPUs. You will handle OOM errors and NCCL issues as needed. Generate and curate data for pre-training and post-training, evaluate model performance, and strive to beat expectations. Develop tools and frameworks to facilitate data generation, model training, evaluation and deployment. Collaborate with cross-functional teams to tackle complex use cases using agents and RAG pipelines. Manage research projects and communications with client research teams. About You

Fluent in English with excellent communication skills; able to explain complex technical concepts to technical and non-technical audiences. Expert with PyTorch or JAX. Able to contribute to a large codebase and work independently with little guidance. Write clean, readable, high-performance, fault-tolerant Python code. Self-motivated and able to ship without requiring explicit roadmaps or constant supervision. Low-ego, collaborative and eager to learn. Proven track record through personal projects, professional projects or in academia. Preferred qualifications: Hold a PhD or master's in a relevant field (e.g., Mathematics, Physics, Machine Learning, Computer Science & Engineering); exceptional candidates from other backgrounds may apply. Experience across agents, multi-modality, robotics, diffusion, or time-series research. Contribution to large codebases used by many (open source or industry). Publications in top academic journals or conferences. Love improving existing code by fixing typing issues, adding tests and improving CI pipelines. Benefits

Competitive salary and bonus structure Generous equity Health: Blueshield of California medical coverage for employees (and 75% for dependents) 401K with 6% matching Paid time off: 18 days Transportation: reimbursement for office parking or $120/month for public transport Betterup coaching on a voluntary basis Gym membership reimbursement: $120/month Meal stipend: $400 monthly Visa sponsorship Seniority level

Not Applicable Employment type

Full-time Job function

Engineering and Information Technology Industries

Technology, Information and Internet

#J-18808-Ljbffr