Logo
Capital One

Senior Manager, Data Science - LLM Customization Team

Capital One, New York, New York, us, 10261

Save Job

Flex your interpersonal skills to translate the complexity of your work into tangible business goals.* Innovative. You continually research and evaluate emerging technologies. You stay current on published state-of-the-art methods, technologies, and applications and seek out opportunities to apply them.* Creative. You thrive on bringing definition to big, undefined problems. You love asking questions and pushing hard to find answers. You’re not afraid to share a new idea.* Technical. You’re comfortable with advanced ML and DL technologies including language models and are passionate about developing further. You have hands-on experience working with LLMs and solutions using open-source tools and cloud computing platforms.* Influential. You are passionate about AI/ML and can bring along a cross functional team in breakthrough innovations. You communicate clearly and effectively to share your findings with non-technical audiences.* You are experienced in training language models or large computer vision models as well as have expertise in one or more key subdomains such as: training optimization, self-supervised learning, explainability, RLHF.* You have an engineering mindset as shown by a track record of delivering models at scale both in training data and inference volumes. You have experience in delivering libraries, platforms, or solution level code to existing products.* Currently has, or is in the process of obtaining one of the following with an expectation that the required degree will be obtained on or before the scheduled start date:* At least 2 years of experience leveraging open source programming languages for large scale data analysis* At least 2 years of experience working with machine learning* At least 2 years of experience utilizing relational databases* A Bachelor's Degree in a quantitative field (Statistics, Economics, Operations Research, Analytics, Mathematics, Computer Science, or a related quantitative field) plus 7 years of experience performing data analytics* A Master's Degree in a quantitative field (Statistics, Economics, Operations Research, Analytics, Mathematics, Computer Science, or a related quantitative field)

or an MBA with a quantitative concentration plus 5 years of experience performing data analytics* A PhD in a quantitative field (Statistics, Economics, Operations Research, Analytics, Mathematics, Computer Science, or a related quantitative field) plus 2 years of experience performing data analytics* PhD in “STEM” field (Science, Technology, Engineering, or Mathematics) plus 4 years of experience in data analytics* At least 1 year of experience working with AWS* At least 1 year of experience managing people* At least 5 years’ experience in Python, Scala, or R for large scale data analysis* At least 5 years’ experience with machine learning* LLM

+ PhD focus on NLP or Masters with 5 years of industrial NLP research experience

+ Multiple publications on topics related to the pre-training of large language models (e.g. technical reports of pre-trained LLMs, SSL techniques, model pre-training optimization)

+ Member of team that has trained a large language model from scratch (10B + parameters, 500B+ tokens)

+ Publications in deep learning theory

+ Publications at ACL, NAACL and EMNLP, Neurips, ICML or ICLR* Finetuning

+ PhD focused on topics related to guiding LLMs with further tasks (Supervised Finetuning, Instruction-Tuning, Dialogue-Finetuning, Parameter Tuning)

+ Demonstrated knowledge of principles of transfer learning, model adaptation and model guidance

+ Experience deploying a fine-tuned large language modelCapital One offers a comprehensive, competitive, and inclusive set of health, financial and other benefits that support your total well-being. Learn more at the . Eligibility varies based on full or part-time status, exempt or non-exempt status, and management level. #J-18808-Ljbffr