Logo
TEKsystems

Data Scientist

TEKsystems, Cary, North Carolina, United States, 27518

Save Job

Skills

5-10+ years of experience in AI and machine learning, model building, evaluation, and fine-tuning.

2+ years of working knowledge of applying recent LLMs including ChatGPT, GPT 3.5, OPT, BLOOM, etc. UTILIZING RAG!

Experience working directly with large language models and Transformer based architectures including BERT, RoBERTa, T5 etc.

NLP Experience (Named Entity Recognition, Sentiment Analysis, Text Summarization)

Experience with conversational search / semantic search, reinforcement learning, prompt engineering, hallucination mitigation

DevOps repos Debugging, building APIs and managing the algorithm flow across multiple workstreams in one repo

Senior level experience deploying models in the Cloud (AWS) or Azure as secondary.

Description TEKsystems is partnered with a software company in Raleigh that needs to hire a Senior Data Scientist for their flagship product, LexisNexis® Legal & Professional, a leading global provider of information and analytics. Recently LexisNexis has focused on the general availability of Lexis+ AI™ for U.S. customers, a generative AI solution designed to transform legal work. Lexis+ AI delivers trusted results in a familiar, easy-to-use interface with linked hallucination‑free legal citations that combine the power of generative AI with proprietary LexisNexis search technology, Shepard’s® Citations functionality, and authoritative content. The Senior Data Scientist will focus on search and be dedicated to the creation of next‑generation AI and Machine Learning techniques and strategies for LexisNexis in their global expansion. This candidate will assist with deploying ethical, powerful generative AI solutions with a flexible, multi‑model approach that prioritizes using the best model for each individual legal use case. This approach includes working with large language models like Anthropic’s Claude 2, hosted on Amazon Bedrock from Amazon Web Services (AWS), and OpenAI’s GPT‑4 and ChatGPT, hosted on Microsoft Azure.

Accountabilities

Solve some of the most challenging problems in natural language processing, machine learning, and information retrieval including topical classification, sentiment analysis, user intent detection.

Research, build, and deploy models based on both shallow and deep machine learning. Train robust NLP‑based models a very large corpus of news and financial data.

Apply machine learning techniques for improving search algorithms.

Drive best practices for NLP/Machine Learning pipelines.

Maintain current knowledge base of state‑of‑the‑art ML algorithms (BERT, ELMo, GPT, etc.), API's, and open‑source methods and be able to quickly evaluate alternatives.

Translate complex business requirements into actionable stories with reasonable time estimates.

Work with product leaders to apply data science solutions.

Qualifications

Strong coding skills in Python 7+ years

Be a natural problem solver, able to take a lead in collaborating to resolve issues

Have communication skills

4+ years of experience in AI and machine learning

Deep understanding of machine learning algorithms, classification models, diagnostic testing of models

Experience working directly and Transformer based architectures including BERT, RoBERTa, T5 etc. Nd familiarity with large language models and fine tuning

Experience with conversational search / semantic search, reinforcement learning, prompt engineering, hallucination mitigation

Working understanding of the business risks associated with applying LLM (LangChain) in a business

Experience working with AWS, RAG, SageMaker, SQL

#J-18808-Ljbffr