Analytica

Data Scientist

Analytica, Baltimore, Maryland, United States, 21276

Analytica is seeking a Data Scientist to support long-term federal client engagements in the DC Metro area. The role will apply statistical programming, modeling, visualization techniques, data mining, and forecasting skills to analyze challenging public sector problems.

This position is fully remote.

Analytica has been recognized by Inc. for 3 consecutive years as one of the 250 fastest-growing businesses. We offer competitive compensation with opportunities for bonuses, employer-paid health care, training and development funds, and a 401(k) match.

Responsibilities include:

Pre-processing – demonstrate the skills and experience to collect, clean, and prepare data sets for input into a computational model using Python. Strong candidates explain various methods such as stop word removal, stemming, lemmatization, and tokenization.

Feature Engineering and Attribute Evaluation – candidate must demonstrate experience with NLP feature engineering methods such as TF‑IDF, word2vec, GloVe, and FastText, identifying key determinants for modeling that exist in the business process and existing data sets, and selecting evaluation protocols.

Modeling – candidates will have practiced selecting classification modeling techniques to fit the business problem. Examples include supervised and unsupervised machine learning, regression, neural networks and deep learning, and natural language processing.

Validation – strong candidates describe their experience with investigating, reporting, and justifying model results.

Visualization – experience presenting results of modeling activities, depicting the insights realized, and explaining relevance to the organization’s business challenges.

Qualifications:

Master's degree required (PhD preferred) in Statistics, Mathematics, Computer Science, or a similar field.

High degree of experience utilizing SAS, R, or Python to support NLP use cases such as Document Summarization, Named Entity Recognition, Sentiment Analysis, and/or Topic Modeling.

At least four years of experience developing scalable, production-ready NLP solutions using scikit‑learn, Keras, TensorFlow, PyTorch, Spark NLP.

Experience using Git/GitHub to version control source code.

Experience leveraging transformer architecture to develop NLP models.

Experience with open-source NLP packages such as Gensim, spaCy, or NLTK.

Experience with BERT, GPT‑J, RoBERTa, T5 or other transformers.

Experience with GenAI and Prompt Engineering is a plus.

Experience with Databricks and MLflow is a plus.

Experience with machine translation and transcription of foreign language documents using Microsoft Azure translation services is a plus.

Experience working in an AWS cloud environment and with related AWS services such as Bedrock and Textract.

Experience coordinating and maintaining user stories.

Must be a US citizen.

Must be able to obtain and maintain a Public Trust security clearance.

About Analytica

Analytica is a leading consulting and information technology solutions provider to public sector organizations supporting health, civilian, and national security missions. The company is an award‑winning SBA certified 8(a) small business that has been recognized by Inc. Magazine for the past three years as one of the 250 fastest‑growing companies in the U.S. Analytica specializes in providing software and systems engineering, information management, analytics & visualization, agile project management, and management consulting services. The company is appraised by the Software Engineering Institute at CMMI® Maturity Level 3 and is an ISO 9001:2008 certified provider.

Seniority level: Mid‑Senior level

Employment type: Full‑time

Job function: Information Technology

Industries: IT Services and IT Consulting, Government Relations Services

Benefits

Medical insurance

Vision insurance

401(k)

Tuition assistance

Paid maternity leave