ZipRecruiter
Job Description Analytica is seeking a remote Senior Data Scientist - GenAI to support long-term federal client engagements in financial regulatory or health projects in the DC Metro area. The role will apply statistical programming, modeling, visualization techniques, data mining, and forecasting skills to analyze challenging public sector problems. Our company has been recognized by Inc. Magazine as one of the fastest-growing 250 businesses in the US for 3 years. We work with U.S. government clients in health, civilian, and security missions to build better technology products that impact our daily lives. We offer competitive compensation, bonuses, employer-paid health care, training and development funds, and a 401k match. Responsibilities include: Pre-processing - Collect, clean, and prepare data sets for modeling using Python, SAS, or R. Experience with methods like stop word removal, stemming, lemmatization, and tokenization is required. Feature Engineering and Attribute Evaluation - Use NLP feature engineering methods such as TF-IDF, word2vec, GloVe, and FastText to identify key determinants in data and select appropriate evaluation protocols. Modeling - Select and apply modeling techniques like supervised and unsupervised machine learning, regression, neural networks, and deep learning. Validation - Investigate, report, and justify model results. Visualization - Present modeling results, depict insights, and explain their relevance to business challenges. Qualifications: Master's degree required; PhD in Statistics, Mathematics, Computer Science, or related field preferred. Extensive experience with Python for NLP tasks such as Document Summarization, Named Entity Recognition, Sentiment Analysis, and Topic Modeling. Competency in computer vision is required. Experience with multi-modal GenAI/LLMs and prompt engineering techniques. At least four years developing scalable NLP solutions using tools like scikit-learn, Keras, TensorFlow, PyTorch, Spark NLP. Experience with transformer architectures (BERT, GPT-J, RoBERTa, T5, etc.). Familiarity with open-source NLP packages such as Gensim, SpaCy, NLTK. Experience with Databricks and cloud environments, specifically AWS. Ability to coordinate and maintain user stories. Must be a US citizen and able to obtain and maintain a Public Trust security clearance. About Analytica: Analytica provides software and IT solutions to public sector organizations supporting health, civilian, and security missions. Recognized by Inc. Magazine as one of the fastest-growing companies, we specialize in software engineering, data analytics, visualization, agile project management, and consulting. We are committed to equal employment opportunities and comply with all applicable laws. Ensure email communication from us is from the domain . Powered by JazzHR 0T9uopoDIM #J-18808-Ljbffr