Wisconsin Staffing

Data Scientist, RWD & NLP

Wisconsin Staffing, Madison, Wisconsin, US, 53774

Norstella, a global pharma intelligence solution provider, is seeking a skilled NLP Data Scientist with a clinical background and a focus on language models to join our AI & Life Sciences Solutions team. Your expertise in processing and understanding natural language data, along with your knowledge of Electronic Health Records (EHR) and laboratory report analysis, will be instrumental in driving our data science initiatives and innovations, particularly the development of rich multimodal real-world datasets to expedite RWD-driven drug development in pharma.

Responsibilities:
Apply NLP and open-source Large Language Models (LLMs) such as Llama 2, Mixtral, Qwen, and BERT to extract, process, and interpret unstructured medical data from diverse sources such as EHRs, medical notes, and laboratory reports.
Collaborate with clinical scientists and data scientists to create efficient NLP models for healthcare, demonstrating an understanding of both the technical and medical aspects of the data.
Conduct data cleaning, preprocessing, and validation to maintain the accuracy and reliability of insights gathered from NLP processes.
Validate and present data findings to stakeholders, communicating clearly and effectively.

Qualifications:
Master's or Ph.D. degree in Computational Biology, Computer Science, Data Science, Computational Linguistics, Machine Learning, or a related analytical field.
Deep understanding and direct experience (2+ years) in handling and interpreting either Electronic Health Records (EHR) and laboratory test results, or genetic test results.
Proven experience (2+ years) in NLP, with strong knowledge of NLP techniques such as Named Entity Recognition (NER), text summarization, and topic modeling, and their applied use in healthcare.
Expert-level understanding and practical experience (1+ years) with open-source Large Language Models (Llama 2/3, Mixtral, etc.), including prompt engineering, inference, and fine-tuning.
Proficiency in Python and SQL, with strong experience in NLP libraries such as NLTK, spaCy, and Hugging Face Transformers, and deep learning libraries such as PyTorch and TensorFlow.
Familiarity with common data science and ML practices, e.g., version control systems, agile methodologies, and documentation.
Experience working in the AWS cloud environment and with large databases (e.g., AWS Redshift).
Experience managing the ML lifecycle using open-source tools (e.g., MLflow).
Detail-oriented, with strong analytical and problem-solving abilities.
Excellent verbal and written communication skills, with the ability to present complex data to a non-technical audience.

Preferred Qualifications:
Experience dealing with protected health information (PHI) and familiarity with healthcare-related data privacy laws such as HIPAA.
Familiarity with standard healthcare codes and terminologies such as ICD-10, CPT, LOINC, and SNOMED CT.
Experience with RAG (Retrieval-Augmented Generation) and vector stores for storing and querying large volumes of unstructured healthcare documents.

Please Note: All candidates must be authorized to work in the United States. We do not provide visa sponsorship or transfers. We are not currently accepting candidates who are on an OPT visa.

Benefits:
Medical and prescription drug benefits
Health savings accounts or flexible spending accounts
Dental plans and vision benefits
Basic life and AD&D benefits
401(k) retirement plan
Short- and long-term disability
Paid parental leave
Open vacation policy

The expected base salary for this position ranges from $135,000 to $145,000. It is not typical for offers to be made at or near the top of the range. Salary offers are based on a wide range of factors including relevant skills, training, experience, education, and, where applicable, licensure or certifications obtained. Market and organizational factors are also considered. In addition to base salary and a competitive benefits package, successful candidates are eligible to receive a discretionary bonus.

Norstella is an equal opportunity employer and does not discriminate on the grounds of gender, sexual orientation, marital or civil partner status, pregnancy or maternity, gender reassignment, race, color, nationality, ethnic or national origin, religion or belief, disability or age. Our ethos is to respect and value people's differences and to help everyone achieve more at work as well as in their personal lives, so that they feel proud of the part they play in our success. We believe that all decisions about people at work should be based on the individual's abilities, skills, performance and behavior and our business requirements. Norstella operates a zero-tolerance policy toward any form of discrimination, abuse or harassment.