Logo
Piper Companies

Data Scientist

Piper Companies, Annapolis, Maryland, United States, 21403

Save Job

Job Title: Senior Data Scientist - High-Performance Computing (HPC)

Location: Annapolis Junction

Salary: $150,000- $190,000 based on years of experience.

Clearance: TS/SCI with CI Poly Position Summary

We are seeking a highly skilled and innovative Senior Data Scientist to lead the design, development, and deployment of advanced data-driven applications within a High-Performance Computing (HPC) environment. This role requires deep expertise in data science, machine learning, and scalable computing to solve complex business challenges through creative and non-traditional analytical approaches. Key Responsibilities Data Analysis & Exploration Conduct exploratory data analysis (EDA) to uncover insights from large-scale, unstructured datasets. Apply advanced visualization techniques to summarize and communicate data characteristics. Exercise creativity in applying novel analytical methods to high-value use cases. Machine Learning & AI Research and implement cutting-edge machine learning algorithms for distributed computing. Develop predictive models for classification, regression, and clustering tasks. Evaluate and optimize model performance for scalability and accuracy. Application Development & Deployment Design and implement complex algorithms for data processing and analysis. Build prototypes and production-grade applications integrating data science techniques. Manage codebase using version control systems (e.g., Git) and ensure collaborative development. Deploy applications across HPC clusters, cloud platforms, or containerized environments (Docker, Kubernetes). Computational & Integration Skills Develop and maintain ETL pipelines and data integration workflows. Interpret and connect complex data sources to generate actionable insights. Collaborate with cross-functional teams to deliver data-driven solutions. Required Qualifications Bachelor's or Master's degree in Computer Science, Data Science, Statistics, or a related field. 5+ years of experience in data science, machine learning, or software development. Proficiency in programming languages such as Python, R, Java, or Scala. Experience with data manipulation libraries (e.g., Pandas, NumPy) and visualization tools (e.g., Matplotlib, Seaborn). Strong understanding of machine learning frameworks (e.g., scikit-learn, TensorFlow). Hands-on experience with HPC environments, cloud computing, and containerization technologies. Familiarity with version control systems and collaborative development practices. Preferred Qualifications Experience with distributed computing frameworks (e.g., Spark, Dask). Knowledge of data warehousing and pipeline orchestration tools. Background in scientific computing or large-scale simulation environments. Strong communication skills and ability to translate technical findings into business insights.