Baselayer

Data Scientist

Baselayer, San Francisco, California, United States, 94199

Trusted by 2,200+ financial institutions, Baselayer is the intelligent business identity platform that helps verify any business, automate KYB, and monitor real‑time risk. Baselayer’s B2B risk solutions & identity graph network leverage state & federal government filings and proprietary data sources to prevent fraud, accelerate onboarding, and lower credit losses.

About You

You want to learn from the best of the best, get your hands dirty, and put in the work to hit your full potential. You're not just doing it for the win; you're doing it because you have something to prove and want to be great. You are looking to be an impeccable data scientist working with cutting‑edge AI/ML technologies.

Qualifications

You have 1-3 years of experience in data science, working with Python and building data‑driven models

You're comfortable working with large‑scale datasets and enjoy extracting meaningful insights from complex information

You have a strong foundation in statistics and machine learning, with particular interest in applying advanced techniques to real‑world problems

You prioritize data integrity and ethical considerations, especially when working with sensitive information in regulated environments like KYC/KYB

You have a keen eye for detail and take pride in creating clear, interpretable analyses while optimizing for actionable insights

You thrive in a high‑trust, ownership‑focused environment and are comfortable translating between technical concepts and business objectives

You're a problem‑solver who navigates ambiguous data challenges confidently

You're a proactive self‑starter who thrives in dynamic settings

You're highly intelligent and analytical, and you take pride in your data‑driven solutions

You're highly feedback‑oriented; we believe in radical candor and in using feedback to get to the next level

Responsibilities

Data Analysis & Insight Generation: conduct comprehensive analyses of large datasets to extract actionable insights that drive business decisions in the GTM and identity verification space

Predictive Modeling: develop statistical and machine learning models to identify patterns, predict outcomes, and optimize processes for KYC/KYB workflows

Data Pipeline Development: collaborate with engineering teams to design and implement efficient data pipelines that transform raw data into useful features for analysis and modeling

AI/ML Application: work closely with ML engineers to apply advanced techniques including LLMs to solve specific business problems within the identity verification domain

Experiment Design & Evaluation: design and execute experiments to validate hypotheses, measure impact of changes, and continuously improve data science solutions

Data Visualization & Communication: create clear, compelling visualizations and presentations that effectively communicate complex findings to technical and non‑technical stakeholders

Feature Engineering: identify and develop innovative features from raw data sources to enhance model performance and enable new capabilities

Performance Monitoring: establish metrics and monitoring systems to track the effectiveness of data science solutions, identifying opportunities for refinement and optimization

Research & Innovation: stay current with the latest research and techniques in data science and machine learning, evaluating new approaches for potential application to business challenges

Benefits

Hybrid in SF – in office 3 days/week

Flexible PTO

Smart, genuine, ambitious team

Salary Range: $122k – $167k, plus equity of 0.05% – 0.25%

Seniority Level

Entry level

Employment type

Full‑time

Job function

Engineering and Information Technology

Industries

Technology, Information and Internet
