Headspace
Staff Data Engineer- Data Architect San Francisco - Hybrid
Headspace, Snowflake, Arizona, United States, 85937
Overview
Were looking for an experienced Data Architect who can also operate hands-on as a Staff Data Engineer to Headspace. You will design and evolve our domain-based Enterprise Data Model (EDM), lead Master Data Management (MDM) initiatives, and build production-grade data pipelines in Python / PySpark. The ideal candidate is comfortable whiteboarding conceptual models, building and reviewing ETL jobs, and coaching engineering teams on data architecture best practices. Location: Hybrid in San Francisco, working 3 days per week from the office. Responsibilities Lead the development of scalable data infrastructure: architecture and implementation of PySpark data pipelines to ingest and transform diverse datasets into the data lake in a fault-tolerant, robust system. Set design patterns: drive the creation and enforcement of standard conventions in code, architecture, schema design, and table design. Architect world-class data platforms: design and lead the evolution of secure, compliant, and privacy-forward data warehousing platforms for healthcare data. Strategic collaboration for business insights: partner with analytics, product, and engineering leaders to ensure the data ecosystem provides actionable and reliable insights into critical business metrics. Champion data-driven leadership: mentor other members of the DE and broader data team, particularly around dbt architecture and query performance, and foster a data-first culture across teams. Influence organizational strategy: act as a technical thought leader, shaping the companys data strategy and influencing cross-functional roadmaps with data-centric solutions.
Qualifications
7+ years in data engineering / architecture, with 2+ years leading EDM/MDM programs and a proven track record of leading high-impact initiatives at scale. Proven ability to create and maintain domain-based enterprise data models (canonical, hub-and-spoke, data-product-oriented). Deep expertise with data-modeling tools (ERwin, ER/Studio, PowerDesigner, or equivalent) and modeling techniques (3NF, Dimensional, Data Vault, Anchor). Production experience writing performant Python and PySpark code on distributed compute (Spark 3+, Delta Lake). Strong SQL skills across columnar and relational engines (e.g., Snowflake, Redshift, Databricks SQL, Postgres). Solid grasp of data-governance practices: lineage, glossaries, PII/PHI controls, and data-quality frameworks. Ability to articulate architecture choices to both executive stakeholders and hands-on engineers. Deep experience designing and optimizing real-time and batch ETL pipelines (preferably within dbt), employing best practices for scalability and reliability. Systems thinker who can balance near-term delivery with long-term architecture vision. Comfortable in highly collaborative, agile environments; able to mentor cross-functional teams. Excellent written and verbal communication; able to translate complex data topics into plain language. Bias for automation, documentation, and continuous improvement.
Nice-To-Haves
Hands-on with Databricks platform (Unity Catalog, Delta Live Tables, MLflow). dbt Core for transformation, tests, and metadata; dbt Semantic Layer experience is a plus. Exposure to event streaming (Kafka, EventHub) and CDC tools. Experience integrating with commercial MDM suites or building custom match-merge solutions. Familiarity with cloud data-platform services on AWS (Terraform). Background in data-privacy standards (GDPR, CCPA, HIPAA) and differential-privacy or tokenization techniques.
Salary and Benefits
The anticipated base salary range for this full-time position is $140,400$224,250 plus equity and benefits. Our salary ranges reflect location and market conditions. The final compensation is determined by factors including location, relevant experience, skills, and education. Your recruiter will share the precise range for your location during the hiring process. Headspace offers a comprehensive Total Rewards package including base salary, stock awards, healthcare, wellness stipend, retirement match, and more. Details will be provided during the recruitment process. Equal Opportunity and Inclusion
Headspace is an equal opportunity employer. We do not discriminate on protected characteristics and are committed to building a diverse and inclusive workforce. If you require reasonable accommodation or have questions about the application process, please contact our Talent team. #J-18808-Ljbffr
Were looking for an experienced Data Architect who can also operate hands-on as a Staff Data Engineer to Headspace. You will design and evolve our domain-based Enterprise Data Model (EDM), lead Master Data Management (MDM) initiatives, and build production-grade data pipelines in Python / PySpark. The ideal candidate is comfortable whiteboarding conceptual models, building and reviewing ETL jobs, and coaching engineering teams on data architecture best practices. Location: Hybrid in San Francisco, working 3 days per week from the office. Responsibilities Lead the development of scalable data infrastructure: architecture and implementation of PySpark data pipelines to ingest and transform diverse datasets into the data lake in a fault-tolerant, robust system. Set design patterns: drive the creation and enforcement of standard conventions in code, architecture, schema design, and table design. Architect world-class data platforms: design and lead the evolution of secure, compliant, and privacy-forward data warehousing platforms for healthcare data. Strategic collaboration for business insights: partner with analytics, product, and engineering leaders to ensure the data ecosystem provides actionable and reliable insights into critical business metrics. Champion data-driven leadership: mentor other members of the DE and broader data team, particularly around dbt architecture and query performance, and foster a data-first culture across teams. Influence organizational strategy: act as a technical thought leader, shaping the companys data strategy and influencing cross-functional roadmaps with data-centric solutions.
Qualifications
7+ years in data engineering / architecture, with 2+ years leading EDM/MDM programs and a proven track record of leading high-impact initiatives at scale. Proven ability to create and maintain domain-based enterprise data models (canonical, hub-and-spoke, data-product-oriented). Deep expertise with data-modeling tools (ERwin, ER/Studio, PowerDesigner, or equivalent) and modeling techniques (3NF, Dimensional, Data Vault, Anchor). Production experience writing performant Python and PySpark code on distributed compute (Spark 3+, Delta Lake). Strong SQL skills across columnar and relational engines (e.g., Snowflake, Redshift, Databricks SQL, Postgres). Solid grasp of data-governance practices: lineage, glossaries, PII/PHI controls, and data-quality frameworks. Ability to articulate architecture choices to both executive stakeholders and hands-on engineers. Deep experience designing and optimizing real-time and batch ETL pipelines (preferably within dbt), employing best practices for scalability and reliability. Systems thinker who can balance near-term delivery with long-term architecture vision. Comfortable in highly collaborative, agile environments; able to mentor cross-functional teams. Excellent written and verbal communication; able to translate complex data topics into plain language. Bias for automation, documentation, and continuous improvement.
Nice-To-Haves
Hands-on with Databricks platform (Unity Catalog, Delta Live Tables, MLflow). dbt Core for transformation, tests, and metadata; dbt Semantic Layer experience is a plus. Exposure to event streaming (Kafka, EventHub) and CDC tools. Experience integrating with commercial MDM suites or building custom match-merge solutions. Familiarity with cloud data-platform services on AWS (Terraform). Background in data-privacy standards (GDPR, CCPA, HIPAA) and differential-privacy or tokenization techniques.
Salary and Benefits
The anticipated base salary range for this full-time position is $140,400$224,250 plus equity and benefits. Our salary ranges reflect location and market conditions. The final compensation is determined by factors including location, relevant experience, skills, and education. Your recruiter will share the precise range for your location during the hiring process. Headspace offers a comprehensive Total Rewards package including base salary, stock awards, healthcare, wellness stipend, retirement match, and more. Details will be provided during the recruitment process. Equal Opportunity and Inclusion
Headspace is an equal opportunity employer. We do not discriminate on protected characteristics and are committed to building a diverse and inclusive workforce. If you require reasonable accommodation or have questions about the application process, please contact our Talent team. #J-18808-Ljbffr