Hatch
HATCH
https://www.usehatchapp.com/
Senior Data Engineer
MUST BE BASED IN NYC
No Relocation
Cannot Sponsor
About Hatch
At Hatch, we're building AI that doesn't just assist behind the scenes; it converses with customers out in the wild. Backed by Y Combinator and top-tier investors like Bessemer and NextView, we're scaling fast, doubling revenue year over year, and looking for A players to help us cement our place as the category leader in AI for customer engagement.
About the Role
We are looking for a skilled, software engineering focused
Senior Data Engineer
to join our growing data team. You will be responsible for building, optimizing, and maintaining data pipelines and architectures that support our analytics, reporting, and AI initiatives. The ideal candidate is an experienced software engineer, capable with data analytics, data warehousing, and working with large-scale data processing systems.
Our data volume and the sophistication of our AI models are growing rapidly. To keep pace we need seniorlevel engineers who can treat
data systems as production software , not just analytics plumbing. If you love writing robust code, designing for scale, and operating distributed systems in production, let's talk.
Note:
This is not a businessintelligence or analyst role. Candidates whose primary experience is building reports or dashboards will not be successful here.
Key Responsibilities Design, build, and
own
scalable batch and realtime data pipelines (Kafka/Flink/Spark, Airflow/Temporal/DBT). Develop productionquality services in
Python or Go , with comprehensive tests, code reviews, CI/CD, and observability. Model, partition, and tune datasets in data lakes & warehouses (Snowflake, BigQuery, Redshift), balancing performance, cost, and governance. Collaborate with backend engineers to define
data contracts
and streaming interfaces between services. Drive infrastructureascode (Terraform/Pulumi) and container orchestration (Docker/Kubernetes/EKS) for the data platform. Establish and monitor SLOs for data quality, latency, and availability; debug incidents across the stack. Mentor engineers, lead technical design reviews, and raise the engineering bar through clear documentation and knowledge sharing. What We're Looking For 5+ years
combined
software engineering + data engineering
experience, including 3+ years building production services in Go or Python,. Proven ability to write performant, maintainable code in large codebases-beyond SQL and lowcode ETL tools. Deep understanding of computerscience fundamentals: data structures, algorithms, concurrency, networking, and distributedsystems concepts. Expertise with at least two of the following distributed data technologies:
Kafka/Kinesis, Spark/Flink, ClickHouse/Trino, Iceberg/Hudi, Redis/MongoDB . Handson experience instrumenting, monitoring, and troubleshooting services in
AWS or GCP
(CloudWatch, Datadog, Prometheus/Grafana, etc.). Strong SQL skills and practical knowledge of dimensional and eventdriven data modeling. Familiarity with containerization, CI/CD pipelines, and bluegreen or canary deployments. Excellent written and verbal communication-you explain complex systems clearly and bring others along. Nice to Have Experience supporting ML & LLM inference pipelines in production (vector DBs, feature stores, prompt engineering). Exposure to eventdriven microservices, protobuf/Avro schemas, and schemaregistry governance. Prior success in a fastgrowing startup environment where you wore multiple hats. What We Offer Competitive salary and equity Remote (Eastern or Central Time Zone required) OR Hybrid work environment (3 days/week in our NYC office) Medical, dental, and vision benefits 401(k) plan Flexible PTO Opportunity to build at the ground floor of a high-growth, mission-driven company Not offering sponsorship Why Hatch Shape the future of AI-driven customer service Build alongside founders and leaders who value speed, ownership, and ambition Solve hard problems that impact real businesses and customers Join a team of builders who care about great engineering, fast execution, and each other
https://www.usehatchapp.com/
Senior Data Engineer
MUST BE BASED IN NYC
No Relocation
Cannot Sponsor
About Hatch
At Hatch, we're building AI that doesn't just assist behind the scenes; it converses with customers out in the wild. Backed by Y Combinator and top-tier investors like Bessemer and NextView, we're scaling fast, doubling revenue year over year, and looking for A players to help us cement our place as the category leader in AI for customer engagement.
About the Role
We are looking for a skilled, software engineering focused
Senior Data Engineer
to join our growing data team. You will be responsible for building, optimizing, and maintaining data pipelines and architectures that support our analytics, reporting, and AI initiatives. The ideal candidate is an experienced software engineer, capable with data analytics, data warehousing, and working with large-scale data processing systems.
Our data volume and the sophistication of our AI models are growing rapidly. To keep pace we need seniorlevel engineers who can treat
data systems as production software , not just analytics plumbing. If you love writing robust code, designing for scale, and operating distributed systems in production, let's talk.
Note:
This is not a businessintelligence or analyst role. Candidates whose primary experience is building reports or dashboards will not be successful here.
Key Responsibilities Design, build, and
own
scalable batch and realtime data pipelines (Kafka/Flink/Spark, Airflow/Temporal/DBT). Develop productionquality services in
Python or Go , with comprehensive tests, code reviews, CI/CD, and observability. Model, partition, and tune datasets in data lakes & warehouses (Snowflake, BigQuery, Redshift), balancing performance, cost, and governance. Collaborate with backend engineers to define
data contracts
and streaming interfaces between services. Drive infrastructureascode (Terraform/Pulumi) and container orchestration (Docker/Kubernetes/EKS) for the data platform. Establish and monitor SLOs for data quality, latency, and availability; debug incidents across the stack. Mentor engineers, lead technical design reviews, and raise the engineering bar through clear documentation and knowledge sharing. What We're Looking For 5+ years
combined
software engineering + data engineering
experience, including 3+ years building production services in Go or Python,. Proven ability to write performant, maintainable code in large codebases-beyond SQL and lowcode ETL tools. Deep understanding of computerscience fundamentals: data structures, algorithms, concurrency, networking, and distributedsystems concepts. Expertise with at least two of the following distributed data technologies:
Kafka/Kinesis, Spark/Flink, ClickHouse/Trino, Iceberg/Hudi, Redis/MongoDB . Handson experience instrumenting, monitoring, and troubleshooting services in
AWS or GCP
(CloudWatch, Datadog, Prometheus/Grafana, etc.). Strong SQL skills and practical knowledge of dimensional and eventdriven data modeling. Familiarity with containerization, CI/CD pipelines, and bluegreen or canary deployments. Excellent written and verbal communication-you explain complex systems clearly and bring others along. Nice to Have Experience supporting ML & LLM inference pipelines in production (vector DBs, feature stores, prompt engineering). Exposure to eventdriven microservices, protobuf/Avro schemas, and schemaregistry governance. Prior success in a fastgrowing startup environment where you wore multiple hats. What We Offer Competitive salary and equity Remote (Eastern or Central Time Zone required) OR Hybrid work environment (3 days/week in our NYC office) Medical, dental, and vision benefits 401(k) plan Flexible PTO Opportunity to build at the ground floor of a high-growth, mission-driven company Not offering sponsorship Why Hatch Shape the future of AI-driven customer service Build alongside founders and leaders who value speed, ownership, and ambition Solve hard problems that impact real businesses and customers Join a team of builders who care about great engineering, fast execution, and each other