ATI
Senior Data Engineer, Physical Intelligence
ATI, Waltham, Massachusetts, United States, 02254
About ATI:
We are a cutting-edge Series A company focused on revolutionizing the automotive robotics space. The team consists of experienced robotics, industrial automation, and automotive professionals with deep relationships and a distribution network across the U.S. We have strategic backing from long-term capital and the support to build something truly amazing. Come join our team and help us define the future of automotive robotics!
Position Overview:
We are seeking a Senior Data Engineer to join our team and lead the development of the data systems that power our robotics platform, computer vision pipelines, and product insights. In this full-stack data role, you’ll design and build the end-to-end data infrastructure — from ingestion at the edge to analytics and machine learning pipelines in the cloud. You’ll work across the stack to turn raw sensor, vision, and operational data into high-quality, actionable intelligence that drives product decisions and enables next-generation automation.
Key Responsibilities:
Design, implement, and maintain end-to-end data pipelines that ingest data from robotic systems, sensors, cameras, APIs, and other sources into our data platform.
Build and scale both batch and real-time streaming systems, ensuring reliability, fault tolerance, and high throughput.
Develop and manage our data warehouse, and feature store, enabling downstream ML, analytics, and application use cases.
Define and enforce data contracts, schema standards, validation rules, and lineage tracking to ensure data integrity and trust.
Create transformation and modeling layers (e.g., using dbt, Spark, or equivalent) to deliver clean, well-structured data for engineers, analysts, and stakeholders.
Collaborate closely with computer vision, robotics, and ML teams to build pipelines that support training, evaluation, and continuous learning.
Instrument observability — build metrics, alerts, and dashboards on pipeline health, latency, data quality, and drift.
Contribute to architecture and roadmap decisions around the evolution of ATI’s data systems and infrastructure.
6+ years of experience as a data engineer, data infrastructure engineer, or similar role with a focus on end-to-end systems.
Strong programming skills in Python, Scala, or Java and deep experience with modern data frameworks (e.g., Spark, Flink, Beam).
Proven experience building streaming and batch pipelines using technologies like Kafka, Kinesis, or Pub/Sub.
Expertise in SQL and relational database design, including complex queries, performance tuning, and schema optimization.
Hands-on experience with data modeling, partitioning, indexing, and OLAP/OLTP tradeoffs.
Experience building and maintaining data lakes, warehouses, or feature stores for ML and analytics.
Familiarity with orchestration and workflow tools (e.g., Airflow, Dagster, Prefect).
Strong foundation in data validation, quality checks, schema evolution, and lineage tracking.
Track record of building robust observability into data systems — metrics, logging, and alerting.
Excellent communication and collaboration skills; comfortable partnering across engineering, product, and operations.
Preferred Qualifications:
Experience working with robotics, IoT, or computer vision data.
Knowledge of cloud platforms (AWS, GCP, or Azure) and containerized environments (Docker, Kubernetes).
Experience integrating data pipelines into ML workflows (training, serving, retraining).
Proven ability to set standards and build data engineering practices from the ground up in a fast-paced startup.
Why Join Us?
You will be data hire #1 and grow with the company. Build foundational data infrastructure that directly powers robotic automation, computer vision, and AI at scale. Play a high-impact, cross-functional role shaping how ATI uses data to deliver groundbreaking products. Competitive compensation, equity, and benefits, with the opportunity to grow as we scale.
#J-18808-Ljbffr
You will be data hire #1 and grow with the company. Build foundational data infrastructure that directly powers robotic automation, computer vision, and AI at scale. Play a high-impact, cross-functional role shaping how ATI uses data to deliver groundbreaking products. Competitive compensation, equity, and benefits, with the opportunity to grow as we scale.
#J-18808-Ljbffr