Sierra Digital Inc
Senior Databricks Data Engineer
HOUSTON TX - onsite - 100%
A
Senior Databricks Data Engineer
is responsible for designing, building, and optimizing scalable data pipelines and analytics solutions using
Databricks (Apache Spark)
on cloud platforms.
Responsibilities
Design and develop
end-to-end data pipelines
using Databricks (Spark, PySpark, SQL)
Build and manage
Delta Lake
architectures (Bronze–Silver–Gold layers)
Optimize Spark jobs for
performance, cost, and scalability
Ingest data from multiple sources (APIs, RDBMS, streaming, files)
Implement
ETL/ELT
processes using Databricks notebooks and workflows
Work with
large-scale structured and unstructured data
Collaborate with data scientists, analysts, and business teams
Ensure data quality, governance, and security best practices
Support CI/CD and version control (Git, Azure DevOps, GitHub)
Databricks
(core expertise)
Apache Spark
(PySpark, Spark SQL)
AWS (S3, Glue, Redshift)
GCP (BigQuery, GCS)
SQL (advanced)
and Python
Data modeling & performance tuning
Streaming frameworks (Kafka, Spark Structured Streaming – nice to have)
Experience Level
10+ years in
data engineering
3–5+ years of
hands-on Databricks/Spark experience
Experience in
cloud-native data platforms
For more details reach me at Sierra Digital, Inc. | 6001 Savoy Drive, Suite 210 | Houston, Texas 77036
Click here to view my LinkedIn #J-18808-Ljbffr
A
Senior Databricks Data Engineer
is responsible for designing, building, and optimizing scalable data pipelines and analytics solutions using
Databricks (Apache Spark)
on cloud platforms.
Responsibilities
Design and develop
end-to-end data pipelines
using Databricks (Spark, PySpark, SQL)
Build and manage
Delta Lake
architectures (Bronze–Silver–Gold layers)
Optimize Spark jobs for
performance, cost, and scalability
Ingest data from multiple sources (APIs, RDBMS, streaming, files)
Implement
ETL/ELT
processes using Databricks notebooks and workflows
Work with
large-scale structured and unstructured data
Collaborate with data scientists, analysts, and business teams
Ensure data quality, governance, and security best practices
Support CI/CD and version control (Git, Azure DevOps, GitHub)
Databricks
(core expertise)
Apache Spark
(PySpark, Spark SQL)
AWS (S3, Glue, Redshift)
GCP (BigQuery, GCS)
SQL (advanced)
and Python
Data modeling & performance tuning
Streaming frameworks (Kafka, Spark Structured Streaming – nice to have)
Experience Level
10+ years in
data engineering
3–5+ years of
hands-on Databricks/Spark experience
Experience in
cloud-native data platforms
For more details reach me at Sierra Digital, Inc. | 6001 Savoy Drive, Suite 210 | Houston, Texas 77036
Click here to view my LinkedIn #J-18808-Ljbffr