Q-Cells
Senior Data Engineer, Industrial IoT
Q-Cells, San Francisco, California, United States, 94199
Description
POSITION DESCRIPTION:
We are looking for a Senior Data Engineer specializing in Industrial IoT to architect and build the data foundation for our next-generation data center energy management platform. The ideal candidate will have deep expertise in designing data pipelines for time-series IoT data, building semantic data models, and creating scalable data integration frameworks that transform raw industrial telemetry into analytics-ready datasets. You will own the entire data lifecycle from ingestion to semantic enrichment, working at the intersection of operational technology and modern data platforms. This position is remote and offers the opportunity to define how critical infrastructure data is collected, modeled, and consumed across AI/ML applications. RESPONSIBILITIES Design and implement end-to-end data pipelines ingesting millions of data points per minute from industrial IoT devices Build semantic data models and metadata management systems that add business context to raw telemetry streams Create data transformation frameworks that normalize heterogeneous device data into unified schemas Develop automated data quality validation, anomaly detection, and data lineage tracking systems Design and maintain a centralized device metadata repository and configuration management database Implement streaming ETL pipelines using Apache Kafka, Spark Streaming, or cloud-native services Build time-series data storage solutions optimized for both real-time analytics and historical analysis Create data APIs and access patterns supporting both operational dashboards and ML model training Partner with ML engineers to ensure data pipelines meet feature engineering requirements Establish data governance practices including schema versioning, backwards compatibility, and change management Collaborate with field engineers to understand device characteristics and improve data collection strategies REQUIRED QUALIFICATIONS Bachelor's degree in Computer Science, Data Science, Information Systems, or related field 10+ years in data engineering with 5+ years focused on IoT, time-series, or streaming data platforms Expert-level experience with streaming data technologies (Kafka, Pulsar, Kinesis, Event Hubs) Strong expertise in time-series databases (InfluxDB, TimescaleDB, Azure Data Explorer, AWS Timestream) Proficiency in Python or Scala for data pipeline development and SQL for complex data transformations Experience with schema management, data catalogs, and metadata-driven architectures Deep understanding of data modeling for analytical workloads including dimensional modeling and data vault Hands-on experience with cloud data platforms (Azure Synapse, AWS Glue, Databricks, Snowflake) Knowledge of data formats and serialization (Parquet, Avro, Protocol Buffers, JSON Schema) Experience building real-time data quality monitoring and alerting systems Travel may be required up to 10% for architecture reviews and stakeholder alignment PREFERRED QUALIFICATIONS Solar Industry experience (Renewable) Experience with industrial protocols (OPC-UA, Modbus, MQTT) and IoT data standards Knowledge of semantic web technologies (RDF, OWL, knowledge graphs, ontologies) Familiarity with OpenTelemetry, Prometheus, or other observability data formats Experience in energy, utilities, manufacturing, or critical infrastructure domains Understanding of edge computing and distributed data processing architectures Experience with data mesh or data fabric architectural patterns Knowledge of ML feature stores and data preparation for AI/ML pipelines Exposure to graph databases (Neo4j, Amazon Neptune, Azure Cosmos DB Gremlin) Experience with regulatory compliance for data handling (SOC 2, NERC CIP) Certifications in cloud data platforms (Azure Data Engineer, AWS Data Analytics)
Hanwha Q CELLS Technologies, Inc. a subsidiary of Hanwha Q CELLS, one of the world's largest and most recognized photovoltaic manufacturers for its high-performance, high-quality solar cells and modules. It is headquartered in Seoul, South Korea (Global Executive HQ) Talheim, Germany (Technology & Innovation HQ) and Santa Clara, CA, USA (HW and SW Product Development HQ). Through its growing global business network spanning Europe, North America, Asia, South America, Africa, and the Middle East, the company provides excellent services and long-term partnerships to its customers in the utility, commercial, government, and residential markets. Hanwha Q CELLS is a flagship company of Hanwha Group, a FORTUNE Global 500 firm and a Top 7 business enterprise in South Korea.
PHYSICAL, MENTAL & ENVIRONMENTAL DEMANDS:
To comply with the Rehabilitation Act of 1973 the essential physical, mental and environmental requirements for this job are listed below. These are requirements normally expected to perform regular job duties. Incumbent must be able to successfully perform all of the functions of the job with or without reasonable accommodation.
Mobility
Standing
20% of time
Sitting
70% of time
Walking
10% of time
Strength
Pulling
up to 10 Pounds
Pushing
up to 10 Pounds
Carrying
up to 10 Pounds
Lifting
up to 10 Pounds
Dexterity
(F = Frequently, O = Occasionally, N = Never)
Typing
F
Handling
F
Reaching
F
Agility
(F = Frequently, O = Occasionally, N = Never)
Turning
F
Twisting
F
Bending
O
Crouching
O
Balancing
N
Climbing
N
Crawling
N
Kneeling
N
The salary range is required by the California Pay Transparency Act and may differ depending on the location of those candidates hired nationwide. Actual compensation is influenced by a wide array of factors including but not limited to, skill set, education, licenses and certifications, essential job duties and requirements, and the necessary experience relative to the job's minimum qualifications.
*This target salary range is for CA positions only and should not be interpreted as an offer of compensation. You may view your privacy rights by reviewing Qcells' Privacy Policy or by contacting our HR team for a copy.
We are looking for a Senior Data Engineer specializing in Industrial IoT to architect and build the data foundation for our next-generation data center energy management platform. The ideal candidate will have deep expertise in designing data pipelines for time-series IoT data, building semantic data models, and creating scalable data integration frameworks that transform raw industrial telemetry into analytics-ready datasets. You will own the entire data lifecycle from ingestion to semantic enrichment, working at the intersection of operational technology and modern data platforms. This position is remote and offers the opportunity to define how critical infrastructure data is collected, modeled, and consumed across AI/ML applications. RESPONSIBILITIES Design and implement end-to-end data pipelines ingesting millions of data points per minute from industrial IoT devices Build semantic data models and metadata management systems that add business context to raw telemetry streams Create data transformation frameworks that normalize heterogeneous device data into unified schemas Develop automated data quality validation, anomaly detection, and data lineage tracking systems Design and maintain a centralized device metadata repository and configuration management database Implement streaming ETL pipelines using Apache Kafka, Spark Streaming, or cloud-native services Build time-series data storage solutions optimized for both real-time analytics and historical analysis Create data APIs and access patterns supporting both operational dashboards and ML model training Partner with ML engineers to ensure data pipelines meet feature engineering requirements Establish data governance practices including schema versioning, backwards compatibility, and change management Collaborate with field engineers to understand device characteristics and improve data collection strategies REQUIRED QUALIFICATIONS Bachelor's degree in Computer Science, Data Science, Information Systems, or related field 10+ years in data engineering with 5+ years focused on IoT, time-series, or streaming data platforms Expert-level experience with streaming data technologies (Kafka, Pulsar, Kinesis, Event Hubs) Strong expertise in time-series databases (InfluxDB, TimescaleDB, Azure Data Explorer, AWS Timestream) Proficiency in Python or Scala for data pipeline development and SQL for complex data transformations Experience with schema management, data catalogs, and metadata-driven architectures Deep understanding of data modeling for analytical workloads including dimensional modeling and data vault Hands-on experience with cloud data platforms (Azure Synapse, AWS Glue, Databricks, Snowflake) Knowledge of data formats and serialization (Parquet, Avro, Protocol Buffers, JSON Schema) Experience building real-time data quality monitoring and alerting systems Travel may be required up to 10% for architecture reviews and stakeholder alignment PREFERRED QUALIFICATIONS Solar Industry experience (Renewable) Experience with industrial protocols (OPC-UA, Modbus, MQTT) and IoT data standards Knowledge of semantic web technologies (RDF, OWL, knowledge graphs, ontologies) Familiarity with OpenTelemetry, Prometheus, or other observability data formats Experience in energy, utilities, manufacturing, or critical infrastructure domains Understanding of edge computing and distributed data processing architectures Experience with data mesh or data fabric architectural patterns Knowledge of ML feature stores and data preparation for AI/ML pipelines Exposure to graph databases (Neo4j, Amazon Neptune, Azure Cosmos DB Gremlin) Experience with regulatory compliance for data handling (SOC 2, NERC CIP) Certifications in cloud data platforms (Azure Data Engineer, AWS Data Analytics)
Hanwha Q CELLS Technologies, Inc. a subsidiary of Hanwha Q CELLS, one of the world's largest and most recognized photovoltaic manufacturers for its high-performance, high-quality solar cells and modules. It is headquartered in Seoul, South Korea (Global Executive HQ) Talheim, Germany (Technology & Innovation HQ) and Santa Clara, CA, USA (HW and SW Product Development HQ). Through its growing global business network spanning Europe, North America, Asia, South America, Africa, and the Middle East, the company provides excellent services and long-term partnerships to its customers in the utility, commercial, government, and residential markets. Hanwha Q CELLS is a flagship company of Hanwha Group, a FORTUNE Global 500 firm and a Top 7 business enterprise in South Korea.
PHYSICAL, MENTAL & ENVIRONMENTAL DEMANDS:
To comply with the Rehabilitation Act of 1973 the essential physical, mental and environmental requirements for this job are listed below. These are requirements normally expected to perform regular job duties. Incumbent must be able to successfully perform all of the functions of the job with or without reasonable accommodation.
Mobility
Standing
20% of time
Sitting
70% of time
Walking
10% of time
Strength
Pulling
up to 10 Pounds
Pushing
up to 10 Pounds
Carrying
up to 10 Pounds
Lifting
up to 10 Pounds
Dexterity
(F = Frequently, O = Occasionally, N = Never)
Typing
F
Handling
F
Reaching
F
Agility
(F = Frequently, O = Occasionally, N = Never)
Turning
F
Twisting
F
Bending
O
Crouching
O
Balancing
N
Climbing
N
Crawling
N
Kneeling
N
The salary range is required by the California Pay Transparency Act and may differ depending on the location of those candidates hired nationwide. Actual compensation is influenced by a wide array of factors including but not limited to, skill set, education, licenses and certifications, essential job duties and requirements, and the necessary experience relative to the job's minimum qualifications.
*This target salary range is for CA positions only and should not be interpreted as an offer of compensation. You may view your privacy rights by reviewing Qcells' Privacy Policy or by contacting our HR team for a copy.