Data Engineer
Strata Clean Energy - Durham, North Carolina, United States, 27703
Work at Strata Clean Energy
Overview
- View job
Overview
Data Engineer
role at
Strata Clean Energy Join to apply for the
Data Engineer
role at
Strata Clean Energy Get AI-powered advice on this job and more exclusive features. Design, build, and maintain scalable ETL/ELT pipelines to process large datasets efficiently. Leverage Python for scripting and orchestration tasks. Database Management: Develop and optimize queries and schemas in ClickHouse and SQL databases. Ensure performance for high-volume data operations, including tuning for performance, correctness and efficient storage. Support data integration efforts combining ClickHouse, MS SQL Server and Databricks. Deploy and manage data workflows and applications on Azure cloud services, Docker and Python orchestration tools. Implement solutions for data storage, processing, and analytics using Azure-native and on-prem solutions. Data Orchestration and Automation: Use Python-based orchestration tools (e.g., Apache Airflow, Dagster, or Prefect) to schedule and monitor workflows. Automate repetitive tasks and maintain robust workflows with fault-tolerance and scalability. Manage containerized applications for deployments and CI/CD pipelines. Collaborate with cross-functional teams to utilize Databricks for big data processing and advanced analytics. Documentation and Best Practices: Document processes, configurations, and troubleshooting guides. Enforce best practices for coding, testing, and deployment across data engineering tasks
Essential Duties And Responsibilities
Data Pipeline Development: Design, build, and maintain scalable ETL/ELT pipelines to process large datasets efficiently. Leverage Python for scripting and orchestration tasks. Database Management: Develop and optimize queries and schemas in ClickHouse and SQL databases. Ensure performance for high-volume data operations, including tuning for performance, correctness and efficient storage. Support data integration efforts combining ClickHouse, MS SQL Server and Databricks. Cloud Infrastructure: Deploy and manage data workflows and applications on Azure cloud services, Docker and Python orchestration tools. Implement solutions for data storage, processing, and analytics using Azure-native and on-prem solutions. Data Orchestration and Automation: Use Python-based orchestration tools (e.g., Apache Airflow, Dagster, or Prefect) to schedule and monitor workflows. Automate repetitive tasks and maintain robust workflows with fault-tolerance and scalability. Manage containerized applications for deployments and CI/CD pipelines. Collaborate with cross-functional teams to utilize Databricks for big data processing and advanced analytics. Documentation and Best Practices: Document processes, configurations, and troubleshooting guides. Enforce best practices for coding, testing, and deployment across data engineering tasks
Technical Skills
Strong Proficiency in Python for scripting, data manipulation, and orchestration. Strong knowledge of SQL database management and advanced SQL skills. Experience with Azure services (e.g., Azure Databricks, Blob Storage). Hands-on experience with Docker for containerization. Familiarity with Grafana for system monitoring and visualization. Working knowledge of Databricks for big data and machine learning workflows.
Education And/or Work Experience Requirements
Bachelor’s or Master’s degree in Computer Science, Data Engineering, or a related field. Strong problem-solving skills and a deep understanding of data architecture principles. Ability to manage multiple priorities and work effectively in a collaborative environment. Excellent communication and documentation skills.
Preferred Qualifications
Experience with ClickHouse, OLAP Databases Knowledge of distributed systems and data modeling best practices. Experience with PV modeling tools and techniques (e.g., PVSyst, PVlib, SAM, and/or similar software) Experience with AVEVA and IBM Maximo
Physical Requirements/Work Environment:
Work is performed in an office environment. On occasion, the position may require an in-person site visit. Must be able to sit, stand, bend, twist and lift up to 20 pounds.
This job description is intended to describe the essential job functions of this position and is not intended to be an all-inclusive statement of job responsibilities.
I have read, understand and am able to perform the duties within the job description. I have received a copy of this form.
Seniority level
Seniority level Entry level Employment type
Employment type Full-time Job function
Job function Information Technology Referrals increase your chances of interviewing at Strata Clean Energy by 2x Sign in to set job alerts for “Data Engineer” roles.
Software Engineer I (Full Time) United States
Raleigh, NC $100,000.00-$175,000.00 3 months ago Software Engineer I (Intern) United States
Raleigh, NC $44,000.00-$130,000.00 6 days ago Software Engineer II (Full Time) United States
Software Engineer II (Intern) United States
Frontend Developer (React JS with Mapping)
Software Engineer ( Fullstack - .NET/C#/Angular)
Software Engineer, Site Reliability Engineering, Campus
We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
#J-18808-Ljbffr