Miracle Software Systems, Inc

GCP Data Engineer (F2F) - W2

Miracle Software Systems, Inc, Dearborn, Michigan, United States, 48120

Miracle Software Systems is looking for a GCP Data Engineer (F2F, W2/full-time).
Note: A face-to-face interview is required, so local candidates are needed.
Location: Dearborn, MI
Duration: 12+ Months

Skills:

GCP, ETL, Apache Spark, Data Architecture, Python, SQL, Kafka

Position Description:
Employees in this job function are responsible for designing, building, and maintaining data solutions, including data infrastructure and pipelines, for collecting, storing, processing, and analyzing large volumes of data efficiently and accurately.

Key Responsibilities

Collaborate with business and technology stakeholders to understand current and future data requirements.
Design, build, and maintain reliable, efficient, and scalable data infrastructure for data collection, storage, transformation, and analysis.
Plan, design, build, and maintain scalable data solutions, including data pipelines, data models, and applications, for efficient and reliable data workflows.
Design, implement, and maintain existing and future data platforms such as data warehouses, data lakes, and data lakehouses for structured and unstructured data.
Design and develop analytical tools, algorithms, and programs to support data engineering activities such as writing scripts and automating tasks.
Ensure optimum performance and identify improvement opportunities.

Skills Required

Google Cloud Platform
ETL
Apache Spark
Data Architecture
Python
SQL
Kafka

Experience & Skills

Data Pipeline Architecture & Development: Design, build, and maintain highly scalable, fault-tolerant, and performant data pipelines to ingest and process data from 10+ siloed sources, including both structured and unstructured formats.
ML-Driven ETL Implementation: Operationalize ETL pipelines for intelligent data ingestion, automated cataloging, and sophisticated normalization of diverse datasets.
Unified Data Model Creation: Architect and implement a unified data model capable of connecting all relevant data elements across various sources, optimized for efficient querying and insight generation by AI agents and chatbot interfaces.
Big Data Processing: Utilize advanced distributed processing frameworks (Apache Beam, Apache Spark, Google Cloud Dataflow) to handle large-scale data transformations and data flow.
Cloud-Native Data Infrastructure: Leverage GCP services to build and manage robust data storage, processing, and orchestration layers.
Data Quality, Governance & Security: Implement rigorous data quality gates, validation rules, bad record handling, and comprehensive logging. Ensure strict adherence to data security policies, IAM role management, and GCP perimeter security.
Automation & Orchestration: Develop shell scripts and Cloud Build YAMLs, and utilize Cloud Scheduler and Pub/Sub for end-to-end automation of data pipelines and infrastructure provisioning.
Collaboration with AI/ML Teams: Work closely with AI/ML engineers, data scientists, and product managers to understand data requirements, integrate data solutions with multi-agentic systems, and optimize data delivery for chatbot functionalities.
Testing & CI/CD: Implement robust testing strategies, maintain high code quality through active participation in Git/GitHub, perform code reviews, and manage CI/CD pipelines via Cloud Build.
Performance Tuning & Optimization: Continuously monitor, optimize, and troubleshoot data pipelines and BigQuery performance using techniques like table partitioning, clustering, and sharding.

Seniority level

Mid-Senior level

Employment type

Full-time

Job function

Information Technology, Software Development

Detroit, MI $112,597.33-$152,810.66 1 month ago
