Sr. Data Engineer/Lead (Remote)
Hudson Data - Herndon, Virginia, United States, 22070
Work at Hudson Data
Overview
- View job
Overview
Description: Required Qualifications: BS in Computer Science or Engineering, or a related technical field • Integrate third-party APIs, and the ability to understand them in several formats • 5+ years of Expertise writing production grade code in Python, Kafka, Spark, Scala and/or Java. • Data build tools (DBT) have good experience. • Coding experience in shipping/Devops complex software to production • Expertise in data structures, algorithms, performance and scalability • Knowledge of high-scale performance and optimization tools and techniques • Desire to learn functional programming, specifically in Python/Java, as it is our primary development language • Able to drive ELT solutions across the AWS consumption Stack(Athena, Glue)
Preferred Qualifications: • Experience with Kafka, Spark, Apache Airflow and Kubernates (Docker and containers) • Understanding of fault-tolerant systems, network programming, multithreaded programming, and security • Experience with distributed systems and application design in a SOA environment • Experience with AWS and/or Azure (configuring, deploying, managing) services and distributed applications • experience working with healthcare data (EMR Clinical data, Claims from variety of payers, etc.) • 3+ years of experience working with, analyzing, and understanding data using SQL • Experience with Python a must • Experience with Databricks a plus • Experience with recommendations, predictions, personalization and machine learning, A/B testing, Spark, Kafka, and MongoDB • Experience with healthcare data experience (FHIR, Claims, Eligibility, RX) • Experience with Python spark/Scala/Jav must • Experience with Bigdata Solutions a must.