Junior Data Engineer
Data Machines Inc - Reston, Virginia, United States, 22090
Work at Data Machines Inc
Overview
- View job
Overview
Location: Reston, VA
Clearance Requirement: TS w/ SCI Eligibility
Job Description and Responsibilities:
Come join the future of data-driven decision making! At Data Machines we leverage data analytics, DevSecOps, machine intelligence, and data science to engineer solutions for our Federal government, defense, and commercial sponsors to solve real-world, critical mission problems.
Data Machines is looking for a motivated and detail-oriented Junior Data Engineer to join our growing Data Engineering team. This is an exciting opportunity for someone early in their career to gain hands-on experience with modern data technologies, contribute to the development of data pipelines, and help drive data-driven decision-making across the organization. This position is full-time on site in Reston, VA.
Key Responsibilities: Assist in the design, development, and maintenance of scalable data pipelines and ETL processes Work with structured and unstructured data from various sources to ingest, clean, transform, and store in appropriate formats Support the creation and optimization of data models in data warehouses (e.g., Postgres) Monitor data pipeline performance and troubleshoot issues as needed Collaborate with data analysts, data scientists, and software engineers to understand data needs Ensure data quality, integrity, and consistency across all data systems Maintain documentation for data processes and pipelines Learn and adapt to new tools, technologies, and best practices in data engineering Minimum Qualifications:
Active TS Clearance with SCI Eligibility Bachelor's degree in Computer Science, Engineering, Information Systems, or a related field Proficiency in SQL and at least one programming language (e.g., Python) Familiarity with relational databases and data warehousing concepts Understanding of ETL concepts and tools Exposure to workflow orchestration tools like Apache Airflow, NiFi and Kafka Strong analytical and problem-solving skills Excellent communication and teamwork abilities Eagerness to learn and grow in a fast-paced environment Experience in Jupyter Notebooks, PostgreSQL. Experience with version control systems (e.g., Git) Desired Qualifications:
Knowledge of data lake technologies and big data tools (e.g., Spark) Familiarity with containerization tools like Docker