PETADATA
Senior Data Lake Engineer
Location: Dallas, TX (Remote) | Work Type: C2C | Experience: 15+ Years
We are seeking a seasoned Senior Data Lake Engineer with over 15 years of experience in data engineering and a strong focus on building and managing AWS-native data lake solutions. The ideal candidate will have deep expertise in AWS Lake Formation and serverless data processing with Lambda and Python, along with experience using AI‑assisted development tools such as Amazon Q.
Responsibilities
Design, build, and optimize scalable, secure data lakes using AWS Lake Formation and best practices for data governance, cataloging, and access control.
Build and deploy AWS Lambda functions using Python for real‑time data processing, automation, and event‑driven workflows.
Develop and maintain robust data pipelines using AWS Glue, integrating data from structured and unstructured sources.
Leverage AI‑powered coding tools (Amazon Q, GitHub Copilot, etc.) to increase development speed and code quality.
Design and implement integrations between the Data Lake and DynamoDB, optimizing for performance and consistency.
Implement fine‑grained access control, encryption, and data masking using Lake Formation and IAM to meet compliance standards (GDPR, HIPAA).
Implement logging, monitoring, and optimization for Glue jobs and Lambda functions.
Collaborate with cross‑functional teams, mentor junior engineers, and provide technical leadership.
Required Skills
15+ years in data engineering with 5+ years focused on AWS‑native data lake development.
Expertise in AWS Lake Formation, Glue, Lambda, and DynamoDB.
Proficient in Python for serverless and data processing.
Experience using AI‑assisted development tools (Amazon Q, GitHub Copilot, AWS CodeWhisperer).
Strong knowledge of AWS security practices: IAM, encryption, compliance (GDPR, HIPAA).
Experience with workflow orchestration tools (Step Functions, Airflow, or others).
Excellent communication, problem‑solving, and collaboration skills.
Preferred Skills
AWS certifications (Data Analytics, Solutions Architect, etc.).
Experience with Athena, Redshift, or other AWS analytics services.
Familiarity with DevOps tools (Terraform, CloudFormation, CDK).
Knowledge of data cataloging and metadata management tools.
Education
Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field.
We offer a professional work environment and opportunities for growth in the IT field.
Candidates are required to attend phone/video calls and in‑person interviews. Background checks will be conducted after selection.
Referrals increase your chances of interviewing at PETADATA by 2x.
Please email your résumé to greeshmac@petadata.co.
After reviewing your experience and skills, one of our HR team members will contact you with next steps.