GovCIO
Extract, Transform, Load Engineer / Data Scientist
GovCIO, Providence, Rhode Island, us, 02912
Overview:
GovCIO is currently seeking a skilled ETL Engineer or Data Scientist to become a vital part of our ETL Team dedicated to designing, developing, and maintaining high-quality ETL pipelines and data infrastructure within AWS GovCloud environments. This position will be fully remote, allowing for flexibility while contributing to significant projects. Responsibilities: Design, develop, and maintain robust ETL pipelines and data infrastructure within AWS GovCloud environments. Collaborate closely with cross-functional teams to ensure efficient processing, analysis, and visualization of data from diverse sources to enhance system performance. Create scalable and high-performance data pipelines using a combination of AWS native services and open-source software. Manage complex data flows and contribute to operational monitoring efforts across a large AWS environment. Data Pipeline Development: Implement Extract, Transform, Load (ETL) solutions for transferring data from various sources, including AWS S3, CloudWatch, EventBridge, and other cloud-based services. Utilize AWS services such as Lambda, Kinesis, and Data Prepper to craft multi-account data pipelines. Utilize AWS CloudFormation for deploying and managing data pipeline infrastructure in an Infrastructure-as-Code (IaC) setting. Establish CloudWatch alarms and synthetic canaries for proactive system health monitoring. Log Aggregation & Metrics Collection: Deploy and configure Fluent-bit agents to aggregate logs from numerous critical systems. Develop custom Lua functions and regex parsers for transforming and directing logs appropriately. Data Analysis, Visualization & Alerting: Create data visualizations, dashboards, and alerts to monitor and identify anomalous system activity. Design and maintain AWS CloudWatch alarms for monitoring cloud applications, infrastructure, and service performance. Set up notifications through Amazon SNS and leverage communication tools (e.g., Slack, email) for timely updates on critical issues. Develop custom visualizations using Vega and establish alerts using Query DSL and Painless scripting. Cluster Management & Data Storage: Manage an OpenSearch cluster to support large-scale data ingestion and querying. Implement explicit mappings for ingested fields in OpenSearch and manage index state for optimal performance. Use Active Directory Federated Services for access control and multi-tenancy within the OpenSearch environment. Application Performance Monitoring: Instrument applications using OpenTelemetry for visibility into application performance. Qualifications: HS Diploma with 9+ years of professional experience. Ability to obtain a Top Secret security clearance. Required Skills and Experience: Must have IAT level II/III certification (e.g., CompTIA Security+(CE)). Experience and certifications in Linux and/or AWS GovCloud technologies. Preferred Skills and Experience: Strong experience with AWS services including Lambda, Kinesis, CloudWatch, S3, EventBridge, and CloudFormation. Proficiency in Python for developing and managing data pipelines. Experience with distributed NoSQL databases such as OpenSearch, Elasticsearch, MongoDB, or Splunk. Experience in deploying and configuring log aggregation and monitoring agents. Knowledge of Infrastructure-as-Code (IaC) principles, mainly through AWS CloudFormation. Effective written and oral communication skills. Relevant technology certifications. Company Overview:
GovCIO is a passionate team dedicated to transforming government IT. Each day, we strive to make a positive impact by delivering innovative IT services that enhance government operations and serve citizens effectively. We seek great individuals to help us achieve our mission. Are you ready to be a transformer? Interview & Hiring Process:
If you are selected for further consideration, you can expect a virtual video interview with the hiring manager and/or team. A valid photo ID must be presented, and the camera should be on during the interview. Additionally, expect an enhanced biometrics ID verification screening and background check, including criminal history and verification of education and employment history. Employee Perks:
At GovCIO, employees cite meaningful work and a collaborative environment as key factors in their satisfaction. Our employees gain access to numerous perks beyond standard health benefits, including but not limited to: Employee Assistance Program (EAP) Corporate Discounts Learning & Development platform with certification preparation resources Training, Education, and Certification Assistance Referral Bonus Program Internal Mobility Program Pet Insurance Flexible Work Environment Join us and contribute to a culture that values its workforce and emphasizes the continuous enhancement of the employee experience. We are an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, gender identity, sexual orientation, national origin, disability, or protected veteran status. Posted Salary Range: USD $130,000.00 - USD $155,000.00 /Yr. This position may include other compensation elements and total compensation will be discussed during the hiring process.
GovCIO is currently seeking a skilled ETL Engineer or Data Scientist to become a vital part of our ETL Team dedicated to designing, developing, and maintaining high-quality ETL pipelines and data infrastructure within AWS GovCloud environments. This position will be fully remote, allowing for flexibility while contributing to significant projects. Responsibilities: Design, develop, and maintain robust ETL pipelines and data infrastructure within AWS GovCloud environments. Collaborate closely with cross-functional teams to ensure efficient processing, analysis, and visualization of data from diverse sources to enhance system performance. Create scalable and high-performance data pipelines using a combination of AWS native services and open-source software. Manage complex data flows and contribute to operational monitoring efforts across a large AWS environment. Data Pipeline Development: Implement Extract, Transform, Load (ETL) solutions for transferring data from various sources, including AWS S3, CloudWatch, EventBridge, and other cloud-based services. Utilize AWS services such as Lambda, Kinesis, and Data Prepper to craft multi-account data pipelines. Utilize AWS CloudFormation for deploying and managing data pipeline infrastructure in an Infrastructure-as-Code (IaC) setting. Establish CloudWatch alarms and synthetic canaries for proactive system health monitoring. Log Aggregation & Metrics Collection: Deploy and configure Fluent-bit agents to aggregate logs from numerous critical systems. Develop custom Lua functions and regex parsers for transforming and directing logs appropriately. Data Analysis, Visualization & Alerting: Create data visualizations, dashboards, and alerts to monitor and identify anomalous system activity. Design and maintain AWS CloudWatch alarms for monitoring cloud applications, infrastructure, and service performance. Set up notifications through Amazon SNS and leverage communication tools (e.g., Slack, email) for timely updates on critical issues. Develop custom visualizations using Vega and establish alerts using Query DSL and Painless scripting. Cluster Management & Data Storage: Manage an OpenSearch cluster to support large-scale data ingestion and querying. Implement explicit mappings for ingested fields in OpenSearch and manage index state for optimal performance. Use Active Directory Federated Services for access control and multi-tenancy within the OpenSearch environment. Application Performance Monitoring: Instrument applications using OpenTelemetry for visibility into application performance. Qualifications: HS Diploma with 9+ years of professional experience. Ability to obtain a Top Secret security clearance. Required Skills and Experience: Must have IAT level II/III certification (e.g., CompTIA Security+(CE)). Experience and certifications in Linux and/or AWS GovCloud technologies. Preferred Skills and Experience: Strong experience with AWS services including Lambda, Kinesis, CloudWatch, S3, EventBridge, and CloudFormation. Proficiency in Python for developing and managing data pipelines. Experience with distributed NoSQL databases such as OpenSearch, Elasticsearch, MongoDB, or Splunk. Experience in deploying and configuring log aggregation and monitoring agents. Knowledge of Infrastructure-as-Code (IaC) principles, mainly through AWS CloudFormation. Effective written and oral communication skills. Relevant technology certifications. Company Overview:
GovCIO is a passionate team dedicated to transforming government IT. Each day, we strive to make a positive impact by delivering innovative IT services that enhance government operations and serve citizens effectively. We seek great individuals to help us achieve our mission. Are you ready to be a transformer? Interview & Hiring Process:
If you are selected for further consideration, you can expect a virtual video interview with the hiring manager and/or team. A valid photo ID must be presented, and the camera should be on during the interview. Additionally, expect an enhanced biometrics ID verification screening and background check, including criminal history and verification of education and employment history. Employee Perks:
At GovCIO, employees cite meaningful work and a collaborative environment as key factors in their satisfaction. Our employees gain access to numerous perks beyond standard health benefits, including but not limited to: Employee Assistance Program (EAP) Corporate Discounts Learning & Development platform with certification preparation resources Training, Education, and Certification Assistance Referral Bonus Program Internal Mobility Program Pet Insurance Flexible Work Environment Join us and contribute to a culture that values its workforce and emphasizes the continuous enhancement of the employee experience. We are an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, gender identity, sexual orientation, national origin, disability, or protected veteran status. Posted Salary Range: USD $130,000.00 - USD $155,000.00 /Yr. This position may include other compensation elements and total compensation will be discussed during the hiring process.