Serigor Inc is hiring: Data Analyst in Arlington
Serigor Inc, Arlington, VA, United States, 22201
Job Title: Data Analyst (Onsite/Hybrid)
Location: Arlington, VA
Duration: 6+ Months
Remote/Onsite Hybrid Model: This position requires presence at Corporate Headquarters in Arlington, VA and Dulles Airport Corporate office for at least four days a week (Monday–Thursday). The remaining day may be remote. Work hours are 8:00 am–6:00 pm.
This position does not require a vehicle for mobility between campuses.
This position does not require AOA security access badging.
The role designs and implements strategic data initiatives across the client, including enterprise data warehousing, metadata management, and machine learning solutions. Extensive experience in data architecture, ETL development, and advanced analytics using Qlik Sense is required.
Job Description and Role
- Designs and implements strategic data initiatives such as enterprise data warehouse, master data, data governance, data quality, metadata management, and data marts.
- Administers and monitors automated and manual data integration and Data/ETL jobs to verify execution and measure performance.
- Designs the Airports Authority-wide metadata and information management program, developing enterprise conceptual, logical, and physical data models. Provides data architecture support to major application development initiatives in an agile environment.
- Implements program standards and procedures to support data warehouse administration. Provides technical advice on data management and design to Application Development and Enterprise Architecture project teams, end‑users, and business stakeholders.
- Develops design and Data/ETL coding of Source Dependent Extracts (Relational DB, APIs, flat files), Source Independent Loads, and Post‑Load Processes from source to target systems for operational data stores and dimensional data warehouses into an in‑memory or Massively Parallel Processing platform or an SQL Server/Oracle Data Warehouse, using near real‑time loads and Change Data Capture.
- Data mining and extracting usable data; using machine learning tools to select features, create and optimize classifiers; carrying out preprocessing of structured and unstructured data.
- Enhances data collection procedures to include all relevant information for developing analytic systems.
- Processes, cleanses, and validates the integrity of data for analysis.
- Analyzes large amounts of information to find patterns and solutions. Develops prediction systems and machine learning algorithms.
- Delivers an ML project from beginning to end, including understanding the business need, aggregating data, exploring data, building & validating predictive models, and deploying completed models with concept‑drift monitoring and retraining to deliver business impact to the organization.
- Researches and implements novel ML approaches using AI services, ML platforms and frameworks (e.g., TensorFlow, PyTorch, OpenNN, H2O.ai SparkML, SageMaker, scikit‑learn, MXNet, Azure Synapse Analytics, Google BigQuery). Implementation includes descriptive, predictive, and prescriptive analytics.
- Performs other duties as assigned.
Qualifications
- Eight years of progressively responsible experience in data warehousing and integration, including experience applying data modeling techniques and ETL coding.
- Knowledge of and skill in developing and implementing a corporate data architecture program with responsibility for enterprise conceptual/logical data modeling, data policies, standards and compliance monitoring, metadata mapping, data governance, and as‑is/target data architecture to participate in strategic data initiatives.
- Experience in an ML engineer or data scientist role building and deploying ML models or hands‑on experience developing deep learning models.
- Knowledge of and skill in working with database management systems, such as Oracle, Redshift, MySQL, Snowflake and Microsoft SQL, and data warehouse solutions, and ETL tools (like Informatica Power Center, IICS) and reporting tools (like Qlik Sense, Tableau and PowerBI) to centrally manage and analyze data originating from disparate source systems.
- Programming skills – knowledge of statistical programming languages like R, Python, and database query languages such as SQL, Hive, Pig is desirable. Familiarity with Scala, Java, or C++ is an added advantage.
- Statistics – Good applied statistical skills, including knowledge of statistical tests, distributions, regression, maximum likelihood estimators, etc. Proficiency in statistics is essential for data‑driven companies.
- Machine Learning – good knowledge of machine learning methods such as k‑Nearest Neighbors, Naive Bayes, SVM, Decision Forests.
- Strong Math Skills (Multivariable Calculus and Linear Algebra) – understanding the fundamentals of Multivariable Calculus and Linear Algebra is important as they form the basis of many predictive performance or algorithm optimization techniques.
Seniority level
- Mid‑Senior level
Employment type
- Contract
Job function
- Information Technology
Industries
- IT Services and IT Consulting, Software Development, and Information Technology & Services