Application Management Services LLC

Senior Data Engineer (ETL / PySpark)

Application Management Services LLC, Princeton, New Jersey, us, 08543

Overview In order to be successful:

You should have a working knowledge of industry

standard Data Infrastructure

(e.g. Warehouse, BI, Analytics, Big-Data, etc.) tools with the goal of providing end users with analytics at the speed of thought.

You should be proficient at developing, architecting, standardizing and supporting technology platforms using Industry leading ETL solutions.

You should thrive in building scalable and high throughput systems

You should have experience with agile BI & ETL practices to assist with the interim Data preparation for Data Discovery & self-service needs.

You must have strong communication, presentation, problem-solving, and trouble-shooting skills.

You should be highly motivated to drive innovations company-wide.

Required Qualifications

5 years of experience in designing and developing ETL pipelines

leveraging pyspark/python.

Experience on python database libraries like SQLAlchemy, psycopg2 .etc

Strong understanding of data warehousing methodologies, ETL processing and dimensional data modeling.

Advanced SQL capabilities are required. Knowledge of database design techniques and experience working with extremely large data volumes is a plus.

Demonstrated experience and ability to work with business users to gather requirements and manage scope.

Experience in workflow tools such as oozie or Airflow or Tidal

Experience working in a big data environment with technologies such as Greenplum, Hadoop and HIVE

BA, BS, MS, PhD in Computer Science, Engineering or related technology field

Nice to have

Experience with large database and DW Implementation (20 TBs)

Understanding of VLDB performance aspects, such as table partitioning, sharding, table distribution and optimization techniques

Knowledge of reporting tools such as Qlik Sense, Tableau, Cognos

#J-18808-Ljbffr