Logo
System One

Senior Big Data Engineer (Python/PySpark)

System One, Strongsville, Ohio, United States, 44136

Save Job

Overview Position Title: Senior Big Data Engineer (Python/PySpark)

Locations/Flex: Pittsburgh PA - Two PNC Plaza or Cleveland OH - Strongsville Technology Center. The hiring team is not sourcing outside these hubs at this time. Working arrangement is Hybrid: 3 days in office, 2 remote. Acceptable time zones: EST. Working days: Mon–Fri, 40 hours. Working hours (flexible): 8:00 a.m. – 5:00 p.m. EST.

Responsibilities

Interact with the business to understand evolving requirements and adapt accordingly.

Understand business requirements and tech stack involved; guide the team toward an integral solution approach.

Demonstrate dedication and strong communication skills; participate in Agile ceremonies.

Hands-on experience with object-oriented programming, including its limitations.

Familiarity with cross-platform development.

Manage database storage and retrieval on data lake platforms; extensive experience in handling.

Extensive Unix/FTP/file handling.

Strong hands-on experience with SQL databases.

Must Have Technical Skills

Deep understanding of multi-process architecture and the threading limitations of Python.

PySpark data engineering skills.

Experience using libraries like Pandas, NumPy and MatPlotLib; experience with file-based systems (CSV/Parquet).

Extensive knowledge in understanding data and creating data pipelines.

Hands-on experience designing and implementing APIs.

Hands-on experience building microservices using FastAPI or related technologies.

Experience writing unit tests and ensuring code coverage.

Experience with Agile methodology/JIRA/Confluence.

Experience in version control using Git.

Flex Skills / Nice to Have

Added advantage if you have worked on Big Data, PySpark libraries.

Education / Certifications Bachelors Preferred

Python or Spark certifications are a plus

Role Differentiator Conversion possibility and growth opportunities are strong, with ongoing work and energizing new opportunities. The role supports the bank in a critical area and contributes to overall success.

Skills

Deep understanding of multi-process architecture and Python threading limitations.

Experience in version control using Git.

Experience with Pandas, NumPy and MatPlotLib; file-based systems (CSV/Parquet).

Experience with Agile methodology/JIRA/Confluence.

Experience writing unit tests and ensuring code coverage.

Understanding data and creating data pipelines.

Hands-on experience building microservices using FastAPI or related technologies.

PySpark data engineering skills.

How to Apply Share your resume with Ariz Khan at ariz.khan@systemone.com. Also connect on LinkedIn: Ariz J. Khan.

Ref: #404-IT Pittsburgh

#J-18808-Ljbffr