Diverse Lynx

Data Engineer

Diverse Lynx, Boston, Massachusetts, US, 02298

Role name:

Engineer

Role Description:

Job Description for Data Engineer

Who we are looking for

The State Street Security Architecture, Analytics Fusion Engineering (SA2FE) team is looking for a Data Engineer. The Fusion Analytics and Data Engineering team delivers models, insights, and tooling to help Cybersecurity teams make faster, more informed decisions as we work to secure State Street's digital footprint. As a Data/Analytics Engineer, you will develop the data flows, analytics pipelines, and production machine-learning systems, in collaboration with data product managers, architects, engineers, and other team members, to create ETL pipelines, automation, and analytics/ML-driven data products that support our mission to build predictive models and intelligent systems that help secure State Street's information and infrastructure.

What you will be responsible for

As a Data Engineer, you will:

- Use your understanding of large scale data processing and analytics to wrangle our unique cybersecurity data and create automation, analyses and tools that point to the most significant business, governance, and risk management impacts.
- Participate in the design and buildout of petabyte scale systems for high availability, high throughput, data consistency, security, and end user privacy, defining our next generation of data analytics tooling.
- Build data modeling, automation, and ELT workflows to produce Raw, Rationalized, co-Related, and Reporting data flows for graph, timeseries, structured, and semi-structured cybersecurity data.

Education Qualifications

Minimum Qualifications

- B.S. or M.S. in Computer Science or equivalent work experience.
- 5 years of experience building large scale distributed systems and data analytics processes on cloud native, in-memory, and fit-for-purpose hybrid infrastructure.
- Experience with cybersecurity data and globally distributed log event processing systems with data mesh and data federation as the architectural core is highly desirable.
- Experience in big data technologies like Presto/Trino, Spark/Flink, Airflow/Prefect, Redpanda/Kafka, Iceberg/Delta Lake, Snowflake/Databricks, Memgraph/Neo4j, as well as modern security tooling like Splunk, Panther, Datadog, Elastic, ArcSight, etc.
- Experience designing and building data warehouse, data lake or lake house solutions using batch, streaming, lambda and data mesh architectures, and with improving the efficiency, scalability, and stability of system resources.
- Experience working with data warehouses or databases like Snowflake, Redshift, Postgres, Cassandra, etc.
- Experience writing and optimizing complex SQL, ETL development, and designing and building data warehouse, data lake or lake house solutions.
- Experience building data APIs and integrations using tools like GraphQL, Apache Arrow, gRPC, and Protobuf, and designing large scale stream processing systems with Flink, Kafka, NiFi, and similar technologies.
- Experience with distributed systems, distributed data storage, and large-scale data warehousing solutions like BigQuery, Athena, Snowflake, Redshift, Presto, etc.
- Experience working with large datasets and best in class data processing technologies for both stream and batch processing, graph and time series data, notebooks and analytic visualization environments.
- Strong communication and collaboration skills, particularly across teams or with functions like data scientists or business analysts.

Preferred Experience

- 5 years of experience with Python, Java, or similar languages, with cloud infrastructure (e.g. AWS, GCP, Azure), and deep experience working with big data processing infrastructures and ELT orchestration.
- Experience developing distributed batch and real-time feature stores, and developing coordinated batch, streaming and online model execution workflows, building and optimizing large scale data processing jobs in Spark, GraphX/GraphFrames, Spark Structured Streaming, as well as scaling graph and time-series native operations.
- Experience with designing for data lineage, federation, governance, compliance, security, and privacy; hands-on experience with commercial DataSecOps platforms like Immuta, Satori and/or experience building custom access control (RBAC/ABAC), data masking, tokenization, and FPE systems for cloud data lake environments. Experience with globally distributed federated data systems is highly desirable.
- Experience with data quality monitoring and with building continuous data pipelines and implementing history and time-travel using modern data lake storage layers like Delta Lake, Iceberg, and LakeFS.
- Experience with MLOps and iterative cycles of end-to-end development, MRM coordination, deployment, and monitoring of production grade ML models in a regulated high-growth tech environment.

Why this role is important to us

Our technology function, Global Technology Services (GTS), is vital to State Street and is the key enabler for our business to deliver data and insights to our clients. We're driving the company's digital transformation and expanding business capabilities using industry best practices and AI driven, digital-first customer experiences. We offer a collaborative environment where technology skills and innovation are valued in a global organization.
We're looking for top technical talent to join our team and deliver creative technology solutions that help us become an end-to-end, next-generation financial services company. Join us if you want to grow your technical skills, solve real problems and make your mark on our industry.

About State Street

What we do. State Street is one of the largest custodian banks, asset managers and asset intelligence companies in the world. From technology to product innovation, we're making our mark on the financial services industry. For more than two centuries, we've been helping our clients safeguard and steward the investments of millions of people. We provide investment servicing, data analytics, investment research and trading, and investment management to institutional clients.

Work, Live and Grow. We make all efforts to create a great work environment. Our benefits packages are competitive and comprehensive. Details vary by location, but you may expect generous medical care, insurance and savings plans among other perks. You'll have access to the Flexible Work Program to help you match your needs. And our wealth of development programs and educational support will help you reach your full potential.

Inclusion, Diversity and Social Responsibility. We truly believe our employees' diverse backgrounds, experiences and perspectives are a powerful contributor to creating an inclusive environment where everyone can thrive and reach their maximum potential while adding value to both our organization and our clients. We warmly welcome candidates of diverse origin, background, ability, age, sexual orientation, gender identity and personality. Another fundamental value at State Street is active engagement with our communities around the world, both as a partner and a leader.
You will have tools to help balance your professional and personal life, paid volunteer days, a matching gift program and access to employee networks that help you stay connected to what matters to you.

State Street is an equal opportunity and affirmative action employer.

Learn more at StateStreet.com

Competencies:

Digital : Python, Digital : Microsoft Power BI, Digital : Databricks, PL/SQL

Diverse Lynx LLC is an Equal Employment Opportunity employer. All qualified applicants will receive due consideration for employment without any discrimination. All applicants will be evaluated solely on the basis of their ability, competence and their proven capability to perform the functions outlined in the corresponding role. We promote and support a diverse workforce across all levels in the company.