Affinity.co
Senior Data Engineer, AI Insights
Affinity.co, San Francisco, California, United States, 94199
Overview
Affinity stitches together billions of data points from massive datasets to create a powerful accurate representation of the world’s professional relationship graph. Based on this data we offer our users the insights and visibility they need to nurture and tap into their teams network of opportunities. This role is part of the AI Insights team which owns the services that power Affinity’s industry-leading relationship intelligence platform. Our team extracts and retrieves information from billions of structured and unstructured data points to deliver insights to our customers. As a Senior Data Engineer you will collaborate with machine learning engineers, software engineers and product managers to shape the future of private capital’s leading CRM platform. This involves designing and building scalable, efficient data extraction, load and transform (ELT) solutions, monitoring and managing data quality and ensuring data security and best practices. What you’ll be doing
Design scalable and reliable data pipelines to consume, integrate and analyze large volumes of complex data from different sources supporting the evolving needs of our business. Help define our data roadmap. You’ll collaborate with our team of machine learning engineers, software engineers, product and business leaders to use data to shape product development. Build and maintain frameworks for measuring and monitoring data quality and integrity. Establish and optimize CI/CD processes, test frameworks and infrastructure-as-code tooling. Build and implement robust data solutions using Spark, Python, Databricks, Kafka and the AWS ecosystem (including S3, Redshift, EMR, Athena, Glue). Identify skill and process gaps within the team and develop processes to drive team effectiveness and success. Articulate the trade-offs of different approaches to building ETL pipelines and storage solutions, providing clear recommendations aligned with product and business requirements. To confirm you have read this entire description please include the word #AI-Insights in your answer to the first application question. Qualifications
Dont meet every single requirement? Studies have shown that women and people of color are less likely to apply for jobs unless they meet every qualification. At Affinity we are dedicated to building a diverse, inclusive and authentic workplace so if you’re excited about this role but your past experience doesn’t perfectly align with the qualifications above we encourage you to apply anyways. You may be just the right candidate for this or other roles. Required
5 years of experience as a Data Engineer or Data Platform Engineer working on complex, sometimes ambiguous engineering projects across team boundaries. Proficiency in data modeling, data warehousing and ETL pipeline development is essential. Proven hands-on experience building scalable data platforms and reliable data pipelines using Spark and Databricks and familiarity with Hadoop, AWS SQS, AWS Kinesis, Kafka or similar technologies. Comfortable working with large datasets and high-scale data ingestion, transformation and distributed processing tools such as Apache Spark (Scala or Python). Strong proficiency in SQL. Familiar with industry-standard databases and analytics technologies including Data Warehousing and Data Lakes. Experience with cloud platforms such as AWS, Databricks, GCP, Azure or related technologies. Familiar with CI/CD processes and test frameworks. Comfortable partnering with product and machine learning teams on large strategic data projects. Nice to have
Hands-on experience with both relational and non-relational database/data stores including vector databases (e.g. Weaviate, Milvus), graph databases and text search engines (e.g. OpenSearch or Vespa clusters) with a focus on indexing and query optimization. Experience with Infrastructure as Code (IaC) tools such as Terraform. Experience implementing data consistency measures using validation and monitoring tools. Tech Stack
Our Data stack includes tools to build data pipelines between AWS RDS and DBX via scheduled batch jobs and streaming syncing. Spark SQL and MLlib for large-scale data processing in DBX. We also build data pipelines between RDS and other search-optimized engines, such as in-house data quality tools and governance tools to ensure data quality, security and compliance. How we work
Our culture is a key part of how we operate as well as our hiring process: We iterate quickly. As such you must be comfortable embracing ambiguity, able to cut through it and deliver value to our customers. We are candid, transparent and speak our minds while simultaneously caring personally with each person we interact with. We make data-driven decisions and make the best decision for the moment based on the information available. If you’d like to learn more about our values, click here. What you’ll enjoy at Affinity
We live our values: As owners we take pride in everything we do. We embrace a growth mindset, engage in respectful candor, act as playmakers and dive deep to create the best outcomes for our colleagues and clients. Health Benefits: We cover your medical, dental and vision insurance premiums with comprehensive PPO, HDHP and HMO options (in CA) and offer flexible personal & sick days to support your well-being. Retirement Planning: We offer a 401(k) plan to help you plan for your future. Learning & Development: We provide an annual education budget and a comprehensive L&D program. Wellness Support: We reimburse monthly for things like home internet, meals and wellness memberships/equipment to support your overall health and happiness. Team Connection: Virtual team-building activities and socials to keep our team connected because building strong relationships is key to success. Please note that the role compensation details below reflect the base salary only and do not include any equity or benefits. This represents the salary range that Affinity believes in good faith at the time of this posting that it will pay for the posted job. A reasonable estimate of the current range is $106,200 to $200,000 USD. Within the range individual pay depends on various factors including geographical location and review of experience, knowledge, skills, abilities of the applicant. About Affinity With more than 3,000 customers worldwide and backed by some of Silicon Valley’s best firms, Affinity has raised $120M to empower dealmakers to find, manage, and close more deals. Our Relationship Intelligence platform uses data exhaust from trillions of interactions between Investment Bankers, Venture Capitalists, Consultants and other strategic dealmakers to deliver automated relationship insights that drive hundreds of thousands of deals. We are proud to have received Inc. and Fortune Best Workplaces awards as well as to be Great Places to Work certified for the last 5 years running. Join us on our mission to make it possible for anyone to cultivate and fully harness their network to succeed. We use E-Verify Our company uses E-Verify to confirm the employment eligibility of all newly hired employees. To learn more about E-Verify including your rights and responsibilities please visit Experience : Senior IC Key Skills Apache Hive, S3, Hadoop, Redshift, Spark, AWS, Apache Pig, NoSQL, Big Data, Data Warehouse, Kafka, Scala Employment Type :
Full Time Experience :
years Vacancy :
1 Monthly Salary Salary : 106200 - 200000
#J-18808-Ljbffr
Affinity stitches together billions of data points from massive datasets to create a powerful accurate representation of the world’s professional relationship graph. Based on this data we offer our users the insights and visibility they need to nurture and tap into their teams network of opportunities. This role is part of the AI Insights team which owns the services that power Affinity’s industry-leading relationship intelligence platform. Our team extracts and retrieves information from billions of structured and unstructured data points to deliver insights to our customers. As a Senior Data Engineer you will collaborate with machine learning engineers, software engineers and product managers to shape the future of private capital’s leading CRM platform. This involves designing and building scalable, efficient data extraction, load and transform (ELT) solutions, monitoring and managing data quality and ensuring data security and best practices. What you’ll be doing
Design scalable and reliable data pipelines to consume, integrate and analyze large volumes of complex data from different sources supporting the evolving needs of our business. Help define our data roadmap. You’ll collaborate with our team of machine learning engineers, software engineers, product and business leaders to use data to shape product development. Build and maintain frameworks for measuring and monitoring data quality and integrity. Establish and optimize CI/CD processes, test frameworks and infrastructure-as-code tooling. Build and implement robust data solutions using Spark, Python, Databricks, Kafka and the AWS ecosystem (including S3, Redshift, EMR, Athena, Glue). Identify skill and process gaps within the team and develop processes to drive team effectiveness and success. Articulate the trade-offs of different approaches to building ETL pipelines and storage solutions, providing clear recommendations aligned with product and business requirements. To confirm you have read this entire description please include the word #AI-Insights in your answer to the first application question. Qualifications
Dont meet every single requirement? Studies have shown that women and people of color are less likely to apply for jobs unless they meet every qualification. At Affinity we are dedicated to building a diverse, inclusive and authentic workplace so if you’re excited about this role but your past experience doesn’t perfectly align with the qualifications above we encourage you to apply anyways. You may be just the right candidate for this or other roles. Required
5 years of experience as a Data Engineer or Data Platform Engineer working on complex, sometimes ambiguous engineering projects across team boundaries. Proficiency in data modeling, data warehousing and ETL pipeline development is essential. Proven hands-on experience building scalable data platforms and reliable data pipelines using Spark and Databricks and familiarity with Hadoop, AWS SQS, AWS Kinesis, Kafka or similar technologies. Comfortable working with large datasets and high-scale data ingestion, transformation and distributed processing tools such as Apache Spark (Scala or Python). Strong proficiency in SQL. Familiar with industry-standard databases and analytics technologies including Data Warehousing and Data Lakes. Experience with cloud platforms such as AWS, Databricks, GCP, Azure or related technologies. Familiar with CI/CD processes and test frameworks. Comfortable partnering with product and machine learning teams on large strategic data projects. Nice to have
Hands-on experience with both relational and non-relational database/data stores including vector databases (e.g. Weaviate, Milvus), graph databases and text search engines (e.g. OpenSearch or Vespa clusters) with a focus on indexing and query optimization. Experience with Infrastructure as Code (IaC) tools such as Terraform. Experience implementing data consistency measures using validation and monitoring tools. Tech Stack
Our Data stack includes tools to build data pipelines between AWS RDS and DBX via scheduled batch jobs and streaming syncing. Spark SQL and MLlib for large-scale data processing in DBX. We also build data pipelines between RDS and other search-optimized engines, such as in-house data quality tools and governance tools to ensure data quality, security and compliance. How we work
Our culture is a key part of how we operate as well as our hiring process: We iterate quickly. As such you must be comfortable embracing ambiguity, able to cut through it and deliver value to our customers. We are candid, transparent and speak our minds while simultaneously caring personally with each person we interact with. We make data-driven decisions and make the best decision for the moment based on the information available. If you’d like to learn more about our values, click here. What you’ll enjoy at Affinity
We live our values: As owners we take pride in everything we do. We embrace a growth mindset, engage in respectful candor, act as playmakers and dive deep to create the best outcomes for our colleagues and clients. Health Benefits: We cover your medical, dental and vision insurance premiums with comprehensive PPO, HDHP and HMO options (in CA) and offer flexible personal & sick days to support your well-being. Retirement Planning: We offer a 401(k) plan to help you plan for your future. Learning & Development: We provide an annual education budget and a comprehensive L&D program. Wellness Support: We reimburse monthly for things like home internet, meals and wellness memberships/equipment to support your overall health and happiness. Team Connection: Virtual team-building activities and socials to keep our team connected because building strong relationships is key to success. Please note that the role compensation details below reflect the base salary only and do not include any equity or benefits. This represents the salary range that Affinity believes in good faith at the time of this posting that it will pay for the posted job. A reasonable estimate of the current range is $106,200 to $200,000 USD. Within the range individual pay depends on various factors including geographical location and review of experience, knowledge, skills, abilities of the applicant. About Affinity With more than 3,000 customers worldwide and backed by some of Silicon Valley’s best firms, Affinity has raised $120M to empower dealmakers to find, manage, and close more deals. Our Relationship Intelligence platform uses data exhaust from trillions of interactions between Investment Bankers, Venture Capitalists, Consultants and other strategic dealmakers to deliver automated relationship insights that drive hundreds of thousands of deals. We are proud to have received Inc. and Fortune Best Workplaces awards as well as to be Great Places to Work certified for the last 5 years running. Join us on our mission to make it possible for anyone to cultivate and fully harness their network to succeed. We use E-Verify Our company uses E-Verify to confirm the employment eligibility of all newly hired employees. To learn more about E-Verify including your rights and responsibilities please visit Experience : Senior IC Key Skills Apache Hive, S3, Hadoop, Redshift, Spark, AWS, Apache Pig, NoSQL, Big Data, Data Warehouse, Kafka, Scala Employment Type :
Full Time Experience :
years Vacancy :
1 Monthly Salary Salary : 106200 - 200000
#J-18808-Ljbffr