Keystone AI
Data Engineer (Unstructured data specialism)
Keystone AI, San Francisco, California, United States, 94199
Overview
Keystone AI is a premier strategy and economics consulting firm solving the most complex challenges of competition, strategy, and intellectual property for leading technology firms and global brands. We work at the forefront of influential technology cases that are changing consumer behavior and regulatory law and impacting society on a global scale. Keystone AI brings an interdisciplinary approach, leveraging the intersection of economics, technology, and business strategy to deliver transformative ideas.
K.ATS Foundry
K.ATS Foundry is Keystone’s engineering center of excellence, embedding data, platform, and forensic expertise into the firm’s most complex and high‑impact projects. Foundry builds secure, reusable infrastructure and scalable technical solutions that accelerate project delivery and ensure defensible, data‑driven outcomes. Our engineers work across disciplines, from automating data pipelines and managing cloud platforms to conducting forensic code and hardware investigations, helping every engagement start faster, run smoother, and deliver greater impact.
Responsibilities
Architect and drive the design and implementation of data infrastructure and systems, addressing data engineering, data science, and software development needs of client‑facing teams with data‑driven approaches.
Evaluate client needs, propose recommendations, and implement tailored solutions across various practices and projects, including litigation support, regulation, and large language model integration.
Consult with client‑facing teams to design and architect data infrastructure that automates manual processes, optimizes data delivery and processing, and ensures a good user experience.
Manage project big data requests, building reproducible ingestion pipelines and downstream analytic processes to derive value for teams.
Develop custom software solutions such as APIs to interact with large language models or end‑user portals for efficient data search.
Leverage appropriate infrastructure for optimal extraction, transformation, and loading (ETL/ELT) of data from diverse sources using SQL and big data technologies.
Lead creation, maintenance, and implementation of in‑house tools, libraries, and systems to increase efficiency and scalability.
Build tools and APIs to deploy data science and machine learning systems at scale.
Lead adoption of best practices in data engineering and software development.
Collaborate closely with cross‑functional teams of data scientists, engagement managers, and consultants to identify opportunities and deliver value.
Qualifications
Experience in data analysis, data science, and ETL/ELT on large datasets.
Proficiency in architecting cloud‑based solutions on AWS, GCP, or Azure.
Advanced knowledge of Python, R, SQL, and machine‑learning libraries (PyTorch, Keras, TensorFlow, Transformers, NLTK, scikit‑learn, OpenAI API).
Experience with data orchestration tools (Airflow, dbt, Prefect, Luigi).
Big data platform experience (Snowflake, Spark, BigQuery).
Solid understanding of the Software Development Lifecycle and interest in applying it in a fast‑paced, data‑focused role.
Version control (git) workflow experience.
Capacity to work within complex systems and large data sets.
Preferred: Leadership experience with data teams; background in a matrix‑based organization or consulting firm.
Required: CI/CD pipeline experience.
Broad knowledge of AI/ML/NLP/CV/AR concepts.
Experience with LLM training, fine‑tuning, evaluation, and implementation, including vector databases and RAG.
Experience with GCP, AWS, or Azure cloud services, including data services, lambda functions, Bedrock/Vertex AI.
A desire to grow a great team and organization.
The passion to learn, grow, and help improve our business.
Bachelor’s degree; ability to quickly adapt to new technologies.
Salary & Benefits
US Salary Range: $135,000 – $186,000. Includes 401(k) contribution and competitive benefits package. Actual compensation within the range will depend on the level the individual is hired at based on skills, experience, and qualifications.
Diversity & Inclusion
At Keystone AI we believe diversity matters. We seek to advance and promote diversity, foster an inclusive culture, and ensure colleagues have a deep sense of respect and belonging. If you are interested in growing your career with colleagues from varied backgrounds and cultures, consider Keystone AI.
Job Details
Seniority level: Mid‑Senior
Employment type: Full‑time
Job function: Information Technology
Industries: Business Consulting and Services