PTR Global
Data Quality Analyst (Media & Entertainment, Taxonomist, Metadata)
PTR Global, Culver City, California, United States, 90232
Data Quality Analyst (Media & Entertainment, Taxonomist, Metadata)
We’re looking for a hands-on Data Quality Analyst to understand datasets, define and capture metadata, build and maintain taxonomies, and improve data quality.
Profile data, standardize definitions, clean and de-duplicate records, and make data easier to discover and trust across the organization.
Responsibilities
Data discovery and profiling: inventory sources, assess data health, completeness, and lineage; summarize patterns and anomalies
Metadata modelling: define data dictionaries, business glossaries, and metadata schemas (technical and business); standardize naming conventions
Taxonomy and classification: design and maintain controlled vocabularies, tagging schemes, and hierarchies; map synonyms; apply metadata at scale
Data quality improvement: design validation rules, deduplication and standardization logic; implement profiling and monitoring; resolve data defects
Canonical mapping: map disparate source fields to a common model; document transformations and provenance
Cataloguing and stewardship: populate and maintain the data catalog; implement lineage, ownership, usage notes, and data access classifications
Collaboration: conduct stakeholder interviews to capture definitions and use cases; partner with data engineering/ML/BI to instrument metadata and checks in pipelines
Documentation and enablement: publish standards, playbooks, and change logs; train teams on taxonomy usage and good metadata practices
Enforce compliance and ethics: apply retention, sensitivity, and privacy tags (e.g., PII); escalate risks and enforce access policies
Required Experience
3+ years in data analysis, data stewardship, taxonomy, library/information science, or similar in a media & entertainment company
Strong SQL experience with data profiling and transformation. Familiarity with Python or R for data cleaning is a plus
Experience defining data dictionaries, glossaries, and metadata schemas; comfort with controlled vocabularies and classification
Practical knowledge of data quality techniques: deduping, standardization, validation rules, and root-cause analysis
Clear communicator who can translate between business and technical audiences; strong documentation habit
Exposure to data catalogs (Alation, Collibra, Atlan, DataHub), taxonomy tools (PoolParty, SmartLogic, Synaptica), or graph/ontology basics; familiarity with metadata standards (e.g., Dublin Core, schema.org) and data privacy (e.g., GDPR/CCPA)
Success Metrics
% of priority datasets cataloged with complete metadata and owners
Reduction in data quality issues (duplicates, nulls, invalid values)
Time-to-discover datasets/fields decreased; increase in catalog search success and usage
Adoption of taxonomy/controlled vocabularies across key teams
The First 90 Days
Audit and profile datasets (e.g., Client TV+, Client Music, App Store); publish a lightweight data health report and create a backlog
Stand up or improve a data dictionary/business glossary and agree on naming standards with stakeholders
Define initial taxonomy and tagging guidelines; apply to top datasets and iterate
Implement a basic data quality rule set
Document processes and handoffs; recommend tool/process changes for scale
Tooling Experience
Python or R for cleansing (pandas, dbt exposures if applicable)
Visualization for profiling/quality dashboards (Tableau, Power BI, Looker)
Version control and tickets (Git, Jira)
Skills
Media & Entertainment – Required
Data Science – 2-5 Years
Data Warehousing – 2-5 Years
Pay Range: $65/hr - $70/hr
The specific compensation for this position will be determined by a number of factors, including the scope, complexity and location of the role as well as the cost of labor in the market; the skills, education, training, credentials and experience of the candidate; and other conditions of employment. Our full-time consultants have access to benefits including medical, dental, and vision as well as 401K contributions.
#J-18808-Ljbffr
Profile data, standardize definitions, clean and de-duplicate records, and make data easier to discover and trust across the organization.
Responsibilities
Data discovery and profiling: inventory sources, assess data health, completeness, and lineage; summarize patterns and anomalies
Metadata modelling: define data dictionaries, business glossaries, and metadata schemas (technical and business); standardize naming conventions
Taxonomy and classification: design and maintain controlled vocabularies, tagging schemes, and hierarchies; map synonyms; apply metadata at scale
Data quality improvement: design validation rules, deduplication and standardization logic; implement profiling and monitoring; resolve data defects
Canonical mapping: map disparate source fields to a common model; document transformations and provenance
Cataloguing and stewardship: populate and maintain the data catalog; implement lineage, ownership, usage notes, and data access classifications
Collaboration: conduct stakeholder interviews to capture definitions and use cases; partner with data engineering/ML/BI to instrument metadata and checks in pipelines
Documentation and enablement: publish standards, playbooks, and change logs; train teams on taxonomy usage and good metadata practices
Enforce compliance and ethics: apply retention, sensitivity, and privacy tags (e.g., PII); escalate risks and enforce access policies
Required Experience
3+ years in data analysis, data stewardship, taxonomy, library/information science, or similar in a media & entertainment company
Strong SQL experience with data profiling and transformation. Familiarity with Python or R for data cleaning is a plus
Experience defining data dictionaries, glossaries, and metadata schemas; comfort with controlled vocabularies and classification
Practical knowledge of data quality techniques: deduping, standardization, validation rules, and root-cause analysis
Clear communicator who can translate between business and technical audiences; strong documentation habit
Exposure to data catalogs (Alation, Collibra, Atlan, DataHub), taxonomy tools (PoolParty, SmartLogic, Synaptica), or graph/ontology basics; familiarity with metadata standards (e.g., Dublin Core, schema.org) and data privacy (e.g., GDPR/CCPA)
Success Metrics
% of priority datasets cataloged with complete metadata and owners
Reduction in data quality issues (duplicates, nulls, invalid values)
Time-to-discover datasets/fields decreased; increase in catalog search success and usage
Adoption of taxonomy/controlled vocabularies across key teams
The First 90 Days
Audit and profile datasets (e.g., Client TV+, Client Music, App Store); publish a lightweight data health report and create a backlog
Stand up or improve a data dictionary/business glossary and agree on naming standards with stakeholders
Define initial taxonomy and tagging guidelines; apply to top datasets and iterate
Implement a basic data quality rule set
Document processes and handoffs; recommend tool/process changes for scale
Tooling Experience
Python or R for cleansing (pandas, dbt exposures if applicable)
Visualization for profiling/quality dashboards (Tableau, Power BI, Looker)
Version control and tickets (Git, Jira)
Skills
Media & Entertainment – Required
Data Science – 2-5 Years
Data Warehousing – 2-5 Years
Pay Range: $65/hr - $70/hr
The specific compensation for this position will be determined by a number of factors, including the scope, complexity and location of the role as well as the cost of labor in the market; the skills, education, training, credentials and experience of the candidate; and other conditions of employment. Our full-time consultants have access to benefits including medical, dental, and vision as well as 401K contributions.
#J-18808-Ljbffr