OSI Engineering
This range is provided by OSI Engineering. Your actual pay will be based on your skills and experience; talk with your recruiter to learn more.
Base pay range
$80.00/hr - $95.00/hr

A globally leading technology company is looking for an experienced Data Engineer to support large-scale data operations for machine learning workflows. In this role, you will collaborate closely with external data vendors and internal teams to ingest, validate, curate, and organize high-quality datasets that enable downstream ML model development. The ideal candidate will have a strong background in Python and hands-on experience working with AWS S3-based data pipelines. If you're passionate about data infrastructure and want to help power the next generation of AI models, we invite you to apply!

Job Responsibilities:
- Collaborate with external data collection vendors to track and ingest incoming datasets.
- Design and execute robust data validation and curation pipelines to ensure data quality and consistency.
- Implement logic to bin and categorize data according to project-specific criteria.
- Run pseudo-labeling workflows on newly ingested data using pre-trained ML models.
- Maintain clear status and versioning of datasets throughout their lifecycle.
- Distribute and deliver validated data assets to various internal product and ML teams.
- Maintain logs and reports to ensure traceability and accountability across data operations.

Candidate Requirements:
- 5+ years of industry experience in data engineering, data pipelines, or ML infrastructure.
- Strong proficiency in Python, including data processing and scripting.
- Experience working with AWS S3 for managing and organizing large-scale datasets.
- Familiarity with data quality assurance and curation processes.
- Comfortable operating in Unix/Linux environments, with familiarity in using command-line tools.
- Strong communication and coordination skills, especially when collaborating with external vendors and distributed teams.
- Self-driven, organized, and able to handle multiple data workflows in parallel.

Nice to Have:
- Experience with ML pipelines, especially pseudo-labeling or active learning.
- Familiarity with data versioning tools or frameworks (e.g., DVC, LakeFS).
- Prior experience in managing vendor relationships or annotation workflows.

Type: Contract
Duration: 12 months (with a possibility to extend)
Work Location: Sunnyvale, CA (100% on-site)
Pay Range: $82.00 - $97.00 (DOE)
Seniority level: Mid-Senior level
Employment type: Full-time
Job function:
Industries: Software Development