Scribd, Inc.
Software Engineer II (Backend + Data pipelines)
Scribd, Inc., Jacksonville, Florida, United States, 32290
Overview
Join to apply for the
Software Engineer II (Backend + Data pipelines)
role at
Scribd, Inc. At Scribd (pronounced scribbed), our mission is to spark human curiosity. Join our team as we create a world of stories and knowledge, democratize the exchange of ideas and information, and empower collective expertise through our three products: Everand, Scribd, and Slideshare. We support a culture where our employees can be real and bold; where we debate and commit as we embrace plot twists; and where every employee is empowered to take action as we prioritize the customer. We balance individual flexibility with community connections through Scribd Flex, allowing employees to choose their daily work style in partnership with their manager. Occasional in-person attendance is required for all Scribd employees, regardless of location. We hire for GRIT — Goals, Results, Innovative ideas, and Team collaboration. We are looking for candidates who demonstrate these traits in their work. Team
The ML Data Engineering team powers metadata extraction, enrichment, and content understanding across Scribd brands. We process hundreds of millions of documents and billions of images to deliver high-quality metadata for discovery and trust for millions of users worldwide. We work at scale across UGC, ebooks, audiobooks, and more, combining machine learning, data engineering, and distributed systems. We deploy scalable ML and LLM powered solutions in production in collaboration with applied research and product teams. Role Overview
We are seeking a Software Engineer II with strong backend development experience and a passion for solving complex data challenges at scale. You will design, build, and optimize distributed systems that extract, enrich, and process metadata for a wide range of content. You will work with ML engineers, product managers, and cross functional partners to integrate machine learning models and LLM based services into production pipelines and deliver impactful, high performance solutions. This role offers the opportunity to work on cutting edge generative AI and metadata enrichment problems at a global scale. Tech Stack
Python, Scala, Ruby on Rails, Airflow, Databricks, Spark, HTTP APIs, AWS (Lambda, ECS, SQS, ElastiCache, SageMaker, CloudWatch, Datadog) and Terraform. Key Responsibilities
Design and build scalable systems to extract, enrich, and process metadata from millions of documents, images, and audio content. Leverage LLMs to integrate capabilities like summarization, classification, extraction, and enrichment into metadata pipelines. Collaborate with cross-functional teams to deliver scalable, efficient, and reliable metadata solutions. Optimize and refactor existing systems for performance, scalability, and reliability. Ensure data accuracy, integrity, and quality through automated validation and monitoring. Participate in code reviews and maintain high-quality standards in the codebase. Manage and maintain data pipelines, security, and infrastructure. Requirements
4+ years of professional software engineering experience Proficiency in Python, Scala, Ruby, or similar languages Experience designing and building distributed systems at scale Hands-on experience building, deploying, and optimizing solutions using ECS, EKS, or AWS Lambda Experience with infrastructure-as-code tools like Terraform (or similar) Experience working with a public cloud provider (AWS, Azure, or Google Cloud) Familiarity with data processing frameworks like Spark or Databricks for large-scale workloads Proven ability to test, profile, and optimize systems for performance, scalability, and reliability Bachelor’s degree in Computer Science or equivalent professional experience Bonus: Experience working with LLMs or integrating ML models into production systems Compensation
At Scribd, your base pay is one part of your total compensation package and is determined within a range. Salary ranges vary by location and level. United States (outside California): $103,500 - $186,500; California: $126,000 - $196,000. In Canada: $131,500 CAD - $174,500 CAD. These ranges are based on local labor benchmarks; final pay is determined by experience and role requirements. This position is eligible for equity and a comprehensive benefits package. Working at Scribd
Are you currently based in a location where Scribd is able to employ you? Primary residence must be in or near one of the listed cities in the United States, Canada, or Mexico, or within commuting distance of those cities. US: Atlanta, Austin, Boston, Dallas, Denver, Chicago, Houston, Jacksonville, Los Angeles, Miami, New York City, Phoenix, Portland, Sacramento, Salt Lake City, San Diego, San Francisco, Seattle, Washington D.C. Canada: Ottawa, Toronto, Vancouver. Mexico City. Benefits, Perks, and Wellbeing
Benefits/perks may vary by location. Healthcare insurance (Medical/Dental/Vision): 100% paid for employees 12 weeks paid parental leave Short-term/long-term disability plans 401k/RSP matching Onboarding stipend for home office peripherals + accessories Learning & Development allowance Learning & Development programs Quarterly stipend for Wellness, WiFi, etc. Mental health support and resources Free subscription to the Scribd Inc. suite of products Referral bonuses Book benefit Sabbaticals Company-wide events Team engagement budgets Vacation & Personal Days Paid Holidays (+ winter break) Flexible Sick Time Volunteer Day Inclusive and diverse workplace resources Access to AI Tools: free access to AI tools to boost productivity and innovation Want to learn more about life at Scribd?
https://www.linkedin.com/company/scribd/life We want our interview process to be accessible to everyone. You can inform us of any reasonable adjustments we can make to better accommodate your needs by emailing accommodations@scribd.com about the need for adjustments at any point in the interview process. Scribd is committed to equal employment opportunity regardless of race, color, religion, national origin, gender, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law. We encourage people of all backgrounds to apply, and believe that a diversity of perspectives and experiences create a foundation for the best ideas. Come join us in building something meaningful. Seniority level
Mid-Senior level Employment type
Full-time Job function
Engineering and Information Technology Industries Referrals increase your chances of interviewing at Scribd, Inc. by 2x
#J-18808-Ljbffr
Join to apply for the
Software Engineer II (Backend + Data pipelines)
role at
Scribd, Inc. At Scribd (pronounced scribbed), our mission is to spark human curiosity. Join our team as we create a world of stories and knowledge, democratize the exchange of ideas and information, and empower collective expertise through our three products: Everand, Scribd, and Slideshare. We support a culture where our employees can be real and bold; where we debate and commit as we embrace plot twists; and where every employee is empowered to take action as we prioritize the customer. We balance individual flexibility with community connections through Scribd Flex, allowing employees to choose their daily work style in partnership with their manager. Occasional in-person attendance is required for all Scribd employees, regardless of location. We hire for GRIT — Goals, Results, Innovative ideas, and Team collaboration. We are looking for candidates who demonstrate these traits in their work. Team
The ML Data Engineering team powers metadata extraction, enrichment, and content understanding across Scribd brands. We process hundreds of millions of documents and billions of images to deliver high-quality metadata for discovery and trust for millions of users worldwide. We work at scale across UGC, ebooks, audiobooks, and more, combining machine learning, data engineering, and distributed systems. We deploy scalable ML and LLM powered solutions in production in collaboration with applied research and product teams. Role Overview
We are seeking a Software Engineer II with strong backend development experience and a passion for solving complex data challenges at scale. You will design, build, and optimize distributed systems that extract, enrich, and process metadata for a wide range of content. You will work with ML engineers, product managers, and cross functional partners to integrate machine learning models and LLM based services into production pipelines and deliver impactful, high performance solutions. This role offers the opportunity to work on cutting edge generative AI and metadata enrichment problems at a global scale. Tech Stack
Python, Scala, Ruby on Rails, Airflow, Databricks, Spark, HTTP APIs, AWS (Lambda, ECS, SQS, ElastiCache, SageMaker, CloudWatch, Datadog) and Terraform. Key Responsibilities
Design and build scalable systems to extract, enrich, and process metadata from millions of documents, images, and audio content. Leverage LLMs to integrate capabilities like summarization, classification, extraction, and enrichment into metadata pipelines. Collaborate with cross-functional teams to deliver scalable, efficient, and reliable metadata solutions. Optimize and refactor existing systems for performance, scalability, and reliability. Ensure data accuracy, integrity, and quality through automated validation and monitoring. Participate in code reviews and maintain high-quality standards in the codebase. Manage and maintain data pipelines, security, and infrastructure. Requirements
4+ years of professional software engineering experience Proficiency in Python, Scala, Ruby, or similar languages Experience designing and building distributed systems at scale Hands-on experience building, deploying, and optimizing solutions using ECS, EKS, or AWS Lambda Experience with infrastructure-as-code tools like Terraform (or similar) Experience working with a public cloud provider (AWS, Azure, or Google Cloud) Familiarity with data processing frameworks like Spark or Databricks for large-scale workloads Proven ability to test, profile, and optimize systems for performance, scalability, and reliability Bachelor’s degree in Computer Science or equivalent professional experience Bonus: Experience working with LLMs or integrating ML models into production systems Compensation
At Scribd, your base pay is one part of your total compensation package and is determined within a range. Salary ranges vary by location and level. United States (outside California): $103,500 - $186,500; California: $126,000 - $196,000. In Canada: $131,500 CAD - $174,500 CAD. These ranges are based on local labor benchmarks; final pay is determined by experience and role requirements. This position is eligible for equity and a comprehensive benefits package. Working at Scribd
Are you currently based in a location where Scribd is able to employ you? Primary residence must be in or near one of the listed cities in the United States, Canada, or Mexico, or within commuting distance of those cities. US: Atlanta, Austin, Boston, Dallas, Denver, Chicago, Houston, Jacksonville, Los Angeles, Miami, New York City, Phoenix, Portland, Sacramento, Salt Lake City, San Diego, San Francisco, Seattle, Washington D.C. Canada: Ottawa, Toronto, Vancouver. Mexico City. Benefits, Perks, and Wellbeing
Benefits/perks may vary by location. Healthcare insurance (Medical/Dental/Vision): 100% paid for employees 12 weeks paid parental leave Short-term/long-term disability plans 401k/RSP matching Onboarding stipend for home office peripherals + accessories Learning & Development allowance Learning & Development programs Quarterly stipend for Wellness, WiFi, etc. Mental health support and resources Free subscription to the Scribd Inc. suite of products Referral bonuses Book benefit Sabbaticals Company-wide events Team engagement budgets Vacation & Personal Days Paid Holidays (+ winter break) Flexible Sick Time Volunteer Day Inclusive and diverse workplace resources Access to AI Tools: free access to AI tools to boost productivity and innovation Want to learn more about life at Scribd?
https://www.linkedin.com/company/scribd/life We want our interview process to be accessible to everyone. You can inform us of any reasonable adjustments we can make to better accommodate your needs by emailing accommodations@scribd.com about the need for adjustments at any point in the interview process. Scribd is committed to equal employment opportunity regardless of race, color, religion, national origin, gender, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law. We encourage people of all backgrounds to apply, and believe that a diversity of perspectives and experiences create a foundation for the best ideas. Come join us in building something meaningful. Seniority level
Mid-Senior level Employment type
Full-time Job function
Engineering and Information Technology Industries Referrals increase your chances of interviewing at Scribd, Inc. by 2x
#J-18808-Ljbffr