How Scribd Cut GenAI Costs 90% and Boosted Sign-Ups with Databricks
Scribd, Inc. operates three content brands — Scribd, SlideShare, and Everand — and manages a global library of more than 250 million documents, audiobooks, and eBooks. After fragmenting its AI development across multiple disconnected tools, the company consolidated onto the Databricks Data Intelligence Platform to run the full lifecycle of data and AI in one environment. The result: a 90% reduction in generative AI costs, a 7% lift in new user sign-ups, and the ability to move from prototype to production in weeks rather than months.
Tools & Technologies
1AI Categories
Challenge
Scribd’s data infrastructure was fragmented across separate tools for ETL, LLM experimentation, and model serving, creating slow feedback loops between data scientists and production teams and making it nearly impossible to operationalize AI efficiently across the business.
Solution
Scribd consolidated onto the Databricks Data Intelligence Platform, using Databricks Notebooks, Mosaic AI model serving, Delta Lake, Unity Catalog, and Lakeflow Jobs to run the full data and AI lifecycle in one environment — enabling the team to go from prototype to production in weeks while cutting generative AI operating costs by 90%.
Full Story
Scribd, Inc. connects millions of users to knowledge across three distinct platforms: Scribd, a user-powered library; SlideShare, a repository of presentations; and Everand, a subscription service for audiobooks and eBooks. The company’s content library has grown to more than 250 million pieces — multilingual, media-rich, and highly varied in format and quality. At that scale, ensuring content is discoverable, properly tagged, and free of low-quality material is not a manual task.
Access 390+ AI use cases, 392+ tools, and adoption signal rankings.