How Kantar Worldpanel Uses Databricks to Generate Market Insights Faster
Kantar Worldpanel is a leading international consumer data and market research company serving FMCG manufacturers and retailers worldwide. The company deployed Databricks’ Data Intelligence Platform to fine-tune large language models for linking paper receipt descriptions to product barcodes, automating a historically manual and resource-intensive task. Using GPT-4 for training data generation and a smaller fine-tuned model for production, Kantar automatically generated 120,000 labeled data pairs at 94% accuracy in a matter of hours.
Tools & Technologies
1AI Categories
Challenge
Kantar Worldpanel’s legacy receipt-matching system was rigid, required specialized skills that were scarce, and could not keep pace with demand for faster, higher-quality consumer insights—leaving manual coding teams to generate training data at a rate incompatible with scaling.
Solution
The company deployed the Databricks Data Intelligence Platform to run parallel LLM evaluations using MLflow, Databricks AI Search, and Unity Catalog, selecting GPT-4 to auto-generate 120,000 labeled training pairs before fine-tuning a smaller production model served via Databricks Agent Bricks.
Full Story
Kantar Worldpanel’s business depends on knowing precisely what products consumers purchased and when. At the core of their data pipeline is a process that links descriptions from paper receipts to standard product barcode names—a step that determines which buying signals reach client dashboards and, ultimately, what business decisions manufacturers and retailers make.
Access 399+ AI use cases, 401+ tools, and adoption signal rankings.