Professional ServicesResearch & Development

How Kantar Worldpanel Uses Databricks to Generate Market Insights Faster

Kantar Worldpanel is a leading international consumer data and market research company serving FMCG manufacturers and retailers worldwide. The company deployed Databricks’ Data Intelligence Platform to fine-tune large language models for linking paper receipt descriptions to product barcodes, automating a historically manual and resource-intensive task. Using GPT-4 for training data generation and a smaller fine-tuned model for production, Kantar automatically generated 120,000 labeled data pairs at 94% accuracy in a matter of hours.

Outcomes

94%Model accuracy
120,000 pairsTraining dataset generated
8B parametersProduction model size

Tools & Technologies

1M
MLflow
Open-source ML lifecycle platform for experiment tracking, model registry, and deployment across training frameworks.
2DA
Databricks Agent Bricks
Framework for building, evaluating, and deploying domain-specific AI agents on a lakehouse platform.
3G
GPT-4
GPT-4 is OpenAI's flagship large language model offering advanced reasoning, instruction following, and multimodal capabilities for enterprise and research applications.
4DU
Databricks Unity Catalog
Unified governance layer for managing access, lineage, and quality of data and AI assets across a lakehouse.
5DA
Databricks AI Search
Databricks AI Search enables semantic and hybrid search over enterprise data, allowing teams to compare and link documents and structured records using embedding-based retrieval.
6L
Llama
Open-weight foundation models in multiple sizes, released for research and commercial use with strong instruction-following capability.

AI Categories

Challenge

Kantar Worldpanel’s legacy receipt-matching system was rigid, required specialized skills that were scarce, and could not keep pace with demand for faster, higher-quality consumer insights—leaving manual coding teams to generate training data at a rate incompatible with scaling.

Solution

The company deployed the Databricks Data Intelligence Platform to run parallel LLM evaluations using MLflow, Databricks AI Search, and Unity Catalog, selecting GPT-4 to auto-generate 120,000 labeled training pairs before fine-tuning a smaller production model served via Databricks Agent Bricks.

Full Story

Kantar Worldpanel’s business depends on knowing precisely what products consumers purchased and when. At the core of their data pipeline is a process that links descriptions from paper receipts to standard product barcode names—a step that determines which buying signals reach client dashboards and, ultimately, what business decisions manufacturers and retailers make.

Access 399+ AI use cases, 401+ tools, and adoption signal rankings.

Source

DATABRICKS
October 2025
Original case study

Similar Cases

1K
How KPMG Achieves 60-80% Content Time Savings with Writer AI
KPMG
60-80%Time savings on derivative content creation
2P
How PwC Saves $150M with Microsoft 365 Copilot Across 200,000+ Employees
PwC
$150 millionTime Savings from Copilot
3E
How Experian Automates 35% of Customer Emails with Databricks Mosaic AI
Experian
35%Customer emails automated
4P
How PwC Uses CrewAI to Accelerate Enterprise-Scale GenAI Adoption
PwC
70%+Code generation accuracy
5BP
How Blue Pearl Uses IBM Bob to Cut Java Modernization from 30 Days to 3
Blue Pearl
~90%Reduction in Java modernization delivery time
6TA
How The AA Cuts Routine Query Time 70% with Databricks AI/BI Genie in Microsoft Teams
The AA
70%Routine query resolution time reduction
7WM
How West Monroe Cut IT Costs by $1.4M Annually with Moveworks AI
West Monroe
$1.4 millionAnnual IT support cost savings
8B
How Bordr Uses n8n to Power a $100K Relocation Business
Bordr
$100,000+Annual business revenue achieved
9T
How TaskUs Reduces Handle Time 20% with Pinecone-Powered TaskGPT
TaskUs
20%Average handle time reduction
10E
How EY Uses Elasticsearch to Power RAG for Finance Clients
EY
10–15%Accuracy improvement in document extraction
See all use cases →