Financial ServicesBusiness Intelligence

How YipitData Uses Databricks Agent Bricks to Scale Data Intelligence 20x

YipitData, a data intelligence firm serving institutional investors and enterprises, processes millions of transaction records daily from credit card data, web-scraped receipts, and alternative data sources. To scale its merchant tagging and entity resolution workflows beyond what manual regex rules could handle, YipitData embedded Databricks Agent Bricks directly into its production pipelines. In a single quarter the company expanded automated company coverage from 3,000 to 60,000—a 20x increase—while achieving 92–95% tagging accuracy out of the box.

Outcomes

20x (3,000 → 60,000)Company coverage increase in one quarter
1 hour vs. 24 hoursProcessing time for 1 million records
92–95%Tagging accuracy out of the box

Tools & Technologies

1
DL
Databricks Lakebase
Databricks
2DA
Databricks Agent Bricks
Framework for building, evaluating, and deploying domain-specific AI agents on a lakehouse platform.
3DU
Databricks Unity Catalog
Unified governance layer for managing access, lineage, and quality of data and AI assets across a lakehouse.

AI Categories

Challenge

YipitData needed to tag and enrich millions of daily transaction records across thousands of merchants and companies, but its manual regex-based approach was capped by analyst throughput and struggled to handle the nuance and ambiguity in free-form text inputs at scale.

Solution

YipitData deployed Databricks Agent Bricks via SQL-accessible batch inference directly inside its existing Databricks pipelines, pairing AI Information Extraction agents with Databricks Lakebase for entity resolution, all governed by Unity Catalog to maintain data lineage and compliance without moving data outside the platform.

Full Story

YipitData sits at the intersection of financial data and machine intelligence: it converts billions of transaction records from credit card data, web-scraped receipts, and dozens of alternative data sources into actionable market intelligence for institutional investors. The company’s value proposition depends on reliably tagging each transaction to the correct merchant or company—matching messy, ambiguous vendor records to real businesses at a scale that keeps pace with daily data refreshes. As the company’s customer base and data volumes grew, the gap between what was possible and what was achievable became increasingly visible.

Access 365+ AI use cases, 380+ tools, and adoption signal rankings.

Source

DATABRICKS
June 2026
Original case study

Similar Cases

1K
How Klarna’s AI Assistant Resolves 80% of Queries in Under 2 Minutes
Klarna
80%Reduction in average customer query resolution time
2S
How Stripe Deploys Claude Code to 1,370 Engineers with Zero-Configuration Rollout
Stripe
1,370Engineers Deployed
3A
How Airtree Uses Claude Cowork to Automate VC Research & Reporting
Airtree
Reduced from 2 days to minutesMarket & competitor research time
4W
How WEX Achieved 30% Developer Productivity Gains with GitHub Copilot
WEX
~30%Developer productivity increase with GitHub Copilot
5B
How Block Gives 4,000 Employees AI-Powered Data Access via Claude and Databricks
Block
75% saving 8-10+ hoursEngineers saving time weekly
6NB
How NBIM Uses Claude Enterprise to Save 20% Time on Investment Analysis
Norges Bank Investment Management
20%Weekly time savings per employee
7F
How Fireblocks Uses Snowflake AI Agents to Handle 40-50% of Data Queries
Fireblocks
40–50%Share of data queries handled by AI agent
8F
How Finhike Uses Google Cloud and Gemini to Deliver 99.95% Availability in Volatile Markets
Finhike
99.95%Platform availability
9B
How Bradesco Uses IBM Watson to Deliver Customer Service in Seconds
Bradesco
From 10 minutes to secondsResponse time reduction
10MB
How Millennium bcp Uses BigQuery ML to Boost Loan Conversion Rates
Millennium bcp
2.6x higherConversion rate lift — owned media (BigQuery audiences vs. other first-party audiences)
See all use cases →