TechnologyCustomer Service

How Assembled Cuts Support Response Time 95% with Pinecone RAG

Assembled is a workforce management and customer support optimization platform serving enterprises like Stripe, Etsy, and DoorDash. To power Assembled Assist, the company built a hybrid RAG pipeline combining Pinecone vector search with Algolia keyword retrieval and LLMs from OpenAI and Anthropic. Support tasks that previously took 40 minutes now complete in 2 minutes—a 95% reduction in handling time.

Outcomes

~95%Ticket handling time reduction
2 minutesPost-AI task completion time

Tools & Technologies

1OL
OpenAI LLMs
Suite of large language models by OpenAI powering text generation, reasoning, and conversational AI.
2C
Claude
Anthropic's AI assistant for analysis, writing, and reasoning tasks.
3A
Algolia
Search and discovery platform by Algolia offering fast, relevance-tuned search APIs for websites and apps.
4P
Pinecone
Managed vector database by Pinecone for real-time semantic search and similarity matching at scale.

AI Categories

Challenge

Support agents lacked fast access to accurate answers, requiring up to 40 minutes per ticket to search knowledge bases and draft responses manually—a process that couldn’t scale as client support volumes grew.

Solution

Assembled built Assembled Assist, a RAG pipeline powered by Pinecone for semantic vector retrieval and Algolia for keyword search, fused via Reciprocal Rank Fusion and completed by OpenAI and Anthropic LLMs to generate ticket responses in seconds.

Full Story

Assembled helps enterprise support teams at companies like Stripe, Etsy, and DoorDash run more efficiently, providing tools for workforce management, performance tracking, and ticket resolution. As AI began reshaping customer service expectations, Assembled saw an opportunity to close a persistent gap: support agents were spending too much time searching for accurate answers rather than delivering them.

Access 449+ AI use cases, 414+ tools, and adoption signal rankings.

Source

PINECONE
January 2025
Original case study

Similar Cases

1R
How Rakuten Uses Claude Code to Cut Feature Delivery from 24 to 5 Days
Rakuten
79%Reduction in average time to market for new features
2PA
How Palo Alto Networks Saves 351K Hours with Moveworks AI
Palo Alto Networks
351,000 hoursEmployee productivity hours saved
3H
How Hostinger Uses Claude to Build Websites from Natural Language
Hostinger
Minutes vs. daysWebsite creation time
4A
How Airtree Uses Claude Cowork to Automate VC Research & Reporting
Airtree
Reduced from 2 days to minutesMarket & competitor research time
5M
How MagicSchool Uses Claude to Reduce Teacher Burnout at Scale
MagicSchool
7 millionEducators using platform
6A
How Anything Uses Claude to Power a No-Code App Builder for 1.5M Users
Anything
800,000+Apps created by users
7N
How Notion Built Agent Orchestration on Claude to Cut Costs 90%
Notion
90%Infrastructure cost reduction via prompt caching
8J
How Jamf Uses Claude to Automate Workflows Across 16 Departments
Jamf
Under 45 minutesPerformance review skill build time
9C
How Cognition Tripled Merged PRs Per Week Using Claude to Power Devin, Its Autonomous AI Engineer
Cognition
3.5×Increase in merged PRs per week after adopting Claude Sonnet 3.6
10P
Pfizer Migrates to SAP S/4HANA on IBM Power10
Pfizer
93%Database reduction
See all use cases →