TechnologySoftware Engineering

How Gong Achieves 10x Cost Savings with Pinecone Serverless for Smart Trackers

Gong is a revenue intelligence platform that analyzes billions of customer interactions to help sales teams improve performance. To power Smart Trackers—its patented AI system for detecting and classifying concepts in sales conversations—Gong adopted Pinecone as its core vector database, storing billions of sentence-level embeddings across real conversations. Migrating to Pinecone Serverless delivered a 10x reduction in infrastructure costs while sustaining peak search performance across a massive corpus.

Impact

10x

Infrastructure cost reduction

Billions

Vectors stored

Challenge

Gong needed to track and classify complex, contextually varied concepts across billions of conversation sentences without relying on brittle keyword matching—and to do so at a cost and latency that made real-time user interaction feasible.

Solution

Gong deployed Pinecone as the vector database for Smart Trackers, storing billions of sentence embeddings to enable semantic retrieval that powers an active learning loop—then migrated to Pinecone Serverless to achieve 10x cost reduction at equivalent performance.

Tools & Technologies

What Leaders Say

Users want to track different concepts that occur in conversations, and simple keywords do not work.

Jacob Eckel, VP, R&D Division Manager, Gong

Our choice to work with Pinecone wasn’t just based on technology; it was rooted in their commitment to our success. They listened, understood, and delivered beyond our expectations.

Jacob Eckel, VP, R&D Division Manager, Gong
Get the full context.

Sign up to read complete case studies, access detailed metrics, and unlock all use cases.

Full Story

Gong has built one of the richest repositories of sales conversation data in the world, analyzing calls, emails, and meetings to surface coaching insights and deal intelligence. As early as 2020, Gong was among the first revenue platforms to deploy a vector database for semantic search at scale—recognizing that keyword matching couldn’t capture the nuance and contextual variation in human conversation.

The core challenge was conceptual tracking at enterprise scale. Gong’s Smart Trackers system allows sales managers to define concepts—like a competitor mention or a pricing objection—by providing a handful of example sentences. The system then automatically retrieves semantically similar examples from billions of real conversation records to train a live classification model. Traditional keyword search fails here: conversations are dynamic, and the same concept surfaces in countless phrasings. Only vector similarity search could reliably find them.

Gong selected Pinecone as its vector database partner, storing billions of sentence-level embeddings and enabling the fast similarity searches that power Smart Trackers’ active learning loop. Users label a few sentences, the system retrieves semantically similar candidates from the corpus, and the model improves continuously through feedback. When Pinecone introduced its serverless architecture, Gong migrated and achieved a 10x cost reduction while maintaining performance at peak loads.

The business impact is direct: Smart Trackers allows Gong users to build and fine-tune concept-detection models without any technical expertise. Sales managers, not data scientists, define what matters. The system’s accuracy and speed—sustained by Pinecone’s retrieval infrastructure—make this self-service model practical at scale.

Gong’s architecture demonstrates a maturing pattern in AI product development: managed vector databases as the retrieval backbone for active learning systems. As the corpus of conversation data grows, so does the value of the underlying infrastructure—and the competitive moat built by Gong’s proprietary dataset.

Similar Cases

1
1up
10x faster
response generation speed for rfps and compliance questionnaires

1up, a sales knowledge automation platform, integrated Pinecone's vector database to power a RAG-based system that delivers real-time, highly accurate answers to complex sales queries. The solution replaced a slow, home-grown embedding system and achieved 10x faster response generation for RFPs and compliance questionnaires. Sales reps can now handle high volumes of queries with confidence, reducing reliance on colleagues and accelerating the go-to-market process.

TechnologyAAWSPPinecone
C
CustomGPT.ai
>400M
vectors stored

CustomGPT.ai built a RAG-as-a-Service platform on Pinecone storing over 400M vectors, achieving sub-20ms query latency and the #1 ranking in an independent RAG accuracy benchmark.

TechnologyPPinecone
D
Delphi
>100M
vectors stored

Delphi is an AI platform that enables coaches, creators, and experts to deploy interactive “Digital Minds”—always-on conversational agents trained on their unique content. Scaling from proof of concept to a commercial platform with thousands of customers required a vector database that could support millions of isolated namespaces, billions of vectors, and sub-second retrieval under variable load. Delphi selected Pinecone, achieving P95 query latency of 100ms and keeping retrieval under 30% of total response time—freeing the engineering team to build product rather than manage infrastructure.

TechnologyPPinecone
TX
Terminal X
0.68 to 0.91
f1 retrieval accuracy improvement

Terminal X is a vertical AI platform for institutional investors that acts as a 24/7 research agent, processing millions of financial documents for hedge funds, asset managers, and private equity firms. By rebuilding its retrieval architecture on Pinecone’s vector database, Terminal X improved F1 retrieval accuracy from 0.68 to 0.91, cut average latency by over 35%, and doubled deployment velocity. Users now save approximately three hours per day, and investment memo preparation dropped from two days to half a day.

Financial ServicesTechnologyPPinecone
Z
ZoomInfo
>50%
increase in user engagement

ZoomInfo, a B2B go-to-market intelligence platform with hundreds of millions of professional contact records, needed a vector database to power real-time personalized contact recommendations for sales and marketing teams. The company deployed Pinecone’s serverless vector database with Dedicated Read Nodes to run semantic search over 390 million contact embeddings with sub-second latency. The result was a 50% increase in user engagement, a 2x improvement in recommendation relevancy, and 50x more peak request capacity.

TechnologyPPinecone
A
Assembled
~95%
ticket handling time reduction

Assembled is a workforce management and customer support optimization platform serving enterprises like Stripe, Etsy, and DoorDash. To power Assembled Assist, the company built a hybrid RAG pipeline combining Pinecone vector search with Algolia keyword retrieval and LLMs from OpenAI and Anthropic. Support tasks that previously took 40 minutes now complete in 2 minutes—a 95% reduction in handling time.

TechnologyAAlgoliaOLOpenAI LLMs
A
Allspice
20% → 97%
ingredient matching accuracy

Allspice, a food technology startup building a kitchen operating system for consumers and recipe publishers, deployed Pinecone’s vector database to solve the inherent messiness of ingredient data that traditional text search could not handle. The implementation raised ingredient matching accuracy from roughly 20% to 97%, enabling the launch of recipe importing as a core product feature and expanding into a platform-wide semantic layer for search, recommendations, and conversational AI.

TechnologyTtext-embedding-3-largePPinecone
B
BambooHR
tens of thousands
employee questions answered

BambooHR built an AI-powered HR assistant using Cohere's Embed and Rerank models to answer employee questions accurately, saving HR teams thousands of hours while handling sensitive data securely.

TechnologyCRCohere RerankCECohere Embed