How Vanguard Uses Pinecone to Boost Customer Support with 12% More Accurate Responses

Vanguard partnered with Pinecone to build Agent Assist, an internal RAG-powered AI chat tool that helps customer support representatives find answers faster and more accurately. By replacing keyword-based search with hybrid vector retrieval, Vanguard achieved 12% more accurate search results and meaningfully reduced call times — even during high-demand periods like tax season.

Impact

12%

Search result accuracy improvement

Reduced

Customer call times

Reduced

Operational overhead during peak seasons

Challenge

Vanguard's customer support teams relied on keyword-based search that returned links to lengthy documents, forcing agents to manually hunt for answers — driving up call times, reducing satisfaction, and requiring costly seasonal hiring surges. The team needed a scalable, real-time retrieval solution capable of handling a highly dynamic financial document dataset.

Solution

Vanguard's CAI team built Agent Assist, an internal RAG-powered chat assistant using Pinecone Serverless as the vector database, combining BM25 sparse embeddings with dense embeddings for hybrid retrieval, and leveraging metadata filtering to ensure agents always access the most current documents.

Tools & Technologies

What Leaders Say

One of the reasons we chose Pinecone beyond functionality is because Pinecone was willing to work with Vanguard, specifically to meet our security control and performance requirements by creating a dedicated AWS account and cluster for us.

Hung Pham, ML Engineer at Vanguard
Get the full story.

Sign up to read complete case studies, access detailed metrics, and unlock all use cases.

Full Story

Vanguard, one of the world's largest investment management firms, has long prioritized delivering exceptional client experiences — including responsive, knowledgeable customer support. With millions of clients relying on Vanguard for retirement planning, investments, and financial advice, the quality and speed of support interactions carry real financial consequences. The company's Center for Analytics and Insights (CAI) team, operating within the Chief Data Analytics office, was tasked with modernizing how customer service representatives access information during live calls.

The core challenge was a retrieval problem. Vanguard's support teams were using keyword-based search to locate relevant financial documents, but this approach only surfaced links to lengthy source files — leaving agents to manually sift through dense content to find specific answers. This inefficiency drove up call times and eroded customer satisfaction. During peak periods like tax season, Vanguard's traditional workaround was to hire additional representatives to absorb the volume, adding significant operational cost without addressing the root cause.

To move beyond keyword search, the CAI team first experimented with JSON storage and cosine similarity-based retrieval. These early solutions proved too slow, struggled to scale with growing datasets, and frequently returned results that lacked contextual relevance. The team then evaluated a range of vector database options — including pgvector, Faiss, and Redis — before selecting Pinecone. Key decision factors included Pinecone's support for hybrid search (combining BM25 sparse embeddings with dense embeddings), real-time indexing capabilities, advanced metadata filtering for compliance, and enterprise-grade security features such as AWS PrivateLink. Pinecone also worked directly with Vanguard to provision a dedicated AWS account and cluster tailored to their security and performance requirements.

The resulting system, called Agent Assist, is an internal RAG-powered chat assistant built on top of Pinecone Serverless. Financial documents stored as HTML pages are scraped, preprocessed with a custom chunking strategy, and encoded into dual dense and sparse embeddings — with sparse embeddings trained in-house using BM25. Hybrid retrieval is configured with an Alpha value of 0.5 to balance precision across domain-specific financial terminology. To ensure agents always access current information, documents are tagged daily as "live" or "stale" using metadata filtering, with outdated documents archived to DynamoDB for regulatory compliance.

Since deploying Agent Assist, Vanguard has seen measurable gains across accuracy, efficiency, and compliance. Hybrid retrieval improved search result accuracy by over 12% compared to dense-only retrieval. Call times dropped as agents could surface precise answers in real time, and the team no longer needs to scale headcount during peak seasons to manage volume. Metadata tagging also introduced stronger audit traceability, supporting Vanguard's compliance obligations. Looking ahead, Vanguard plans to expand its use of RAG and Contextual-Aware Generation (CAG) systems, with Pinecone serving as a foundational layer in its broader AI knowledge ecosystem.

Similar Cases

IE
Intercontinental Exchange
Qualitative shift
it visibility

Intercontinental Exchange (ICE) operates global financial exchanges, clearing houses, and mortgage technology serving markets worldwide. To move beyond lagging IT metrics like SLAs and satisfaction surveys, ICE deployed Moveworks’ HelpBot on Microsoft Teams, powered by an NLU-driven Employee Experience Insights (EXI) engine that converts raw IT tickets into a prioritized action list. EXI revealed hidden pain points—including that Outlook was ICE’s top driver of IT issues—giving the IT leadership team visibility they previously couldn’t achieve with conventional analytics.

Financial ServicesMAMoveworks AI Assistant
E
Ellevest
50–70%
style guide error correction time saved

Ellevest is a women-focused financial services and investing platform founded on the premise that the financial industry was built for men and has left women behind. With 20 writers producing 150+ articles annually and an intersectional style guide that evolves with cultural norms, the team adopted Writer to enforce brand consistency and compliance at scale. Time spent correcting style guide mistakes has dropped by 50–70%, and writers across the company now ship content aligned to Ellevest’s voice without requiring intensive manual editorial review.

Financial ServicesWWriter
NA
New American Funding
360 hours
weekly hours saved

New American Funding (NAF) is one of the largest independent mortgage banks in the US, with a marketing team of nearly 60 people spanning content, performance marketing, brand communications, and social. Facing an overwhelmed content operation across a regulated industry, NAF deployed Writer’s enterprise generative AI platform to streamline content production, enforce compliance guardrails, and maintain brand voice at scale. The platform now saves the team 360 hours per week and has compressed content tasks from hours to minutes.

Financial ServicesWWriter
SB
Shinhan Bank
1.2M+
staff hours saved annually

Shinhan Bank is one of South Korea’s largest financial institutions, operating more than 800 branches and serving millions of customers. Facing a workforce burdened by repetitive compliance, loan review, and data-transfer tasks across siloed systems, the bank launched the ‘My Bot’ initiative—deploying personal digital assistants to all 14,000 employees using Automation Anywhere’s platform. The result is one of the world’s largest enterprise digital assistant deployments, saving over 1.2 million staff hours annually and automating more than 100,000 property loan reviews each year.

Financial ServicesAAAutomation Anywhere
W
WEX
~30%
developer productivity increase with github copilot

WEX, a global fintech company that processes payments for fleet management, employee benefits, and corporate spending, consolidated a fractured developer ecosystem of 300+ Azure DevOps organizations onto GitHub Enterprise and deployed GitHub Copilot across its engineering workforce of 1,700+. The result was approximately 30% higher developer productivity, ~60% ROI on Copilot licenses, and a 99% reduction in deployment cycle times.

Financial ServicesGEGitHub EnterpriseGCGitHub Copilot
CC
Chipper Cash
95%+
selfie verification accuracy

Chipper Cash, a fintech serving over five million customers across Africa, deployed a Pinecone-powered facial similarity search system to detect and block fraudulent duplicate sign-ups in real time. The solution slashed identity verification latency from up to 20 minutes down to under 2 seconds, and reduced fraudulent sign-ups by 10x across all markets.

Financial ServicesPPineconeSSnowflake
MB
Millennium bcp
2.6x higher
conversion rate lift — owned media (bigquery audiences vs. other first-party audiences)

Millennium bcp, Portugal's largest private bank, used Google Cloud's BigQuery machine learning tools to build predictive audience models for personal loan campaigns. By segmenting existing customers by propensity to borrow, the bank dramatically improved both owned and paid media performance. The result was a 2.6x higher conversion rate and a 36% drop in cost per acquisition.

Financial ServicesBBigQueryGAGoogle Analytics 4
F
Fiserv
$10M
sla penalties avoided

Fiserv built safe, scalable AI automation on UiPath Platform with built-in governance, avoiding $10M in SLA penalties and onboarding 20,000+ QSR locations on schedule.

Financial ServicesUPUiPath Platform