How Notion Built Agent Orchestration on Claude to Cut Costs 90%

Notion is a collaborative AI workspace used by millions of people, from individuals to Fortune 100 companies, where teams organize knowledge, manage projects, and now delegate real work to AI agents. After deploying Claude to power AI writing, search, and database features across the product, Notion extended to agent orchestration using Claude Managed Agents — letting teams kick off dozens of concurrent tasks from a single board and receive finished deliverables, from code to client presentations. Prompt caching alone cut Notion’s infrastructure costs by 90% and latency by up to 85%.

Impact

90%

Infrastructure cost reduction via prompt caching

Up to 85%

Latency reduction via prompt caching

30+

Concurrent agent tasks from single task board

35%

Information search time reduction for Osaka Gas

10 minutes across 300 daily queries

Time saved per search for Remote

$35,000+

Annual AI tool cost savings for dbt Labs

Challenge

Notion’s knowledge base was difficult to search without AI, and even after deploying AI features, agent interactions were isolated to single users with no shared visibility, approval flows, or collaborative interface for team-level agent work.

Solution

Notion integrated Claude for enterprise search, writing assistance, and database autofill, then built agent orchestration on Claude Managed Agents so teams could delegate complex tasks from shared task boards and receive deliverables including code, presentations, and websites.

Tools & Technologies

What Leaders Say

We want Notion to be the best place for teams to work with agents and get things done. We integrated Claude Managed Agents, which can handle long-running sessions, manage memory, and deliver high-quality outputs over time, to make that possible.

Eric Liu, Product Manager, Notion

We saw that customers were willing to jump through hoops to have a native experience of agents within Notion, and Claude was the one people wanted most.

Eric Liu, Product Manager, Notion

Prompt caching makes Notion AI faster and cheaper, all while maintaining quality. This enables us to create a more responsive user experience for our customers.

Simon Last, Co-founder, Notion

We’ve found that Opus 4.6 excels at interpreting what users actually want, producing shareable content on the first try.

Sarah Sachs, AI Lead Engineer, Notion
Get the full context.

Sign up to read complete case studies, access detailed metrics, and unlock all use cases.

Full Story

Notion started as a workspace where knowledge lives: meeting notes, product specs, process guides, and project docs. As the product grew to serve millions of users including major enterprises, the first AI challenge was making all of that knowledge findable. Customer support teams needed troubleshooting steps on demand. Sales reps needed to find process documentation without waiting. Product designers needed brand guidelines without searching manually. Claude became the engine for Notion’s Enterprise Search, AI Writer, and Autofill features, helping users query their entire workspace in natural language and get answers drawn from connected apps and internal documents.

But as AI agents became capable of producing real work, a second and more ambitious challenge emerged: most agent interactions were one-to-one, a single person with a single agent on a single machine. There was no shared visibility, no approval workflows, no way for colleagues to step in and review or iterate together. Notion wanted to bring agents into its collaborative model. As Product Manager Eric Liu described it: powerful agents were being built for every vertical slice of work — why not bring them all into Notion?

Notion selected Claude Opus for its agent layer after testing multiple providers. Co-founder Simon Last cited Claude’s response quality and instruction-following as decisive, especially for use cases where tone and feel matter. AI Lead Engineer Sarah Sachs highlighted Opus 4.6’s ability to interpret intent accurately and produce shareable outputs on the first attempt. Claude Managed Agents provided the infrastructure for long-running sessions, memory management, and high-quality outputs over time — without requiring Notion to build a custom agent runtime.

The result is a workflow that turns Notion’s existing task boards into an agent dispatch system. A team creates a task, moves it to “ready to start,” and Notion invokes a Claude session that picks up context from connected pages, API docs, design systems, and product requirements. For engineering teams, this produces prototypes and code changes. For non-technical teams, it generates presentations, brand strategy decks, and sample websites. Liu described kicking off 30 prototype tasks at once and returning to find them all completed. On the infrastructure side, prompt caching cut Notion’s costs by 90% and latency by up to 85% across the millions of daily AI interactions.

The measurable enterprise impact has been significant. Osaka Gas reduced time spent searching for information by 35%. Remote saves an estimated 10 minutes per search across 300 daily queries. dbt Labs eliminated the need for separate AI tools, saving over $35,000 annually. For new employees, the AI assistant is used 10-20 times daily in their first weeks. Notion is now building toward a workspace where the same interfaces humans use for collaboration — task boards, suggested edits, version history — also serve as the interface for directing agent work.

Similar Cases

S
Sentry
1 million+
root cause analyses processed annually

Sentry is a software monitoring platform that ingests billions of events daily, giving development teams deep context to debug production issues. After deploying Claude for root cause analysis through their Seer agent, Sentry extended the workflow using Claude Managed Agents to generate merge-ready pull requests automatically — closing the loop from detection to fix without custom agent infrastructure. The result: over 1 million root cause analyses processed annually and reviews on over 600,000 pull requests each month, shipped by a single engineer in weeks instead of months.

TechnologyCMClaude Managed Agents
P
Pfizer
93%
database reduction

Pfizer achieved a 93% database reduction and 20% cost avoidance by migrating their global SAP environment to S/4HANA on IBM Power10 infrastructure.

PharmaceuticalsTechnologyICIBM ConsultingIPIBM Power Virtual Server
A
Allspice
20% → 97%
ingredient matching accuracy

Allspice, a food technology startup building a kitchen operating system for consumers and recipe publishers, deployed Pinecone’s vector database to solve the inherent messiness of ingredient data that traditional text search could not handle. The implementation raised ingredient matching accuracy from roughly 20% to 97%, enabling the launch of recipe importing as a core product feature and expanding into a platform-wide semantic layer for search, recommendations, and conversational AI.

TechnologyTtext-embedding-3-largePPinecone
J
Jamf
Under 45 minutes
performance review skill build time

Jamf deployed Claude Enterprise across 16 departments, then built interactive workflow skills using Claude Cowork that transformed manual spreadsheet-based processes into guided, conversational experiences. Performance reviews that previously required months of effort are now built in under 45 minutes, and non-engineering teams independently create custom data dashboards.

TechnologyCEClaude EnterpriseCCClaude Cowork
R
Rappi
40%
search response latency reduction

Rappi, Latin America’s fastest-growing on-demand delivery app serving over 300 cities, replaced its keyword-based search engine with Oracle AI Vector Search and Oracle Cloud Infrastructure Generative AI to enable semantic and image-based product discovery. The upgrade reduced search response latency by 40% and improved conversion rate by 25%, driving higher engagement and order volumes across the platform.

TechnologyOAOracle AI Vector SearchOAOracle Autonomous AI Database
C
Confluent
15,000+
hours saved monthly

Confluent, a data streaming platform company with 2,000+ employees and 4,000+ customers, deployed Glean to solve the knowledge fragmentation that came with rapid growth from 250 to 2,000+ employees across 20+ systems. Glean indexed the company's full tool stack — Slack, Salesforce, Confluence, and more — enabling instant knowledge retrieval across all teams. The result: 15,000+ hours saved monthly, a 13% increase in support team satisfaction, and over 70% employee adoption.

TechnologyGGlean
H
Headstart
90–97%
code written by claude

Headstart, an AI-native software studio, uses Claude 3.5 Sonnet to write 90-97% of client code, compressing enterprise software project timelines from months to weeks and delivering 10-100x development speed.

TechnologyC3Claude 3.5 Sonnet
L
Lusha
300%
increase in outbound leads

Lusha is a B2B sales intelligence platform with 1.5 million users and a database of over 200 million business contacts. By deploying Elasticsearch as both a full-text search engine and a vector database for AI-powered lead recommendations, Lusha helps customers generate 300% more leads, achieve conversion rates up to 10x higher, and realize return on investment of up to 1,000%.

TechnologyEElasticsearch