How Notion Built Agent Orchestration on Claude to Cut Costs 90%

Notion is a collaborative AI workspace used by millions of people, from individuals to Fortune 100 companies, where teams organize knowledge, manage projects, and now delegate real work to AI agents. After deploying Claude to power AI writing, search, and database features across the product, Notion extended to agent orchestration using Claude Managed Agents — letting teams kick off dozens of concurrent tasks from a single board and receive finished deliverables, from code to client presentations. Prompt caching alone cut Notion’s infrastructure costs by 90% and latency by up to 85%.

Impact

90%

Infrastructure cost reduction via prompt caching

Up to 85%

Latency reduction via prompt caching

30+

Concurrent agent tasks from single task board

35%

Information search time reduction for Osaka Gas

10 minutes across 300 daily queries

Time saved per search for Remote

$35,000+

Annual AI tool cost savings for dbt Labs

Challenge

Notion’s knowledge base was difficult to search without AI, and even after deploying AI features, agent interactions were isolated to single users with no shared visibility, approval flows, or collaborative interface for team-level agent work.

Solution

Notion integrated Claude for enterprise search, writing assistance, and database autofill, then built agent orchestration on Claude Managed Agents so teams could delegate complex tasks from shared task boards and receive deliverables including code, presentations, and websites.

Tools & Technologies

What Leaders Say

We want Notion to be the best place for teams to work with agents and get things done. We integrated Claude Managed Agents, which can handle long-running sessions, manage memory, and deliver high-quality outputs over time, to make that possible.

Eric Liu, Product Manager, Notion

We saw that customers were willing to jump through hoops to have a native experience of agents within Notion, and Claude was the one people wanted most.

Eric Liu, Product Manager, Notion

Prompt caching makes Notion AI faster and cheaper, all while maintaining quality. This enables us to create a more responsive user experience for our customers.

Simon Last, Co-founder, Notion

We’ve found that Opus 4.6 excels at interpreting what users actually want, producing shareable content on the first try.

Sarah Sachs, AI Lead Engineer, Notion
Get the full story.

Sign up to read complete case studies, access detailed metrics, and unlock all use cases.

Full Story

Notion started as a workspace where knowledge lives: meeting notes, product specs, process guides, and project docs. As the product grew to serve millions of users including major enterprises, the first AI challenge was making all of that knowledge findable. Customer support teams needed troubleshooting steps on demand. Sales reps needed to find process documentation without waiting. Product designers needed brand guidelines without searching manually. Claude became the engine for Notion’s Enterprise Search, AI Writer, and Autofill features, helping users query their entire workspace in natural language and get answers drawn from connected apps and internal documents.

But as AI agents became capable of producing real work, a second and more ambitious challenge emerged: most agent interactions were one-to-one, a single person with a single agent on a single machine. There was no shared visibility, no approval workflows, no way for colleagues to step in and review or iterate together. Notion wanted to bring agents into its collaborative model. As Product Manager Eric Liu described it: powerful agents were being built for every vertical slice of work — why not bring them all into Notion?

Notion selected Claude Opus for its agent layer after testing multiple providers. Co-founder Simon Last cited Claude’s response quality and instruction-following as decisive, especially for use cases where tone and feel matter. AI Lead Engineer Sarah Sachs highlighted Opus 4.6’s ability to interpret intent accurately and produce shareable outputs on the first attempt. Claude Managed Agents provided the infrastructure for long-running sessions, memory management, and high-quality outputs over time — without requiring Notion to build a custom agent runtime.

The result is a workflow that turns Notion’s existing task boards into an agent dispatch system. A team creates a task, moves it to “ready to start,” and Notion invokes a Claude session that picks up context from connected pages, API docs, design systems, and product requirements. For engineering teams, this produces prototypes and code changes. For non-technical teams, it generates presentations, brand strategy decks, and sample websites. Liu described kicking off 30 prototype tasks at once and returning to find them all completed. On the infrastructure side, prompt caching cut Notion’s costs by 90% and latency by up to 85% across the millions of daily AI interactions.

The measurable enterprise impact has been significant. Osaka Gas reduced time spent searching for information by 35%. Remote saves an estimated 10 minutes per search across 300 daily queries. dbt Labs eliminated the need for separate AI tools, saving over $35,000 annually. For new employees, the AI assistant is used 10-20 times daily in their first weeks. Notion is now building toward a workspace where the same interfaces humans use for collaboration — task boards, suggested edits, version history — also serve as the interface for directing agent work.

Similar Cases

S
Sentry
1 million+
root cause analyses processed annually

Sentry is a software monitoring platform that ingests billions of events daily, giving development teams deep context to debug production issues. After deploying Claude for root cause analysis through their Seer agent, Sentry extended the workflow using Claude Managed Agents to generate merge-ready pull requests automatically — closing the loop from detection to fix without custom agent infrastructure. The result: over 1 million root cause analyses processed annually and reviews on over 600,000 pull requests each month, shipped by a single engineer in weeks instead of months.

SoftwareCMClaude Managed Agents
C
Classmethod
up to 90%
reduction in development time

Classmethod, a leading Japanese cloud integrator, deployed Claude Code across its engineering teams to address chronic developer shortages. The tool automated code generation, review, and testing workflows, reducing development time by up to 90% on specific tasks and cutting code review time by 80%.

SoftwareCCClaude Code
T
Tabnine
50%
improvement in response times

Tabnine integrated Claude 3.5 Sonnet via Amazon Bedrock into its AI coding assistant, serving over 1 million monthly developers. The migration delivered 50% faster response times, a 20% increase in free-to-paid conversions, and a 20-30% reduction in churn—while meeting strict security and compliance requirements for regulated industries.

SoftwareCClaudeABAmazon Bedrock
F
Factory
550,000 hours
development time saved

Factory built autonomous AI agents called Droids using Claude 3 Opus and Claude 3 Haiku to automate labor-intensive software engineering tasks at scale. These Droids handle code review, documentation, refactors, migrations, and feature requests across the entire software development lifecycle. Customers using Factory Droids saved 550,000 hours of development time and saw a 20% reduction in development cycle time.

SoftwareC3Claude 3 OpusC3Claude 3 Haiku
A
Apollo
35%
increase in meeting bookings

Apollo integrated Claude 3.5 Haiku into its sales engagement platform to power intelligent, personalized messaging and prospect research at scale. The AI-driven features help sales teams write highly effective outreach without requiring technical expertise, generating over 5 million Claude-powered messaging actions per month. Customers using Claude-powered messaging saw a 35% boost in meeting bookings and a 15% increase in retention rates.

Sales TechnologySoftwareC3Claude 3.5 Haiku
A
Attention
1.6 million hours
admin hours automated

Attention built an AI-powered sales platform using Claude as its core reasoning engine, automating post-call admin work and delivering actionable sales intelligence at scale. By replacing manual CRM updates, follow-up emails, and coaching reviews with Claude-driven agents, Attention has saved over 1.6 million hours of admin work. Customers report up to 40% improvements in win rates thanks to AI outputs accurate enough to trust in live deals.

SoftwareC(Claude (Haiku)CSClaude Sonnet
Y
Yoodli
23%
increase in deals closed by reps practicing 3+ scenarios/week

Yoodli is an AI-powered experiential learning platform that helps enterprise sales teams practice high-stakes conversations before they happen. By integrating Claude into its real-time roleplay engine, Yoodli delivers realistic AI personas that coach reps at scale — helping customers like Snowflake and Google Cloud achieve measurable performance gains.

SoftwareCHClaude Haiku 4.5CSClaude Sonnet
C
CustomGPT.ai
>400M
vectors stored

CustomGPT.ai built a RAG-as-a-Service platform on Pinecone storing over 400M vectors, achieving sub-20ms query latency and the #1 ranking in an independent RAG accuracy benchmark.

SoftwarePPinecone