How Pinterest Delivers 10M AI Recommendations per Second on AWS

Pinterest built an AI-powered discovery engine on AWS processing 18TB daily, delivering 10 million AI recommendations per second across 10,000+ GPU instances, driving 17% revenue growth and 70% AI-driven discovery.

Impact

17% YoY

Revenue Growth

70%

AI-Driven Discovery

11%

MAU Growth

10M per second

AI Recommendations

Challenge

Needed to process billions of images and deliver personalized recommendations from 500+ petabytes while maintaining user trust at massive scale.

Solution

Built microservices architecture on AWS with 10,000+ GPU instances, Pinterest Canvas diffusion model, visual search recognizing 2.5B objects, and voice-enabled AI assistant.

Tools & Technologies

What Leaders Say

For over a decade, we have been leveraging AI to craft a uniquely positive online experience, striving to make every moment on Pinterest additive, not addictive.

Kartik Paramasivam, Chief Architect, Pinterest
Get the full context.

Sign up to read complete case studies, access detailed metrics, and unlock all use cases.

Full Story

Pinterest needed to evolve its AI-powered discovery system to process billions of images and deliver personalized recommendations from 500+ petabytes of data without compromising user trust or prioritizing addictive engagement.

Pinterest implemented a microservices architecture on AWS with Amazon EKS for rapid AI deployment, 10,000+ EC2 G5 instances for inference, and 600+ P4/P4de instances for training. Key innovations include Pinterest Canvas (latent diffusion for image generation), visual search recognizing 2.5 billion objects via SageMaker, and Pinterest Assistant (voice-enabled conversational AI). The infrastructure processes 18 terabytes of data daily.

Results: 17% year-over-year revenue growth, 11% increase in monthly active users, 230 basis point improvement in search fulfillment, and 70% of user discovery now AI-driven.

Similar Cases

P
Postman
Up to 1,150/year
developer hours saved

Postman selected Claude Opus 4.6 as the default model for Agent Mode, saving developers up to 1,150 hours per year and nearly $1M annually for a 10-person team in API development automation.

TechnologyCAClaude APIABAmazon Bedrock
A
ASAPP
91%
first-call resolution rate

ASAPP is an AI-native customer service platform that orchestrates large language models to automate contact center interactions for enterprise clients. By deploying Anthropic’s Claude through Amazon Bedrock, ASAPP eliminated its homegrown PII redaction layer and reduced call escalations by up to 40%, while helping clients achieve a 91% first-call resolution rate. The platform now automates more than 90% of contact center interactions, with human agents freed to handle three times the volume of complex cases.

TechnologyABAmazon BedrockCClaude
I
Intuit
Higher
helpfulness rating vs. non-claude experiences

Intuit integrated Claude via Amazon Bedrock into its Intuit Assist feature within TurboTax to generate plain-language explanations of tax calculations. The integration combines Claude's natural language capabilities with Intuit's proprietary tax knowledge engine, serving millions of customers during peak tax season. The result was higher helpfulness ratings and improved completion rates for federal tax filings.

Financial ServicesTechnologyIAIntuit AssistABAmazon Bedrock
T
Tabnine
50%
improvement in response times

Tabnine integrated Claude 3.5 Sonnet via Amazon Bedrock into its AI coding assistant, serving over 1 million monthly developers. The migration delivered 50% faster response times, a 20% increase in free-to-paid conversions, and a 20-30% reduction in churn—while meeting strict security and compliance requirements for regulated industries.

TechnologyABAmazon BedrockCClaude
B
BambooHR
tens of thousands
employee questions answered

BambooHR built an AI-powered HR assistant using Cohere's Embed and Rerank models to answer employee questions accurately, saving HR teams thousands of hours while handling sensitive data securely.

TechnologyCRCohere RerankCECohere Embed
P
Pfizer
93%
database reduction

Pfizer achieved a 93% database reduction and 20% cost avoidance by migrating their global SAP environment to S/4HANA on IBM Power10 infrastructure.

PharmaceuticalsTechnologyICIBM ConsultingIPIBM Power Virtual Server
BO
Blue Origin
2,700+
ai agents deployed

Blue Origin deployed 2,700+ AI agents with 70% company-wide adoption, achieving a 90% reduction in hardware development time using Amazon Bedrock.

Aerospace & DefenseManufacturingALAWS LambdaAEAmazon EKS
C
Confluent
15,000+
hours saved monthly

Confluent, a data streaming platform company with 2,000+ employees and 4,000+ customers, deployed Glean to solve the knowledge fragmentation that came with rapid growth from 250 to 2,000+ employees across 20+ systems. Glean indexed the company's full tool stack — Slack, Salesforce, Confluence, and more — enabling instant knowledge retrieval across all teams. The result: 15,000+ hours saved monthly, a 13% increase in support team satisfaction, and over 70% employee adoption.

TechnologyGGlean