How Pinterest Delivers 10M AI Recommendations per Second on AWS
Pinterest built an AI-powered discovery engine on AWS processing 18TB daily, delivering 10 million AI recommendations per second across 10,000+ GPU instances, driving 17% revenue growth and 70% AI-driven discovery.
Impact
17% YoY
Revenue Growth
70%
AI-Driven Discovery
11%
MAU Growth
10M per second
AI Recommendations
Challenge
Needed to process billions of images and deliver personalized recommendations from 500+ petabytes while maintaining user trust at massive scale.
Solution
Built microservices architecture on AWS with 10,000+ GPU instances, Pinterest Canvas diffusion model, visual search recognizing 2.5B objects, and voice-enabled AI assistant.
Tools & Technologies
What Leaders Say
“For over a decade, we have been leveraging AI to craft a uniquely positive online experience, striving to make every moment on Pinterest additive, not addictive.”
Sign up to read complete case studies, access detailed metrics, and unlock all use cases.
Full Story
Pinterest needed to evolve its AI-powered discovery system to process billions of images and deliver personalized recommendations from 500+ petabytes of data without compromising user trust or prioritizing addictive engagement.
Pinterest implemented a microservices architecture on AWS with Amazon EKS for rapid AI deployment, 10,000+ EC2 G5 instances for inference, and 600+ P4/P4de instances for training. Key innovations include Pinterest Canvas (latent diffusion for image generation), visual search recognizing 2.5 billion objects via SageMaker, and Pinterest Assistant (voice-enabled conversational AI). The infrastructure processes 18 terabytes of data daily.
Results: 17% year-over-year revenue growth, 11% increase in monthly active users, 230 basis point improvement in search fulfillment, and 70% of user discovery now AI-driven.