Technologyengineering

How WP Engine Uses Elastic and Gemini to Power AI Search for 1.5M WordPress Users

WP Engine, the leading WordPress hosting platform serving more than 1.5 million users across 200,000 websites in 150+ countries, deployed Elastic’s Search AI Platform alongside Google Cloud Vertex AI and Gemini to build Smart Search AI and enable retrieval-augmented generation (RAG) capabilities for its customers. The integration allows WP Engine to deliver natural language search, context-aware product recommendations, and AI-powered chatbots to website owners without requiring them to stitch together multiple vendors. Response times dropped to as low as five milliseconds, and the platform handled traffic spikes from hundreds of thousands to tens of millions of queries per minute with zero downtime.

Impact

~5 milliseconds

Search query response time

Zero

Downtime during traffic spike

1.5 million+

WordPress users served

Significant

Development time savings

Challenge

WP Engine needed to deliver advanced AI search and recommendation features across 200,000+ WordPress websites with highly varied data configurations, without building complex multi-vendor AI infrastructure from scratch.

Solution

WP Engine deployed Elastic’s Search AI Platform with Google Cloud Vertex AI integration, using Elasticsearch’s vector database for semantic and hybrid search to power Smart Search AI and enable RAG applications for their customers.

Tools & Technologies

What Leaders Say

Elastic is simply the best solution for search and search hosting. It means we can offer a cutting-edge search experience that is genuinely useful for an extremely wide variety of customers and types of websites.

Luke Patterson, Senior Product Manager, WP Engine

Elasticsearch integration with Google’s AI tools means you don’t have to build complicated connections from scratch. You can focus on creating great user experiences, not wiring systems together.

Shane Daly, Principal Software Engineer, WP Engine

There was zero downtime, not a single hiccup. The Elastic Cloud infrastructure scaled up exactly as we had hoped.

Shane Daly, Principal Software Engineer, WP Engine
Get the full story.

Sign up to read complete case studies, access detailed metrics, and unlock all use cases.

Full Story

WP Engine has been the backbone of the WordPress ecosystem since 2010, providing managed hosting and developer tools to more than 1.5 million users across 200,000+ websites spanning 150 countries. As AI capabilities moved from experiments to customer expectations, WP Engine found itself at a crossroads: its clients — ranging from media companies to e-commerce brands — needed advanced AI features on their websites, but building those features across the sprawling and inconsistent landscape of WordPress configurations required a technology partner that could handle the complexity at scale.

The challenge was structural. WordPress data exists in countless configurations: different themes, plugins, custom post types, and content structures. Building consistent AI search or recommendation capabilities across this variety demanded a platform that could extract any data type, normalize it, and store it as vectors — all while maintaining the speed and reliability that web visitors expect. WP Engine’s existing infrastructure was not built for vector search or generative AI, and developing those capabilities from scratch would have consumed engineering resources that should be directed at product innovation.

WP Engine chose Elastic’s Search AI Platform as the foundation, deploying Elasticsearch’s vector database at the heart of its technology stack and integrating it with Google Cloud Vertex AI. This pairing gave WP Engine a natively integrated retrieval engine that combines the multimodal strengths of Google’s Gemini models with Elastic’s AI-powered semantic and hybrid search. The practical output was Smart Search AI — a WP Engine product feature that delivers natural language search results for any WordPress site. The same infrastructure enables RAG applications: WP Engine customers can now build AI chatbots and recommendation engines grounded in their own business data without maintaining custom pipelines.

The reliability test came when a large media customer used WP Engine’s search API in a way that sent traffic on the production server from hundreds of thousands to tens of millions of queries within minutes. The result: zero downtime. The Elastic Cloud infrastructure absorbed the spike without a single disruption. Response times across the platform stabilized at around five milliseconds — fast enough to deliver seamless real-time experiences for website visitors and protect conversion rates. “There was zero downtime, not a single hiccup,” said Shane Daly, Principal Software Engineer at WP Engine. “The Elastic Cloud infrastructure scaled up exactly as we had hoped.”

For WP Engine and its customers, the Elastic and Google Cloud integration has shifted AI deployment from a multi-month engineering project to a rapid capability rollout. The same media customer that triggered the traffic spike was previously using a custom recommendation engine integrated with a separate LLM; that same capability is now available natively through WP Engine’s platform, eliminating the need for a bespoke solution. As AI search and personalization become baseline expectations for website visitors, WP Engine is positioned as the platform that makes those capabilities accessible to organizations of any size.

Similar Cases

FD
Fifth Dimension
50x
document processing capacity increase

Fifth Dimension, a UK-based AI analytics company serving the real estate industry, migrated to Google Cloud to overcome critical infrastructure bottlenecks. By adopting Vertex AI, Cloud Run, and serverless architecture, the company achieved 50x processing scalability, 6x revenue growth, and a 30% reduction in infrastructure costs — all within a rapid growth trajectory from founding in 2023 to global scale by 2025.

TechnologyVAVertex AIPPub/Sub
Z
Zapier
89%
ai adoption rate across all employees

Zapier deployed Claude Enterprise and Claude Code company-wide, achieving 89% AI adoption across all employees — the highest adoption rate in company history. The automation platform now runs 800+ internal AI agents and saw 10x year-over-year growth in Anthropic app usage. Adoption began grassroots before a formal rollout.

TechnologyCEClaude EnterpriseCCClaude Code
S
Stairwell
40,000+ characters
security data processed per claude request

Stairwell, a cybersecurity company, integrated Claude into its Maleval threat detection platform to summarize complex security findings for analysts. Claude's large context window allows it to process 40,000+ character API responses in a single pass, converting dense technical data into clear, actionable insights with minimal prompt engineering.

TechnologyCybersecurityCClaude
I
Intuit
Higher
helpfulness rating vs. non-claude experiences

Intuit integrated Claude via Amazon Bedrock into its Intuit Assist feature within TurboTax to generate plain-language explanations of tax calculations. The integration combines Claude's natural language capabilities with Intuit's proprietary tax knowledge engine, serving millions of customers during peak tax season. The result was higher helpfulness ratings and improved completion rates for federal tax filings.

Financial TechnologyTechnologyIAIntuit AssistABAmazon Bedrock
C
Canva
65%
daily ai use adoption rate among employees

Canva deployed Claude Enterprise across its entire workforce in mid-2024 as part of a deliberate multi-vendor AI strategy. Claude quickly emerged as the standout tool, achieving a 65% daily active use rate among employees. Engineering, product, content, and business teams all adopted Claude for productivity, prototyping, and creative work.

TechnologyDesign SoftwareCEClaude Enterprise
R
reMarkable
35%
support cases autonomously resolved by agentforce agent

reMarkable built two Agentforce-powered AI agents — 'Mark' for customer support and 'Saga' for internal IT help — autonomously resolving 35% of inbound support cases and significantly reducing IT team workload, enabling the company to scale without proportional headcount growth.

TechnologyConsumer ElectronicsSSSalesforce Service CloudSDSalesforce Data Cloud
C
ChargeGuru
6 weeks
migration completed

ChargeGuru merged two legacy EV charging platforms into a single system in 6 weeks using Make, compressing typical 3-month delivery cycles to 2-day turnarounds and enabling non-technical teams to build independently.

TechnologyMMake
C
CustomGPT.ai
10,000+
paying customers served

CustomGPT.ai is a no-code RAG-as-a-Service platform enabling businesses to build domain-specific AI agents on their own data. By building its vector retrieval infrastructure on Pinecone, the company scaled to over 10,000 paying customers, stores 400+ million vectors, and delivers sub-20ms P50 query latency at 99.95%+ uptime. The result is a platform that earned the #1 ranking in a RAG accuracy benchmark, with Pinecone providing the foundation that let the engineering team focus entirely on product differentiation rather than infrastructure management.

TechnologyPPinecone