How Datadog Uses OpenAI Codex for System-Level Code Review Across 1,000+ Engineers

Datadog integrated OpenAI Codex into live PR review workflows, catching 22% of incidents that passed human review and enabling 1,000+ engineers with AI-powered system-level code analysis.

Impact

~22%

Incidents Preventable by AI

1,000+

Engineers Using Codex

Challenge

Needed to scale code review beyond senior engineers. Static analysis tools were shallow and noisy, frequently ignored by developers.

Solution

Integrated OpenAI Codex for automatic PR review across major repositories, analyzing code changes within broader system context to surface architectural risks.

Tools & Technologies

What Leaders Say

For me, a Codex comment feels like the smartest engineer I have worked with and who has infinite time to find bugs.

Brad Carter, Engineering Manager, AI DevX Lead, Datadog
Get the full story.

Sign up to read complete case studies, access detailed metrics, and unlock all use cases.

Full Story

Datadog needed to scale code review beyond senior engineers who understand system architecture and interconnected risks. Traditional static analysis tools provided shallow, noisy suggestions that engineers often ignored.

Datadog integrated OpenAI Codex into live development workflows, automatically reviewing pull requests across major repositories. The system analyzes code changes within broader system context rather than in isolation — surfacing risks that human reviewers miss.

Codex identified more than 10 cases (approximately 22% of incidents examined) where AI feedback would have prevented issues that passed human review. Over 1,000 engineers now use Codex regularly for code review.

Similar Cases

D
Delphi
>100M
vectors stored

Delphi is an AI platform that enables coaches, creators, and experts to deploy interactive “Digital Minds”—always-on conversational agents trained on their unique content. Scaling from proof of concept to a commercial platform with thousands of customers required a vector database that could support millions of isolated namespaces, billions of vectors, and sub-second retrieval under variable load. Delphi selected Pinecone, achieving P95 query latency of 100ms and keeping retrieval under 30% of total response time—freeing the engineering team to build product rather than manage infrastructure.

TechnologyPPinecone
N
Notion
Millions
notion ai users reached

Notion, the connected workspace platform used by millions worldwide, integrated Cohere Rerank into its search pipeline to power Notion AI’s search accuracy across multilingual enterprise workspaces. Every search and Notion AI interaction now routes through Cohere Rerank, delivering dramatically improved relevance while cutting the cost and complexity of embedding-based retrieval for smaller workspaces.

TechnologyCRCohere Rerank
F
Fujitsu
World-class score
jglue benchmark performance

Fujitsu, the global IT and digital transformation company with 124,000 employees, partnered with Cohere to develop Takane — a state-of-the-art Japanese large language model built on the Cohere Command series. Designed for private deployment in regulated sectors such as finance, healthcare, and government, Takane delivers world-class performance on the JGLUE benchmark and is now integrated into Fujitsu’s AI service offerings and data intelligence platform.

TechnologyCCCohere Command
PA
Palo Alto Networks
351,000 hours
employee productivity hours saved

Palo Alto Networks, the global cybersecurity leader with nearly 15,000 employees, deployed Moveworks as an AI Assistant named Sheldon to deliver autonomous support across Slack, email, and ServiceNow. The platform resolves 4,000 IT and HR issues per month while saving 351,000 employee hours, enabling the company to scale its hybrid FLEXWORK model without adding headcount.

TechnologyMMoveworks
PS
Pure Storage
30+ minutes
time saved per search

Pure Storage, a Santa Clara-based enterprise data storage company, deployed Glean to unify knowledge access across Jira, GitHub, and internal wikis for teams spanning engineering, legal, and customer support. The AI-powered search platform cuts information-retrieval time by more than 30 minutes per search and enables employees to build custom GenAI applications in as little as 5 minutes, while boosting overall employee satisfaction scores by 39 points.

TechnologyGGlean
C
CoreWeave
2–5 days (down from 4–8 days)
mean time to resolution

CoreWeave, a global AI cloud provider serving top AI labs and enterprises, deployed Cohere’s North agentic AI platform to overhaul its Slack-based customer support workflow in 90 days. North automated ticket triage, context gathering, and routing recommendations, cutting mean resolution time from 4–8 days to 2–5 days while sustaining customer satisfaction scores between 4.9 and 5.0.

TechnologyCNCohere North
S
Salesforce
20%
productivity increase

Salesforce, the world’s leading CRM company, deployed Writer across more than 3,000 employees spanning marketing, communications, product, and customer success. Using Writer’s AI Studio no-code builder and Knowledge Graph RAG, teams create and launch custom agents in minutes without engineering support. Users report a 20% productivity gain—equivalent to reclaiming one full workday per week—with 78% saying the platform positively affects their daily work.

TechnologyWWriter
FD
Fifth Dimension
50x
document processing capacity increase

Fifth Dimension, a UK-based AI analytics company serving the real estate industry, migrated to Google Cloud to overcome critical infrastructure bottlenecks. By adopting Vertex AI, Cloud Run, and serverless architecture, the company achieved 50x processing scalability, 6x revenue growth, and a 30% reduction in infrastructure costs — all within a rapid growth trajectory from founding in 2023 to global scale by 2025.

TechnologyVAVertex AIPPub/Sub