How Datadog Uses OpenAI Codex for System-Level Code Review Across More Than 1,000 Engineers

Datadog integrated OpenAI Codex into its live pull request review workflows. In roughly 22% of the incidents it analyzed, Codex feedback would have caught a problem that got past human review, and more than 1,000 engineers now have access to AI-powered, system-level code analysis.

Impact

~22%: incidents preventable by AI

1,000+: engineers using Codex

Challenge

Datadog needed to scale code review beyond its senior engineers. Static analysis tools were shallow and noisy, and developers frequently ignored them.

Solution

Datadog integrated OpenAI Codex for automatic pull request review across its main repositories, analyzing code changes within the broader context of the system to detect architectural risks.

What leaders say

"For me, a Codex comment feels like the smartest engineer I've worked with, one who has unlimited time to find bugs. It understands the whole context."

Brad Carter, Engineering Manager, AI DevX Lead, Datadog

Full story

Datadog needed to scale code review beyond the senior engineers who understand the system's architecture and its interconnected risks. Traditional static analysis tools offered shallow, noisy suggestions that engineers frequently ignored.

Datadog integrated OpenAI Codex into its live development workflows, automatically reviewing pull requests across its main repositories. The system analyzes code changes within the broader context of the system rather than in isolation, identifying risks that human reviewers miss.

Codex identified more than 10 cases (approximately 22% of the incidents analyzed) in which AI feedback would have prevented problems that got past human review. More than 1,000 engineers now use Codex regularly for code review.
