TechnologyOperations

How Ensono Uses Snowflake ML to Predict IT Failures and Cut MTTR by Up to 70%

Ensono, a managed services provider handling over 60 billion retail transactions and government platforms for 24 million constituents, built two AI-powered systems on Snowflake to shift IT operations from reactive to predictive. The Envision Predictive Engine (EPE) and DiagnoseNow application reduced mean time to resolution by 54–70%, cut major incidents by 22%, and improved SLA performance by 38% across its enterprise client base.

Impact

54–70%

Reduction in mean time to resolution (MTTR)

22%

Reduction in major incidents

38%

SLA performance improvement

< 2 minutes

Time to generate AI incident analysis

75M+ events, 9M+ alerts

Events analyzed by EPE

Challenge

Ensono’s MSP engineers managed IT environments for large enterprise clients generating millions of alerts with no reliable way to predict which would escalate to major incidents, while manual root cause analysis slowed incident resolution and data labeling for ML models required significant human effort to scale.

Solution

Ensono built the Envision Predictive Engine and DiagnoseNow on Snowflake’s AI Data Cloud, using Snowflake ML for model training and deployment, Cortex AI for GPT-powered data labeling and automated root cause analysis, and Streamlit in Snowflake for the engineer-facing incident resolution interface integrated with ServiceNow.

Tools & Technologies

What Leaders Say

When EPE proposes major incidents, the MTTR is 54% lower.

Jim Piazza, Chief AI Officer, Ensono

We’ve reached new heights in terms of customer satisfaction. Eighty percent of our clients recommend us to other customers. That’s a tangible measure of the quality of the delivery we provide to every one of our clients.

Jim Piazza, Chief AI Officer, Ensono

We recognized early on that Snowflake had a unique value to our business. Partly because it holds so much of our data, but also because of its extensive capabilities for building, hosting and running inference against machine learning models.

Jim Piazza, Chief AI Officer, Ensono

We wanted to deploy models as quickly as possible. And with Snowflake ML, we don’t have to worry about creating or finding another model hosting platform because we can use the Model Registry to manage and deploy models for inference with our existing pipelines.

John Stamford, Vice President, Data Science and Machine Learning, Ensono
Get the full context.

Sign up to read complete case studies, access detailed metrics, and unlock all use cases.

Full Story

Ensono operates as a managed services provider for large enterprise clients whose IT environments span hundreds of servers, thousands of SaaS accounts, and terabytes of operational data. The company supports critical infrastructure at scale—processing over 60 billion retail transactions and giving 24 million constituents access to government platforms. At that scale, a single misclassified ticket or delayed incident response doesn’t just affect one client: it ripples across dozens of complex environments where downtime has direct financial and operational consequences.

The traditional MSP model of monitor-alert-respond was structurally inadequate. Engineers received floods of alerts with no reliable mechanism to identify which were true precursors to major incidents and which were noise. Data labeling for model training was a manual, dashboard-intensive process that was difficult to scale. And when incidents did occur, root cause analysis required time-consuming manual investigation before the right fix could be applied. Ensono’s Chief AI Officer Jim Piazza set a specific goal: shift the operating model to prevent-predict-optimize.

Ensono built two systems using Snowflake’s AI Data Cloud. The first, Envision Predictive Engine (EPE), is an ML model that ingests data from millions of events and alerts across client environments, estimates each support ticket’s probability of becoming a service-impacting event, and surfaces high-priority tickets as ServiceNow popup notifications for frontline engineers. GPT models accessed via Snowflake Cortex AI automated the historically manual data labeling process, saving engineering hours at scale. The second system, DiagnoseNow, built using Streamlit in Snowflake and Cortex AI, automates root cause analysis by pulling case-specific details, event timelines, error summaries, and recommended actions—all within under two minutes of a request being initiated.

The results across both systems are concrete. When EPE flags a major incident, MTTR is 54% lower than baseline. In some cases, DiagnoseNow pilot testing showed MTTR reductions as high as 70%. Combined, the two systems have helped Ensono spot over 1,700 issues and reduce major incidents by 22%. SLA performance improved by 38%, and the company’s NPS-equivalent metric shows 80% of clients actively recommend Ensono to other organizations—a direct indicator of the operational improvement clients experience.

Ensono is continuing to expand the AI layer. The team is adopting Snowpark Container Services to accelerate DiagnoseNow response times, and Snowflake-managed Model Context Protocol (MCP) servers are planned for the next phase of the decision engine. For Piazza, the direction is clear: “Having more specialized models working together, with different systems, greatly improves the quality of outcomes. It’s like having a team of experts on demand.”

Similar Cases

I
Intercom
$1.4M
annual savings from sales team efficiency

Intercom, the AI-first customer service platform, built a Sales Cockpit on Snowflake’s AI Data Cloud powered by Cortex AI to give sales reps a unified view of customer data and AI-generated insight decks. The tool saves more than 2,000 hours per month across the sales organization, equivalent to $1.4 million in annual savings, and reduced the time to produce customer insight reports by 96%.

TechnologySSnowflakeSCSnowflake Cortex AI
A
Allspice
20% → 97%
ingredient matching accuracy

Allspice, a food technology startup building a kitchen operating system for consumers and recipe publishers, deployed Pinecone’s vector database to solve the inherent messiness of ingredient data that traditional text search could not handle. The implementation raised ingredient matching accuracy from roughly 20% to 97%, enabling the launch of recipe importing as a core product feature and expanding into a platform-wide semantic layer for search, recommendations, and conversational AI.

TechnologyTtext-embedding-3-largePPinecone
P
Pfizer
93%
database reduction

Pfizer achieved a 93% database reduction and 20% cost avoidance by migrating their global SAP environment to S/4HANA on IBM Power10 infrastructure.

PharmaceuticalsTechnologyICIBM ConsultingIPIBM Power Virtual Server
J
Jamf
Under 45 minutes
performance review skill build time

Jamf deployed Claude Enterprise across 16 departments, then built interactive workflow skills using Claude Cowork that transformed manual spreadsheet-based processes into guided, conversational experiences. Performance reviews that previously required months of effort are now built in under 45 minutes, and non-engineering teams independently create custom data dashboards.

TechnologyCEClaude EnterpriseCCClaude Cowork
R
Rappi
40%
search response latency reduction

Rappi, Latin America’s fastest-growing on-demand delivery app serving over 300 cities, replaced its keyword-based search engine with Oracle AI Vector Search and Oracle Cloud Infrastructure Generative AI to enable semantic and image-based product discovery. The upgrade reduced search response latency by 40% and improved conversion rate by 25%, driving higher engagement and order volumes across the platform.

TechnologyOAOracle AI Vector SearchOAOracle Autonomous AI Database
C
Confluent
15,000+
hours saved monthly

Confluent, a data streaming platform company with 2,000+ employees and 4,000+ customers, deployed Glean to solve the knowledge fragmentation that came with rapid growth from 250 to 2,000+ employees across 20+ systems. Glean indexed the company's full tool stack — Slack, Salesforce, Confluence, and more — enabling instant knowledge retrieval across all teams. The result: 15,000+ hours saved monthly, a 13% increase in support team satisfaction, and over 70% employee adoption.

TechnologyGGlean
H
Headstart
90–97%
code written by claude

Headstart, an AI-native software studio, uses Claude 3.5 Sonnet to write 90-97% of client code, compressing enterprise software project timelines from months to weeks and delivering 10-100x development speed.

TechnologyC3Claude 3.5 Sonnet
L
Lusha
300%
increase in outbound leads

Lusha is a B2B sales intelligence platform with 1.5 million users and a database of over 200 million business contacts. By deploying Elasticsearch as both a full-text search engine and a vector database for AI-powered lead recommendations, Lusha helps customers generate 300% more leads, achieve conversion rates up to 10x higher, and realize return on investment of up to 1,000%.

TechnologyEElasticsearch