TechnologyProduct Development

How Mutiny Uses Claude to Give Every Sales Rep a Full Creative Team

Mutiny, a GTM automation startup, rebuilt its entire platform around Claude after Anthropic’s Opus outperformed competitors on its internal design evaluation benchmark. The agent-first architecture lets sales reps describe what they need and Claude generates fully branded assets in a single shot from just a website URL. Design satisfaction scores improved 3x after making Claude Opus the default model, and asset creation is now 4.5x faster than previous workflows.

Impact

3x

Improvement in design satisfaction

4.5x

Faster asset creation for sales teams

9 out of 10

Sales reps reporting competitive edge

Challenge

Early LLM integrations were siloed — each AI capability required narrow guardrails and could not be combined dynamically, preventing Mutiny from delivering a fully autonomous, multimodal creative agent.

Solution

Mutiny re-architected its platform around Claude Opus as the default model, building an agent-first experience with tool-based access for research, brand building, and design. An LLM-native data model aligned to frameworks Claude performs well on enables fully branded asset generation in a single shot.

Tools & Technologies

What Leaders Say

We really saw an opportunity to re-architect the whole system and make this a very agent-first experience. The agent is now multimodal and has tool-based access to do research, build the brand, and design the entire experience.

Nikhil Mathew, Co-founder and CTO, Mutiny

It was the highest benchmark we had achieved on our internal design eval. Not only are we seeing high-quality outputs, but a much wider range of those outputs as well.

Nikhil Mathew, Co-founder and CTO, Mutiny
Get the full context.

Sign up to read complete case studies, access detailed metrics, and unlock all use cases.

Full Story

Mutiny builds tools that help GTM teams create the assets they need to generate pipeline and close deals — personalised pitch decks, deal rooms, and business cases generated in the customer’s own brand. The vision was always to eliminate the bottleneck of waiting on marketing or design.

For years, that vision was constrained. Early LLM integrations were siloed: AI could pull brand styling, generate text variations, or conduct research, but each task required specific guardrails and could not be combined dynamically.

The arrival of Claude Opus changed what Mutiny decided to build. Co-founder and CTO Nikhil Mathew saw an opportunity to re-architect the system as an agent-first experience — multimodal, with tool-based access to research, build brand assets, and design the full output. Mutiny built a two-part benchmark to evaluate models: tool-call accuracy, and a creative evaluation using vision capabilities to score outputs across brand alignment, typography, colour, layout, and originality.

After intensive testing with Anthropic’s team, Claude achieved the highest benchmark the company had recorded. Mutiny rebuilt its creation experience around an LLM-native data model and made Claude Opus the default model.

Design satisfaction — measured through in-app user feedback — improved 3x. Asset creation is now 4.5x faster than previous workflows. Nine out of ten sales reps using Mutiny report the product gives them a competitive edge. The agent generates fully branded pitch decks, deal rooms, and business cases from a single website URL, in one shot.

Similar Cases

I
Intuit
Higher
helpfulness rating vs. non-claude experiences

Intuit integrated Claude via Amazon Bedrock into its Intuit Assist feature within TurboTax to generate plain-language explanations of tax calculations. The integration combines Claude's natural language capabilities with Intuit's proprietary tax knowledge engine, serving millions of customers during peak tax season. The result was higher helpfulness ratings and improved completion rates for federal tax filings.

Financial ServicesTechnologyIAIntuit AssistABAmazon Bedrock
FD
Fifth Dimension
50x
document processing capacity increase

Fifth Dimension, a UK-based AI analytics company serving the real estate industry, migrated to Google Cloud to overcome critical infrastructure bottlenecks. By adopting Vertex AI, Cloud Run, and serverless architecture, the company achieved 50x processing scalability, 6x revenue growth, and a 30% reduction in infrastructure costs — all within a rapid growth trajectory from founding in 2023 to global scale by 2025.

TechnologyGCGoogle Cloud Pub/SubGCGoogle Cloud Run
S
Stairwell
40,000+ characters
security data processed per claude request

Stairwell, a cybersecurity company, integrated Claude into its Maleval threat detection platform to summarize complex security findings for analysts. Claude's large context window allows it to process 40,000+ character API responses in a single pass, converting dense technical data into clear, actionable insights with minimal prompt engineering.

CybersecurityTechnologyCClaude
A
ASAPP
91%
first-call resolution rate

ASAPP is an AI-native customer service platform that orchestrates large language models to automate contact center interactions for enterprise clients. By deploying Anthropic’s Claude through Amazon Bedrock, ASAPP eliminated its homegrown PII redaction layer and reduced call escalations by up to 40%, while helping clients achieve a 91% first-call resolution rate. The platform now automates more than 90% of contact center interactions, with human agents freed to handle three times the volume of complex cases.

TechnologyABAmazon BedrockCClaude
L
Lindy
10x
customer growth

Lindy's AI agent platform is built on Claude, enabling 10x customer growth, 72% reduction in time-to-qualified-lead, and handling 70%+ of routine support tickets.

TechnologyCClaude
G
GitLab
25–50%
productivity gains across internal workflows

GitLab is the most comprehensive DevSecOps platform, supporting the entire software development lifecycle for enterprises worldwide. The company integrated Claude 3 models across its AI-powered Duo feature set—covering code generation, interactive chat, planning summarization, and vulnerability remediation—to deliver AI capabilities that align with its commitments to stability, security, and privacy. Teams report 25–50% productivity gains across internal workflows, with AI feature development now measured in weeks rather than years.

TechnologyCClaude
A
Assembled
~95%
ticket handling time reduction

Assembled is a workforce management and customer support optimization platform serving enterprises like Stripe, Etsy, and DoorDash. To power Assembled Assist, the company built a hybrid RAG pipeline combining Pinecone vector search with Algolia keyword retrieval and LLMs from OpenAI and Anthropic. Support tasks that previously took 40 minutes now complete in 2 minutes—a 95% reduction in handling time.

TechnologyAAlgoliaOLOpenAI LLMs
A
Assembled
20%
increase in customer satisfaction

Assembled is a support operations platform serving enterprise customers including Stripe, Robinhood, and Warner Brothers, coordinating AI agents and human support staff through a unified interface. By deploying Claude as the reasoning engine for Assembled Assist, the company automated more than half of support cases while maintaining customer satisfaction above 90%. A multi-model architecture built around Claude also provided resilience during a competitor outage, with Assembled migrating all LLM workflows in under twenty minutes.

TechnologyCClaude