Challenge
Since launching in 2023, Mistral AI has taken the AI scene by storm, proving that the next generation of groundbreaking AI innovation doesn’t have to be led from Silicon Valley. In an era long dominated by North American hyperscalers, tech giants, and well-funded AI labs, Mistral AI has emerged as a bold new entrepreneurial force and a major player in today’s AI revolution.
By releasing powerful, open-source large language models (LLMs) at unprecedented speed, Mistral AI has not only made waves in the developer community but also redefined what’s possible for lean, fast-moving startups. Its mission is bold and clear: to deliver LLMs that offer greater access, more ownership, and better flexibility, empowering labs and developers alike with models that are reliable, adaptable, and easy to integrate.
But that ambitious vision brings intense pressure. Mistral AI knew it needed to move faster than the market and scale without compromise, a pace that demanded a highly specialized, deeply collaborative infrastructure partner from day one to effectively tackle the following challenges:
- Limited time and resources: To stay competitive as a young startup, Mistral AI needed to get its models to market fast. That meant its developers needed to stay focused on developing exceptional models, not firefighting infrastructure challenges.
- Reliability and resiliency: For streamlined organizations like Mistral AI, controlling costs is absolutely essential. Mistral AI’s teams required a partner that could ensure their clusters stayed healthy—and when an issue did arise, that it could be rapidly resolved to minimize downtime and any impact on their bottom line.
- Rapid growth at scale: Growth is a major goal of any startup, but with rapid growth comes the need to quickly adjust to exponential increases in scale. Mistral AI needed a partner that could match its acceleration without delay or bottlenecks.
“CoreWeave is one of the few providers that has real experience at very large scale for exactly what we do, so large language model training.”
Timothée Lacroix, CTO
Mistral AI
Solution
Mistral AI chose to partner with CoreWeave for a reason that is as bold and focused as their own mission: CoreWeave is purpose-built for AI. Every layer of our platform—from data center architecture to application services—is meticulously engineered to deliver the highest levels of performance, efficiency, and scalability for modern AI workloads.
Unlike traditional cloud platforms that stretch to meet a wide range of use cases, the CoreWeave AI Cloud platform is fine-tuned for AI acceleration. For Mistral AI, this meant they had access to a partner that could support their demand for speed, was flexible enough to meet their aspirations, and was capable and experienced enough to let innovation soar without compromise.
That’s exactly why Mistral AI has consistently chosen to partner with CoreWeave since signing its first contract for NVIDIA H100 GPUs in 2023. Over the last two years, the company has stood up H200 and GB200 clusters that have exceeded its highest expectations and gotten its AI models to market at a lightning-fast pace.
Mistral AI was able to tackle its critical challenges and accelerate innovation by using the following solutions:
- Slurm on Kubernetes (SUNK): Our SUNK solution facilitates greater workload fungibility and resource sharing between training and inference workloads by running both use cases on the same cluster. With SUNK, Mistral AI’s engineering teams could collaborate without compromise, improving workflows and reducing bottlenecks across training and inference.
- Observability with ultra-granular metrics: Our observability platform provides detailed metrics and data for Mistral AI clusters out of the box, with no additional setup or extra charges required. Mistral AI can now easily visualize entire fleets of GPUs in one place, complete with full analytical detail about every node. Plus, Mistral AI can see real-time ingress and egress traffic throughput from each node in a cluster to external internet endpoints, such as model weight data sources, to identify under-optimized workloads.
- Automated health checks and 24/7 support: CoreWeave Mission Control continuously monitors the health of Mistral AI’s nodes, ensuring job interruptions and issues are promptly flagged and resolved as quickly as possible. Plus, our FleetOps, CloudOps, and Data Center Technician teams all work in tandem 24/7 to monitor for signs of deterioration across our fleets and our AI Cloud platform environments.
- NVIDIA GB200 NVL72: Mistral AI was among the first to gain access to NVIDIA GB200 NVL72 racks. This opportunity unlocked an unprecedented level of computing power their teams could leverage to run training and inference jobs—accelerating both overall productivity and time-to-market. With the support of CoreWeave infrastructure, Mistral AI was able to train models 2.5x faster than previously experienced on NVIDIA H200s—and also vastly exceeded the performance, efficiency, and speed they experienced on NVIDIA H100s.
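To make the SUNK model above concrete: because SUNK runs Slurm workloads on Kubernetes, engineers can submit familiar Slurm batch jobs while training and inference share the same cluster. The sketch below is illustrative only—the partition name, resource counts, and training command are hypothetical placeholders, not details from Mistral AI’s actual setup:

```bash
#!/bin/bash
# Hypothetical Slurm batch script for a multi-node training job on a
# SUNK-style (Slurm on Kubernetes) cluster. The partition, node and GPU
# counts, and training command are illustrative assumptions only.
#SBATCH --job-name=llm-pretrain
#SBATCH --partition=gpu-training   # training partition; inference jobs can share the same cluster
#SBATCH --nodes=4
#SBATCH --gpus-per-node=8
#SBATCH --time=24:00:00

# Launch one training task per node across the allocation.
srun python train.py --config config.yaml
```

Standard Slurm directives like these are what makes the shared-cluster model attractive: teams keep their existing HPC workflows while Kubernetes handles scheduling and resource sharing underneath.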
“[Our models] were trained 100% on CoreWeave infrastructure. I think not being on that kind of infrastructure would’ve delayed us by at least a few months.”
Timothée Lacroix, CTO
Mistral AI
Results
Mistral AI's meteoric rise isn't just based on its ability to move fast and rapidly scale. It's also about its ability to strategically choose the right partner—one ready and able to scale alongside its constantly growing needs. Mistral AI’s partnership with CoreWeave allows its teams to focus more exclusively on model innovation, without burning time or resources on AI infrastructure challenges. That intense focus, combined with best-in-class compute, helped propel Mistral AI into the global spotlight with game-changing results:
- Less maintenance, more focus: CoreWeave’s strong foundation and expert knowledge of AI infrastructure swiftly takes care of maintenance tasks and massively reduces interruptions, giving Mistral AI’s teams the ability to focus their efforts on AI innovation. It also allows them to experiment faster, shorten training cycles, and achieve faster time-to-market.
- Cost-efficient scaling: CoreWeave infrastructure delivers efficiency and speed that translate into higher productivity and critical cost savings for Mistral AI, without sacrificing its ability to rapidly innovate at scale or placing unnecessary strain on its teams.
- A long-lasting, strategic partnership: CoreWeave and Mistral AI’s partnership goes above and beyond simply providing plug-and-play infrastructure. Mistral AI benefits from a hands-on relationship with CoreWeave expert engineers determined to provide 24/7 support. In return, CoreWeave benefits from continually developing and adapting its infrastructure capabilities to the specific needs of an AI partner like Mistral AI, driving further innovation and efficiency.
“What we don’t see with CoreWeave are all of the many interruptions or issues that we would see elsewhere.”
Timothée Lacroix, CTO
Mistral AI
Through its open-source models and flexible deployment options, Mistral AI delivers foundational AI tools to leading developers and enterprises everywhere, enabling a new level of access to AI innovation.
Whether releasing base models to the open-source community or delivering bespoke deployments for large enterprise clients, Mistral AI is creating an ecosystem where AI development is democratized for all.
Partner with CoreWeave experts to explore how our purpose-built AI Cloud can help your organization reach its most ambitious AI goals—today and into the future. Reach out now to get started.