CoreWeave and Perplexity Partner to Scale AI Inference Workloads

CoreWeave and Meta Announce $21 Billion Expanded AI Infrastructure Agreement
🕧 6 min

CoreWeave, Inc., The Essential Cloud for AI, has announced a multi-year strategic partnership with Perplexity to support the company’s growing inference workloads on the CoreWeave Cloud. In addition, both organizations will collaborate to pilot new services aimed at improving AI performance, scalability, and operational efficiency.

As AI-powered applications increasingly operate in real-world environments, companies must ensure that their infrastructure delivers high performance and reliability. Perplexity develops AI-native products and services designed to run continuously, where inference speed and consistency directly influence user experience. Therefore, the company requires an infrastructure platform that can support high-performance computing while maintaining low latency and predictable operational costs.

Read More:ITTech Pulse Exclusive Interview with Michael Jacobs, Head of Social Innovation at IBM

To address these needs, the CoreWeaveCloud platform provides infrastructure specifically designed for AI workloads. The platform delivers consistent performance, enabling organizations to manage large-scale inference tasks while scaling resources quickly as demand grows. Moreover, the infrastructure allows companies to move seamlessly from development to long-term production without needing to redesign existing systems or tools.

Under the terms of the partnership, Perplexity will run its next-generation inference workloads on CoreWeave’s platform. By leveraging dedicated NVIDIA GB200 NVL72-powered clusters, CoreWeave will provide the computing capacity required to support Perplexity’s rapid growth. At the same time, the infrastructure will meet the advanced performance requirements of the company’s Sonar and Search API ecosystem.

In addition to supporting inference workloads, CoreWeave will deploy Perplexity Enterprise Max across its internal operations. This integration will enable employees to search both the web and internal knowledge sources, conduct multi-step research tasks, visualize and analyze data, and interact with advanced AI models through a single platform.

“We’re proud to partner with Perplexity as they scale their inference workloads on CoreWeave’s AI cloud,” said Max Hjelm, senior vice president of revenue at CoreWeave. “AI applications running in production require more than just access to raw infrastructure – they require best-in-class performance and reliability as well as a cloud platform designed end-to-end for AI that simplifies compute operations.”

Read More: How Domain-Specific Language Models Are Trained: Data, Fine-Tuning, and Governance

Furthermore, Perplexity has already begun running inference workloads using the CoreWeave Kubernetes Service as part of the initial deployment phase. The company is also utilizing W&B Models to train, fine-tune, and manage models throughout the entire lifecycle, from early experimentation to full-scale production.

“We were impressed by the combination of CoreWeave’s technical aptitude and partner-first mindset that help AI-native companies accelerate their growth and scaling goals,” said Dmitry Shevelenko, chief business officer at Perplexity. “CoreWeave is an essential partner in our efforts to optimize our infrastructure and the models we use to provide Perplexity users across industries with the strongest AI tools and agents on the market.”

Importantly, the partnership aligns with Perplexity’s broader multi-cloud strategy, which focuses on leveraging specialized infrastructure providers for advanced AI workloads. At the same time, the collaboration highlights CoreWeave’s role as a dedicated AI cloud provider supporting organizations that operate large-scale AI systems in demanding production environments.

CoreWeave continues to set performance benchmarks across the AI cloud industry. The company recently achieved industry-leading MLPerf benchmark results and remains the only AI cloud provider to earn the top Platinum ranking in both SemiAnalysis ClusterMAX 1.0 and 2.0 evaluations, which assess cloud performance, efficiency, and reliability for AI workloads.

Write to us [⁠wasim.a@demandmediaagency.com] to learn more about our exclusive editorial packages and programmes.

  • ITTech Pulse News Desk is a premier news hub delivering latest updates and in-depth analysis on Information Technology. Covering AI, cybersecurity, cloud computing, and emerging trends, it empowers IT professionals, business leaders, and tech enthusiasts to always remain on top of the industry.

Recommended Reads :