IBM and Groq have entered into a strategic technology and go-to-market partnership designed to speed up the deployment of agentic AI across enterprise environments. The collaboration brings Groq’s high-performance inference platform, GroqCloud, directly into IBM’s watsonx Orchestrate, enabling faster automation for complex workflows in sectors including healthcare, government, finance, retail, and manufacturing.
The partnership will also enhance Red Hat's open-source vLLM technology with Groq’s custom LPU architecture, offering enterprises improved speed, cost efficiency, and performance when moving AI agents from pilots to full production. IBM Granite models are planned to be supported on GroqCloud as part of the initiative.
GroqCloud, built on a specialized Language Processing Unit (LPU), delivers more than 5X faster inference than traditional GPU systems, with consistently low latency even at global scale. This allows enterprises to deploy AI agents that can analyze and respond to high-volume, real-time queries (such as thousands of parallel patient inquiries in healthcare) with improved accuracy and reliability.
Rob Thomas, Senior Vice President, Software and Chief Commercial Officer at IBM, said the partnership reinforces IBM’s commitment to helping organizations successfully deploy AI with measurable business outcomes. “When enterprises go from experimentation to production, they must ensure complex workflows can be deployed successfully to guarantee high-quality experiences,” he said.
Jonathan Ross, CEO and Founder of Groq, said the companies are enabling AI to act instantly and learn continuously. “With Groq’s speed and IBM’s enterprise expertise, we’re making agentic AI real for business,” he noted. “This partnership is about transforming how enterprises work with AI and accelerating adoption with confidence.”
The collaboration focuses on delivering high-speed inference, enhanced security and privacy controls for regulated industries, and seamless integration with watsonx Orchestrate for flexible, purpose-built automation patterns. Developers will gain new capabilities in inference orchestration, hardware acceleration, and load balancing, all within familiar tooling environments.
IBM will begin offering access to GroqCloud immediately as the companies move jointly to empower enterprises with faster, more intelligent, and operationally viable AI deployments globally.