Skip to main content
Company Startup

Groq

Founded 2016 · Mountain View, USA · ~$750M Series E at $6.9B valuation (Sept 2025); ~$20B Nvidia licensing deal (Dec 2025)

8.8/10 Strong
Flagship

Free 30 req/min / paid usage-based

Editorial · no paid placements

Groq is the AI inference company behind the LPU chip and GroqCloud, founded 2016 by Jonathan Ross, valued at $6.9B with a ~$20B Nvidia licensing deal. Verified June 16, 2026.

Products by Groq

  1. Groq Fast LLM inference on Groq's LPU cloud. Free tier for prototyping, paid usage-based API pricing, Llama/Qwen/GPT OSS open-model routes, Batch API discounts, and prompt-caching savings.
    Free 30 req/min / paid usage-based 8.8/10

Groq is an AI inference company that designs the LPU (Language Processing Unit) chip and runs Groq (GroqCloud), a fast token-as-a-service platform for running open models. Founded in 2016 by Jonathan Ross, who helped create Google’s TPU, Groq raised $750 million at a $6.9 billion valuation in September 2025 and struck a roughly $20 billion licensing deal with Nvidia in December 2025, after which Ross and other leaders moved to Nvidia and Simon Edwards became CEO.

Key Facts

Founded2016
FounderJonathan Ross (ex-Google TPU)
CEOSimon Edwards
HQMountain View, USA
Funding$750M Series E at $6.9B (Sept 2025)
Nvidia deal~$20B licensing agreement (Dec 2025)
HardwareLPU inference chip
PlatformGroqCloud, fast inference for open models

What They Do

side of AI: running models fast and cheaply rather than training them. Its LPU is purpose-built for low-latency, and GroqCloud exposes that hardware as an API where developers run open models like Llama and DeepSeek at speeds conventional GPUs struggle to match. The pitch is speed and cost-per-token for inference at scale.

The December 2025 Nvidia licensing deal reshaped the company: it validated Groq’s LPU technology, brought in significant value, and moved founder Jonathan Ross and others to Nvidia, while Groq continues under new leadership repositioning as an inference neocloud. Groq is also developing next-generation chips with Samsung Foundry.

Current Flagship Products

  • Groq: GroqCloud, the company’s fast inference platform (token-as-a-service) for running open models on its LPU hardware, used by developers who need low latency.

Strategic Position

Groq’s moat is differentiated inference hardware and a real speed advantage, now partly validated by the Nvidia licensing deal. Its challenges are formidable: competing on inference against Nvidia GPUs and other accelerators, executing a leadership transition after key founders departed, and scaling a capital-intensive chip-plus-cloud business. The Nvidia relationship is both validation and a complication for independence.

For AIpedia readers, Groq matters when inference speed and cost on open models are the priority, weighed against GPU-based providers and other inference clouds.

Sources

  • Groq for AIpedia’s canonical product record.
  • DCD and Sacra for the $750M raise, $6.9B valuation, and funding history.
  • The Motley Fool and IntuitionLabs reporting for the December 2025 Nvidia licensing deal and leadership moves.
Share LinkedIn
Spotted an error or want to share your experience with Groq?

Every tool page is re-verified on a recurring cycle, and corrections land faster when readers flag them directly. If you spot a stale fact, a missing capability, or have used Groq and want to share what worked or didn't, the editorial desk reviews every message sent through this form.

Email editorial@aipedia.wiki