AI– 0

Google DiffusionGemma Generates Text Blocks In Parallel

VentureBeat·June 11, 2026 at 03:16 PM

Google has introduced DiffusionGemma, an experimental open-source language model that utilizes a diffusion process for text generation, enabling parallel processing of token blocks rather than sequential, one-by-one generation. This novel approach allows the model to process 256 tokens simultaneously, leading to significant speed improvements, potentially up to four times faster than traditional models on GPUs, especially in local or low-concurrency inference scenarios. While DiffusionGemma offers enhanced speed and a unique self-correction capability, Google acknowledges that its overall output quality is currently lower than standard autoregressive models like Gemma 4, making it more suitable for specific constrained generation tasks rather than open-ended creative writing.

Google DiffusionGemma Generates Text Blocks In Parallel

Microsoft SkillOpt upgrades AI agent skills automatically

Coinbase launches AI agent for trading

Deezer launches AI music detector for playlists

Jeff Bezos' AI startup valued at $41 billion