Google says that DiffusionGemma can generate more than 1,000 tokens per second when running on a single H100, a server-grade ...
On Thursday, Inception Labs released Mercury Coder, a new AI language model that uses diffusion techniques to generate text faster than conventional models. Unlike traditional models that create text ...