For real-world evaluation, benchmarks must be carefully selected to match the context of AI ...
Microsoft Corp. has developed a series of large language models that can rival algorithms from OpenAI and Anthropic PBC, ...
Carnegie Mellon University researchers propose a new LLM training technique that gives developers more control over chain-of-thought length.
AMD published DeepSeek R1 benchmarks of its W7900 and W7800 Pro series 48GB GPUs, which massively outperform the 24GB RTX 4090.
Canada’s leading large language model (LLM) developer Cohere has unveiled its new Command A model, which the company claims ...
Speaking of the new Mac Studio and Apple making the best computers for AI: this is a terrific overview by Max Weinbach about the new M3 Ultra chip and its real-world performance with various on-device ...
Together, Pliops and the vLLM Production Stack, an open-source reference implementation of a cluster-wide full-stack vLLM serving system, are delivering unparalleled performance and efficiency for LLM ...
Training LLMs on GPU Clusters, an open-source guide that provides a detailed exploration of the methodologies and ...
Aimed at revolutionizing large language model (LLM) inference performance, this partnership comes at a pivotal moment as the AI community gathers next week for the GTC 2025 conference. Together ...