Cache Language Model - Search News

Dnotitia's STAR-KV cuts KV cache by up to 20x, earns ICML 2026 Spotlight selection

KV, a low-rank KV cache compression method achieving up to 20x reduction, with the paper selected as a Spotlight at ICML 2026 ...

The Manila Times

Dnotitia Unveils STAR-KV, Achieving UP to 20x KV Cache Compression, Selected as an ICML 2026 Spotlight Paper

Introduces a low-rank-based approach to KV cache compression, one of the key bottlenecks in long-context AISpeeds up attention computation by up to 6.9x and overall generation throughput by up to 3.1x ...

The LancetOpinion

Deception in clinical large language models: an under-recognised safety risk

Large language models (LLMs) are rapidly being integrated into clinical workflows, supporting tasks such as diagnosis ...

Decrypt

LongCat-2.0: The Stealth AI Model That Was Quietly Topping OpenRouter All Along

Chinese tech company Meituan officially unveiled LongCat-2.0 on June 30, confirming the open-license, 1.6-trillion-parameter mixture-of-experts AI model is the same system that sp ...

20h

The new enterprise AI expert every company needs - and why

These experts understand how to optimize frontier models. Advanced data and neural networking skills are crucial. If you're ...

Couchbase’s AI Data Plane aims to turn fragmented data into real enterprise agent memory

Industry discussions about what’s holding back AI often focus on security, graphics processing unit availability and other ...

Meituan open sources LongCat-2.0, the 1.6T, near-frontier agentic coding model that's been leading OpenRouter — trained entirely on Chinese chips

By registering the LongCat-2.0 repository under the open-source MIT License, Meituan positions the architecture with maximum ...

Decrypt

Show inaccessible results

Dnotitia's STAR-KV cuts KV cache by up to 20x, earns ICML 2026 Spotlight selection

Dnotitia Unveils STAR-KV, Achieving UP to 20x KV Cache Compression, Selected as an ICML 2026 Spotlight Paper

Deception in clinical large language models: an under-recognised safety risk

LongCat-2.0: The Stealth AI Model That Was Quietly Topping OpenRouter All Along

The new enterprise AI expert every company needs - and why

Couchbase’s AI Data Plane aims to turn fragmented data into real enterprise agent memory

Meituan open sources LongCat-2.0, the 1.6T, near-frontier agentic coding model that's been leading OpenRouter — trained entirely on Chinese chips

Ornith Is the Open-Source Coding Model Built for Agents, Not Humans

Coinbase Cuts AI Spend by Half With Open-Weight Models, Smarter Routing

Buckle Up: The Bad Guys Now Have A Model As Powerful As Mythos

A Practitioner's Guide to Using Large Language Models and Generative AI in Economic History

OpenAI wants to drop its next AI model, but the US GOV has stepped in to put a hand on the brake