N
Hacker Next
new
past
show
ask
show
jobs
submit
login
▲
Google's TurboQuant AI-compression algorithm can reduce LLM memory usage by 6x
(
arstechnica.com
)
16 points by
gmays
11 hours ago
|
3 comments
add comment
Rendered at 01:56:53 GMT+0000 (Coordinated Universal Time) with Cloudflare Workers.
redanddead 9 hours ago
[-]
You'd think it'd be bigger news on hn
axiologist 9 hours ago
[-]
See
https://news.ycombinator.com/item?id=47513475
from two days ago.