Comment on Alibaba Cloud says it cut Nvidia AI GPU use by 82% with new pooling system— up to 9x increase in output lets 213 GPUs perform like 1,192

<- View Parent
Rentlar@lemmy.ca ⁨21⁩ ⁨hours⁩ ago

I do expect operational savings from this optimization, but my guesstimate would be a 2-5x savings rather than the reported 9x savings when looked at over a fixed time period.

source
Sort:hotnewtop