Comment

Comment on AI Is A Money Trap

<- View Parent

Feyd@programming.dev ⁨4⁩ ⁨months⁩ ago

We’re only a few years off from platform-agnostic local inference at mass-market prices.

What makes you confident in that? What will change?

source

Sort:hotnew top

Powderhorn@beehaw.org ⁨4⁩ ⁨months⁩ ago
There are already large local models. It’s a question of having the hardware, which has historically gotten more powerful with each generation. I don’t think it’s going to be phones for quite some time, but on desktop, absolutely.

source
- Feyd@programming.dev ⁨4⁩ ⁨months⁩ ago
  For business use, laptops without powerful graphics cards have been the norm for quite some time. Do you see businesses deciding to change to desktops to accommodate the power for local models? I think it’s pretty optimistic to think that laptops are going to be that powerful in the next 5 years. The advancement in chip capability has dramatically slowed, and to put them in laptops they’d need to be incredibly more power efficient as well.
  
  source
  - Powderhorn@beehaw.org ⁨4⁩ ⁨months⁩ ago
    For the security tradeoff of sensitive data not heading to the cloud for processing? Not all businesses, but many would definitely see value in it. We’re also discussing this as though the options are binary … models could also be hosted on company servers that employees VPN into.
    
    source
  - jarfil@beehaw.org ⁨4⁩ ⁨months⁩ ago
    Keywords: NPU, unified RAM
    
    Apple is doing it, AMD is doing it.
    
    GPUs are an inefficient way of doing inference. They’ve been great for research purposes, into what type of NPU may be the best one, but that’s been answered already for LLMs. Current step is, achieving mass production.
    
    5 years sounds realistic, unless WW3.
    
    source