Comment on Don’t believe the hype: AGI is far from inevitable

vrighter@discuss.tchncs.de ⁨9⁩ ⁨months⁩ ago

Incremental improvements on a dead end still get you to the dead end.

ContrarianTrail@lemm.ee ⁨9⁩ ⁨months⁩ ago
Then you need to give me an explanation for why it’s a dead end

- vrighter@discuss.tchncs.de ⁨9⁩ ⁨months⁩ ago
  Because, having coded them myself, I am under no illusions as to their capabilities. They are not magic: “just” some matrix multiplications that generate a probability distribution for the next token, which is then randomly sampled.
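
As a rough illustration of the description above (matrix multiplication, then a probability distribution, then random sampling), here is a minimal sketch in plain Python. The three-word vocabulary, the weights, and the function names are all made up for the example; a real model stacks many such layers and uses a far larger vocabulary.

```python
import math
import random


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def softmax(logits):
    """Turn raw scores into a probability distribution."""
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(z - peak) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sample_next_token(hidden, output_weights, vocab, rng):
    """One decoding step: matrix multiply, softmax, random sample."""
    logits = matvec(output_weights, hidden)
    probs = softmax(logits)
    token = rng.choices(vocab, weights=probs, k=1)[0]
    return token, probs


# Hypothetical toy values, chosen only for illustration.
vocab = ["the", "cat", "sat"]
hidden = [0.5, -1.0]                      # stand-in for the model's internal state
output_weights = [[1.0, 0.0],             # one row of weights per vocabulary entry
                  [0.0, 1.0],
                  [1.0, 1.0]]

rng = random.Random(0)                    # seeded so repeated runs match
token, probs = sample_next_token(hidden, output_weights, vocab, rng)
print(token, probs)
```

The sampled token varies with the seed; only the probabilities are determined by the weights.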
  
  - ContrarianTrail@lemm.ee ⁨9⁩ ⁨months⁩ ago
    You seem to be talking about LLMs now and I’m not. LLMs being a dead end is perfectly compatible with what I just said. We’ll just try a different approach next then. Even the fact of realizing they’re a dead end is yet another step towards AGI.
    
    vrighter@discuss.tchncs.de ⁨9⁩ ⁨months⁩ ago
    Yeah, so that means that it’s not incremental improvement on what we have that we need. That will get us nowhere. We need an (as yet unknown) completely different approach, which is the opposite of incremental improvement.
    
- Eccitaze@yiffit.net ⁨9⁩ ⁨months⁩ ago
  Did you read the article, or the actual research paper? They present a mathematical proof that any hypothetical method of training an AI that produces an algorithm that performs better than random chance could also be used to solve a known intractable problem, which is impossible with all known current methods. This means that any algorithm we can produce that works by training an AI would run in exponential time or worse.
  
  The paper’s authors point out that this has severe implications for current AI, too: since the AI-by-learning method that underpins all LLMs is fundamentally NP-hard and can’t run in polynomial time, “the sample-and-time requirements grow non-polynomially (e.g. exponentially or worse) in n.” They present a thought experiment of an AI that handles a 15-minute conversation, assuming 60 words are spoken per minute (keep in mind the average is roughly 160). That gives an input size of n = 60*15 = 900. The authors then conclude:
  
  “Now the AI needs to learn to respond appropriately to conversations of this size (and not just to short prompts). Since resource requirements for AI-by-Learning grow exponentially or worse, let us take a simple exponential function O(2^n) as our proxy of the order of magnitude of resources needed as a function of n. 2^900 ∼ 10^270 is already unimaginably larger than the number of atoms in the universe (∼10^81). Imagine us sampling this super-astronomical space of possible situations using so-called ‘Big Data’. Even if we grant that billions of trillions (10^21) of relevant data samples could be generated (or scraped) and stored, then this is still but a miniscule proportion of the order of magnitude of samples needed to solve the learning problem for even moderate size n.”
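
The arithmetic in the quoted passage can be checked directly. The sketch below only reproduces the numbers quoted above (n = 900, the 2^n proxy, roughly 10^81 atoms, 10^21 samples). The variable names are made up for the example, and it says nothing about whether the proxy itself is the right model.

```python
import math

# Input size from the thought experiment: 60 words per minute for 15 minutes.
n = 60 * 15

# The quoted proxy for resource requirements: 2^n.
resources = 2 ** n
log10_resources = math.log10(resources)   # order of magnitude of 2^900

# Figures taken from the quoted passage, as powers of ten.
log10_atoms = 81      # atoms in the universe, per the quote
log10_samples = 21    # "billions of trillions" of data samples

# How many orders of magnitude separate each figure from 2^900.
gap_vs_atoms = log10_resources - log10_atoms
gap_vs_samples = log10_resources - log10_samples

print(f"n = {n}")
print(f"2^n is about 10^{log10_resources:.1f}")
print(f"orders of magnitude beyond the atom count: {gap_vs_atoms:.1f}")
print(f"orders of magnitude beyond the sample count: {gap_vs_samples:.1f}")
```

Python integers have arbitrary precision, so `2 ** n` is computed exactly and only the logarithm is rounded.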
  
  That’s why LLMs are a dead end.
  
  - ContrarianTrail@lemm.ee ⁨9⁩ ⁨months⁩ ago
    But I wasn’t talking about LLMs
    
    Eccitaze@yiffit.net ⁨9⁩ ⁨months⁩ ago
    You literally were LMAO
    
    “Other than that, we will keep incrementally improving our technology and it’s only a matter of time until we get there. May take 5 years, 50 or 500 but it seems pretty inevitable to me.”
    
    Literally a direct quote. In what world is this not talking about LLMs?
    