Another possibility is that humans just aren’t smart enough to figure out AGI. While I’m sure that we will continue incrementally improving technology in some form, it’s not at all self-evident that these improvements will eventually add up to AGI.
Comment on Don’t believe the hype: AGI is far from inevitable
ContrarianTrail@lemm.ee 2 months ago
AGI is inevitable unless:
-
General intelligence is substrait independent and what the brain does cannot be replicated in silica. However since both are made of matter and matter obeys the laws of physics, I see no reason to assume this.
-
We destroy ourselves before we reach AGI.
Other than that, we will keep incrementally improving our technology and it’s only a matter of time untill we get there. May take 5 years, 50 or 500 but it seems pretty inevitable to me.
zygo_histo_morpheus@programming.dev 2 months ago
ContrarianTrail@lemm.ee 2 months ago
I get what you’re saying but to me, that still just sounds like a timescale issue. I can’t think of a scenario where we’ve improved something so much that there’s just absolutely nothing we could improve on further. With AI we only need to reach the point of making it have human-level cognitive capabilities and from there on it can improve itself.
zygo_histo_morpheus@programming.dev 2 months ago
There are a couple of reasons that might not work:
- Maybe we’ll asymptotically approach a point that is lower than human-level cognitive capabilities
- Gradual improvements are susceptible to getting stuck in a local maxima. This is a problem in evolution as well. A lot of animals could in theory evolve, say, human level intelligence in principle, but to reach that point they’d have to go through a bunch of intermediate steps that lead to worse fitness. Gradual scientific improvements are a bit like evolution in this way.
- We also lose knowledge over time. Something as dramatic as a nuclear war would significantly set back the progress in developing AGI, but something less dramatic might also lead to us forgetting things that we’ve already learned.
To be clear, most of the arguments I’m making aren’t really about AGI specifically but about humanities capability to develop arbitrary technically feasible technologies in general.
BarryZuckerkorn@beehaw.org 2 months ago
I can’t think of a scenario where we’ve improved something so much that there’s just absolutely nothing we could improve on further.
Progress itself isn’t inevitable. Just because it’s possible doesn’t mean that we’ll get there, because the history of human development shows that societies can and do stall, reverse, etc.
And even if all human societies tends towards progress, it could still hit dead ends and stop there. Conceptually, it’s like climbing a mountain through the algorithm of “if there is a higher elevation near you, go towards that, and avoid stepping downward in elevation.” Eventually that algorithm brings you to a local peak. But the local peak might not be the highest point on the mountain, and while it is theoretically possible to have gotten to the other true peak from the beginning, the person who is insistent on never stepping downward is now stuck. Or, it’s possible to get to the true peak but it requires climbing downward for a time and climbing up past elevations we’ve already been to, on paths we hadn’t been on. One can imagine a society that refuses to step downward, breaking the inevitability of progress.
This paper identifies a specific dead end and advocates against hoping for general AI through computational training. It is, in effect, arguing that even though we can still see plenty of places that are higher elevation than where we are standing, we’re headed towards a dead end, and should climb back down. I suspect that not a lot of the actual climbers will heed that advice.
vrighter@discuss.tchncs.de 2 months ago
incremental improvements on a dead end, still gets you to the dead end.
ContrarianTrail@lemm.ee 2 months ago
Then you need to give me an explanation for why it’s a dead end
vrighter@discuss.tchncs.de 2 months ago
because, having coded them myself, I am under no illusions as to their capabilities. They are not magic. “just” some matrix multiplications that generate a probability distribution for the next token, which is then randomly sampled.
ContrarianTrail@lemm.ee 2 months ago
You seem to be talking about LLMs now and I’m not. LLMs being a dead end is perfectly compatible with what I just said. We’ll just try a different approach next then. Even the fact of realizing they’re a dead end is yet another step towards AGI.
Eccitaze@yiffit.net 2 months ago
Did you read the article, or the actual research paper? They present a mathematical proof that any hypothetical method of training an AI that produces an algorithm that performs better than random chance could also be used to solve a known intractible problem, which is impossible with all known current methods. This means that any algorithm we can produce that works by training an AI would run in exponential time or worse.
The paper authors point out that this also has severe implications for current AI, too–since the current AI-by-learning method that underpins all LLMs is fundamentally NP-hard and can’t run in polynomial time, “the sample-and-time requirements grow non-polynomially (e.g. exponentially or worse) in n.” They present a thought experiment of an AI that handles a 15-minute conversation, assuming 60 words are spoken per minute (keep in mind the average is roughly 160). The resources this AI would require to process this would be 60*15 = 900. The authors then conclude:
“Now the AI needs to learn to respond appropriately to conversations of this size (and not just to short prompts). Since resource requirements for AI-by-Learning grow exponentially or worse, let us take a simple exponential function O(2n ) as our proxy of the order of magnitude of resources needed as a function of n. 2^900 ∼ 10^270 is already unimaginably larger than the number of atoms in the universe (∼10^81 ). Imagine us sampling this super-astronomical space of possible situations using so-called ‘Big Data’. Even if we grant that billions of trillions (10 21 ) of relevant data samples could be generated (or scraped) and stored, then this is still but a miniscule proportion of the order of magnitude of samples needed to solve the learning problem for even moderate size n.”
That’s why LLMs are a dead end.
ContrarianTrail@lemm.ee 2 months ago
But I wasn’t talking about LLMs