this post was submitted on 25 Nov 2023
1 points (100.0% liked)

Machine Learning

1 readers
1 users here now

Community Rules:

founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] gebregl@alien.top 1 points 11 months ago (24 children)

We need a name for the fallacy where people call highly nonlinear algorithms with billions of parameters "just statistics", as if all they're doing is linear regression.

ChatGPT isn't AGI yet, but it is a huge leap in modeling natural language. The fact that there's some statistics involved explains neither of those two points.

[–] venustrapsflies@alien.top 1 points 11 months ago (11 children)

It’s not a fallacy at all. It is just statistics, combined with some very useful inductive biases. The fallacy is trying to smuggle some extra magic into the description of what it is.

Actual AGI would be able to explain something that no human has understood before. We aren’t really close to that at all. Falling back on “___ may not be AGI yet, but…” is a lot like saying “rocket ships may not be FTL yet, but…”

[–] InterstitialLove@alien.top 1 points 11 months ago (8 children)

The fallacy is the part where you imply that humans have magic.

"An LLM is just doing statistics, therefore an LLM can't match human intellect unless you add pixie dust somewhere." Clearly the implication is that human intellect involves pixie dust somehow?

Or maybe, idk, humans are just the result of random evolutionary processes jamming together neurons into a configuration that happens to behave in a way that lets us build steam engines, and there's no fundamental reason that jamming together perceptrons can't accomplish the same thing?

[–] red75prime@alien.top 1 points 11 months ago (1 children)

LLMs might still lack something that the human brain has. Internal monologue, for example, that allows us to allocate more than fixed amount of compute per output token.

[–] InterstitialLove@alien.top 1 points 11 months ago (1 children)

You can just give an LLM an internal monologue. It's called a scratchpad.

I'm not sure how this applies to the broader discussion, like honestly I can't tell if we're off-topic. But once you have LLMs you can implement basically everything humans can do. The only limitations I'm aware of that aren't trivial from an engineering perspective are

  1. current LLMs mostly aren't as smart as humans, like literally they have fewer neurons and can't model systems as complexly
  2. humans have more complex memory, with a mix of short-term and long-term and a fluid process of moving between them
  3. humans can learn on-the-go, this is equivalent to "online training" and is probably related to long-term memory
  4. humans are multimodal, it's unclear to what extent this is a "limitation" vs just a pedantic nit-pick, I'll let you decide how to account for it
[–] red75prime@alien.top 1 points 11 months ago

It's called a scratchpad.

And the network still uses skills that it learned in a fixed-computation-per-token regime.

Sure, future versions will lift many existing limitations, but I was talking about current LLMs.

load more comments (6 replies)
load more comments (8 replies)
load more comments (20 replies)