this post was submitted on 25 Nov 2023
1 points (100.0% liked)

Machine Learning

1 readers
1 users here now

Community Rules:

founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] InterstitialLove@alien.top 1 points 11 months ago (8 children)

The fallacy is the part where you imply that humans have magic.

"An LLM is just doing statistics, therefore an LLM can't match human intellect unless you add pixie dust somewhere." Clearly the implication is that human intellect involves pixie dust somehow?

Or maybe, idk, humans are just the result of random evolutionary processes jamming together neurons into a configuration that happens to behave in a way that lets us build steam engines, and there's no fundamental reason that jamming together perceptrons can't accomplish the same thing?

[–] red75prime@alien.top 1 points 11 months ago (2 children)

LLMs might still lack something that the human brain has. Internal monologue, for example, that allows us to allocate more than fixed amount of compute per output token.

[–] InterstitialLove@alien.top 1 points 11 months ago (1 children)

You can just give an LLM an internal monologue. It's called a scratchpad.

I'm not sure how this applies to the broader discussion, like honestly I can't tell if we're off-topic. But once you have LLMs you can implement basically everything humans can do. The only limitations I'm aware of that aren't trivial from an engineering perspective are

  1. current LLMs mostly aren't as smart as humans, like literally they have fewer neurons and can't model systems as complexly
  2. humans have more complex memory, with a mix of short-term and long-term and a fluid process of moving between them
  3. humans can learn on-the-go, this is equivalent to "online training" and is probably related to long-term memory
  4. humans are multimodal, it's unclear to what extent this is a "limitation" vs just a pedantic nit-pick, I'll let you decide how to account for it
[–] red75prime@alien.top 1 points 11 months ago

It's called a scratchpad.

And the network still uses skills that it learned in a fixed-computation-per-token regime.

Sure, future versions will lift many existing limitations, but I was talking about current LLMs.

load more comments (5 replies)