this post was submitted on 26 Jul 2024

230 points (96.7% liked)

science

21879 readers

295 users here now

A community to post scientific articles, news, and civil discussion.

rule #1: be kind

founded 2 years ago

MODERATORS

m3t00@lemmy.world

laverabe@lemmy.world

DeadPand@midwest.social

Joleee@lemmy.world

laverabe@lemmy.zip

230

AI models fed AI-generated data quickly spew nonsense (www.nature.com)

submitted 1 year ago by ArcticDagger@feddit.dk to c/science@lemmy.world

51 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[–] 0laura@lemmy.world 2 points 1 year ago* (last edited 1 year ago)

no, not really. the improvement gets less noticeable as it approaches the limit, but I'd say the speed at which it improves is still the same. especially smaller models and context window size. there's now models comparable to chatgpt or maybe even gpt 4.0 (I don't remember, one or the other) with context window size of 128k tokens, that you can run on a GPU with 16gb of vram. 128k tokens is around 90k words I think. that's more than 4 bee movie scripts. it can "comprehend" all of that at once.