So it was a perf test of a 1B-token model, not the full 3.7T that GPT-3 is trained on. I mean, great, they are showing an improvement, but this is just a headline grabber; they haven't done anything actually useful here.
this post was submitted on 09 Nov 2023
Futurology
Just checking in to say they are still there. So many rascals showing off rigs these days.