Technology

86068 readers

3650 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related news or articles.
Be excellent to each other!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
Check for duplicates before posting, duplicates may be removed
Accounts 7 days and younger will have their posts automatically removed.

Approved Bots

founded 3 years ago

MODERATORS

L3s@lemmy.world

enu@lemmy.world

technopagan@lemmy.world

L4s@lemmy.world

L3s@hackingne.ws

166

Nvidia reveals new A.I. chip, says costs of running LLMs will ‘drop significantly’ (www.cnbc.com)

submitted 2 years ago by L4s@lemmy.world to c/technology@lemmy.world

18 comments fedilink hide all child comments

Nvidia reveals new A.I. chip, says costs of running LLMs will ‘drop significantly’::Currently, Nvidia dominates the market for AI chips, with over 80% market share, according to some estimates.

all 26 comments

sorted by: hot top controversial new old

[–] Zerfallen@lemmy.world 33 points 2 years ago (1 children)

I'm sure the cost to the consumer will remain exactly the same, or somehow increase.

[–] GenderNeutralBro@lemmy.sdf.org 7 points 2 years ago (1 children)

I'm not worried about that. There will be open competition, because most of this stuff is open-source. Cheaper hardware will open the door for anyone like you or me to set up our own services. Anyone can set up a server with their own hardware (or rent it from Amazon or wherever) and run their own chatbot (with blackjack! and hookers!) instead of using ChatGPT.

This is already possible on consumer hardware, just not with the biggest and best networks. Right now, if I wanted to run, say, BLOOM (an open-source LLM), I'd need to spend close to $100K on hardware. Obviously, that's out of reach for a hobbyist, so I'm limited to using smaller, less advanced networks like LLaMa or GPT-J. Cheaper hardware will help break the hold that the big players currently have over the industry.

[–] abhibeckert@lemmy.world 1 points 2 years ago* (last edited 2 years ago) (1 children)

if I wanted to run, say, BLOOM (an open-source LLM), I’d need to spend close to $100K on hardware

Doesn't that dozens of notes with over a terabyte of RAM each? And state of the art networking?

Sounds closer to $100M than $100K.

[–] GenderNeutralBro@lemmy.sdf.org 3 points 2 years ago

If you want to train your own network like they did, you'd want something like that, yeah, but to run the trained network you "only" need ~360GB of memory.

For context, even if you wanted to run this in CPU, there are currently no A5 mobos (Ryzen 7000 series) that support more than 192GB of memory. You literally can't even run it on high-end consumer hardware.

[+] dinckelman@lemmy.world 22 points 2 years ago (4 children)

[deleted]

[–] leonardo_arachoo@lemm.ee 16 points 2 years ago (1 children)

AI might not survive the next decade? I already use it every day at work. The productivity gains are enormous and far from saturated. I think it's more likely that AI will survive and consumers (humans) will not survive.

[–] Toribor@corndog.social 20 points 2 years ago (1 children)

I think people simultaneously overestimate the capability of current machine learning models while underestimating their long term impact. These models are going to be in everything. They are very resource hungry and will absolutely be a driver of hardware innovation for the next decade and probably longer.

[–] CrabAndBroom@lemmy.ml 16 points 2 years ago (1 children)

I'm liking AMD still. They're not perfect of course but they seem to have far less fuckery going on than Intel and Nvidia, and they have open source drivers that play nice with Linux.

[–] dinckelman@lemmy.world 9 points 2 years ago

I always have this thought in the back of my mind too, but the issue is that while raw performance is a bit better than the counterparts, Nvidia still offers more features for the money, and I don't always have money to throw away. Typically i'd upgrade my gpu once every 5 years or so

[–] eager_eagle@lemmy.world 10 points 2 years ago

fuck nvidia - but this comment won't age well

[–] phillaholic@lemm.ee 6 points 2 years ago (1 children)

How are they killing their consumer market? If they change their mind and put out a better gpu people will buy it.

[–] dinckelman@lemmy.world 2 points 2 years ago (1 children)

You've answered your own question. They used to release upgraded hardware with a reasonable generational boost almost yearly. Now the gap has widened, and they're iterating on old hardware, by giving it more juice and a larger cooler. Not to mention the astronomical prices that have outclassed previous top-end cards at the current mid-range

[–] phillaholic@lemm.ee 2 points 2 years ago

Not really. It’s not the right way to state it. They aren’t concerned with making money from the consumer market right now. Killing it implies it’s never coming back.