this post was submitted on 04 Apr 2024
122 points (87.2% liked)
Technology
I have no idea how this is set up to work technically, but most of the heavy lifting is gonna be on the GPU. I'm not sure that it matters much whether the browser is what's pushing data to the GPU or some other package.
Most people probably don't have a dedicated GPU, and an iGPU is probably not powerful enough to run an LLM at decent speed. Also, a decent model requires something like 20 GB of RAM, which most people don't have.
It doesn't just require 20 GB of RAM, it requires that much VRAM, which is a much higher barrier to entry.
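For a rough sense of where numbers like 20 GB come from, here's a back-of-the-envelope sketch (the function name and figures are just illustrative, and it only counts the weights, not the KV cache or activations): memory is roughly parameter count times bytes per weight.

```python
def model_memory_gb(n_params_billion: float, bytes_per_weight: float) -> float:
    """Weight-only memory estimate: parameters x bytes per weight,
    converted to GiB. Ignores KV cache and activation memory."""
    return n_params_billion * 1e9 * bytes_per_weight / (1024 ** 3)

# A 13B-parameter model at 16-bit (2 bytes/weight) precision:
print(round(model_memory_gb(13, 2), 1))    # roughly 24 GiB just for weights

# The same model quantized to 4-bit (0.5 bytes/weight):
print(round(model_memory_gb(13, 0.5), 1))  # closer to 6 GiB
```

That's why quantized models are usually what people run locally: they're the only way a consumer GPU's VRAM comes close to fitting the weights.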
But what if you have an AMD APU. Doesn’t that use your normal RAM as VRAM?
Not exactly. Most integrated chips have a small pool of dedicated VRAM, plus a bit more that they share with system memory, though it's generally only a portion of it, not all. As far as I'm aware, it's only Apple's unified memory, and maybe some other mobile chips, where the CPU and GPU fully share one memory pool, for better or worse.
But it is worth noting that if you don't have enough VRAM and have to spill into system RAM, the rule of thumb is to have twice as much RAM as the amount you're offloading. So if your model needs more than your GPU's 4 GB of VRAM and the rest goes to the system, you don't need 16 GB of RAM, you need 32 GB.
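The arithmetic in that comment can be sketched like this (the function name and the 2x headroom factor are the commenter's rule of thumb, not a hard requirement):

```python
def ram_needed_gb(model_gb: float, vram_gb: float, headroom: float = 2.0) -> float:
    """System RAM suggested when a model doesn't fit in VRAM.
    Whatever spills out of VRAM should fit in RAM with ~2x headroom."""
    spill = max(model_gb - vram_gb, 0)  # portion offloaded to system memory
    return spill * headroom

# A 20 GB model on a GPU with 4 GB of VRAM spills 16 GB,
# so the rule of thumb suggests 32 GB of system RAM, not 16:
print(ram_needed_gb(20, 4))  # 32.0
```

If the model fits entirely in VRAM, the spill is zero and the rule adds nothing on top of your normal RAM needs.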