Technology

84807 readers

5769 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related news or articles.
Be excellent to each other!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
Check for duplicates before posting, duplicates may be removed
Accounts 7 days and younger will have their posts automatically removed.

Approved Bots

founded 2 years ago

MODERATORS

L3s@lemmy.world

enu@lemmy.world

technopagan@lemmy.world

L4s@lemmy.world

L3s@hackingne.ws

431

DeepSeek ditches Nvidia for Huawei chips in V4 launch (cybernews.com)

submitted 3 weeks ago* (last edited 3 weeks ago) by inari@piefed.zip to c/technology@lemmy.world

89 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[–] avidamoeba@lemmy.ca 17 points 3 weeks ago (3 children)

Been using Qwen 3.x for a while now for local LLM with search capability. The 3.5 and 3.6 ones are great and run very fast.

[–] sp3ctr4l@lemmy.dbzer0.com 8 points 3 weeks ago* (last edited 3 weeks ago) (2 children)

I got Qwen 3.5 running on a Steam Deck.

It ain't exactly blazing fast, but it does actually work.

(Reasonably fast if you go down to the 2B param model, I can get the 9B param variant working, though this makes Steam Decky very hot and bothered.)

Yeah, you absolutely do not need Nvidia hardware to run an LLM, but we get blasted with their propoganda suggesting otherwise just all the time in the English speaking West.

Because if you don't need Nvidia, well, then, this whole AI bubble looks a lot more bubbly.

[–] avidamoeba@lemmy.ca 5 points 3 weeks ago (1 children)

Take good care of your hw! It's not like 2 years ago when you could buy stuff off the shelf for reasonable prices. :D

[–] sp3ctr4l@lemmy.dbzer0.com 2 points 3 weeks ago (1 children)

My Steam Deck is my child.

Maybe if I can get it to run a 'good enough' LLM, and also a robotics kinematics suite...

I can just start building DOG, with a Steam Deck for a face, instead of a Combine scanner bot.

[–] los0220@lemmy.world 2 points 3 weeks ago (1 children)

Gemma 4 seems nice for local usage, way faster than Qwen models.

I was able to run 27B Gemma on my PC, where 14B Qwen was to slow due to CPU offload

[–] percent@infosec.pub 1 points 3 weeks ago* (last edited 3 weeks ago)

+1, exactly the same experience. Except Gemma4:26B really sucks with OpenCode. Works great with Pi though

[–] Diurnambule@jlai.lu 1 points 3 weeks ago* (last edited 3 weeks ago) (1 children)

Amd have the best consumer grafic card to run llm on the market.

[–] sp3ctr4l@lemmy.dbzer0.com 1 points 3 weeks ago* (last edited 3 weeks ago) (1 children)

Sorry, I'm not entirely sure what you mean.

Did you mean to say:

"And need to have the best consumer GPU on the market, to run an LLM."

... likely alluding to an RTX 5090?

So you would be saying that basically it is bullshit, the idea that everyone needs extremely expensive hardware, to run an LLM?

[–] Diurnambule@jlai.lu 2 points 3 weeks ago (1 children)

Hello, no sorry auto correction and going fast do it to my posts. I wanted to say that NVIDIA is already the worst option for consumer graphic card since AMD made a card with 20go ram which is able to run most open weight models.

[–] sp3ctr4l@lemmy.dbzer0.com 1 points 3 weeks ago

Aha! Ok, that makes sense as well.

[–] Nikelui@lemmy.world 4 points 3 weeks ago

Qwen 3.6 is already out? Damn, I swear I switched to 3.5 not even a month ago.

[–] humanspiral@lemmy.ca 2 points 3 weeks ago (1 children)

3.6 27b is probably most powerful/efficient (to size) model out there. Qwen has a history of leveraging deepseek power as well. (deepseek creating small models with Qwen as the base), and Alibaba is main hosting service for deepseek. Alibaba/Qwen in talks to invest in Deepseek, atm.

[–] avidamoeba@lemmy.ca 1 points 3 weeks ago

Yeah. The 80b Coder-Next runs at about the same speed on my hw too. I don't know if it's any better than 3.6 27b.