this post was submitted on 25 Nov 2023

LocalLLaMA


Community to discuss Llama, the family of large language models created by Meta AI.


Title sums it up.

[–] uti24@alien.top 1 points 11 months ago

If the model fits completely inside 12GB, it will run faster on the desktop; if it doesn't fit in 12GB but does fit fully in 16GB, there's a good chance it will run faster on the laptop with the 16GB GPU.
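
A rough way to sanity-check the "does it fit" question is to estimate the model's footprint from its parameter count and the quantization's bits per weight, plus some headroom for the KV cache and runtime overhead. A minimal sketch; the bits-per-weight values and the overhead constant below are illustrative assumptions, not measured figures:

```python
# Back-of-envelope VRAM estimate: weight bytes ~ params * bits_per_weight / 8,
# plus headroom for the KV cache and runtime overhead.

# Approximate bits per weight for common llama.cpp quant formats
# (illustrative values; real GGUF sizes vary slightly by model).
BITS_PER_WEIGHT = {"q4_k_m": 4.8, "q5_k_m": 5.7, "q6_k": 6.6, "q8_0": 8.5}

OVERHEAD_GB = 1.5  # assumed headroom for KV cache + CUDA context, not measured

def estimate_vram_gb(params_billions: float, quant: str) -> float:
    """Estimated VRAM needed to fully offload the model, in GB."""
    weights_gb = params_billions * 1e9 * BITS_PER_WEIGHT[quant] / 8 / 1e9
    return weights_gb + OVERHEAD_GB

for size in (7, 13):
    print(f"{size}B q6_k needs roughly {estimate_vram_gb(size, 'q6_k'):.1f} GB")
# 7B q6_k  -> ~7.3 GB  (fits in 12 GB)
# 13B q6_k -> ~12.2 GB (tight on 12 GB, comfortable on 16 GB)
```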

[–] __SlimeQ__@alien.top 1 points 11 months ago (3 children)

i can't speak for the desktop 3080ti, but i have that laptop card and it's roughly equivalent in performance to my 4060ti desktop card.

[–] hysterian@alien.top 1 points 11 months ago (1 children)

That’s odd, considering the desktop 4060 Ti has 8GB of VRAM. But are you talking just speed, or can you run larger-parameter LLMs on your laptop that your desktop can't?

[–] __SlimeQ__@alien.top 1 points 11 months ago

I have the 16GB version of the 4060 Ti, so the two cards have nearly identical capabilities.

[–] No_Afternoon_4260@alien.top 1 points 11 months ago

You mind shooting a few tests to get real-world numbers for the laptop version? Like, what kind of speeds are you getting for a 7B Q6 and a 13B Q6? They should fully fit in VRAM.
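
For anyone who wants to produce numbers like these, one quick approach is to time a single generation and divide by the tokens produced. A minimal sketch using the llama-cpp-python bindings; the model path is a placeholder, and `n_gpu_layers=-1` requests full GPU offload:

```python
import time
from llama_cpp import Llama  # pip install llama-cpp-python (built with CUDA)

# Placeholder path: point this at a local 7B or 13B Q6_K GGUF file.
llm = Llama(
    model_path="models/llama-2-13b.Q6_K.gguf",
    n_gpu_layers=-1,  # offload every layer to the GPU
    n_ctx=2048,
    verbose=False,
)

start = time.perf_counter()
out = llm("Explain VRAM in one paragraph.", max_tokens=256)
elapsed = time.perf_counter() - start

gen = out["usage"]["completion_tokens"]
print(f"{gen} tokens in {elapsed:.1f}s -> {gen / elapsed:.1f} tok/s")
```

This crude timing lumps prompt processing in with token generation; llama.cpp's llama-bench tool reports the two speeds separately if more precision is needed, but for a quick comparison this is usually close enough.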