LocalLLaMA

3 readers

1 users here now

Community to discuss about Llama, the family of large language models created by Meta AI.

founded 1 year ago

MODERATORS

communick@poweruser.forum

PSA about Mining Rigs (alien.top)

submitted 1 year ago by DrVonSinistro@alien.top to c/localllama@poweruser.forum

3 comments fedilink hide all child comments

I just wanted to leave out there that tonight I tested what happen when you try to run oobabooga with 8x 1060 GTX on a 13B model.

First of all it works like perfectly. No load on the cpu and 100% equal load on all gpu's.

But sadly, those usb cables for the risers dont have the bandwidth to make it a viable option.

I get 0.47 token/s

So for anyone that Google this shenanigan, here's the answer.

*EDIT

I'd add that CUDA computing is equally shared across the card but not the vram usage. A LOT of vram is wasted in the process of sending data to compute to the other cards.

you are viewing a single comment's thread
view the rest of the comments

[–] opi098514@alien.top 1 points 1 year ago

I didn’t know I needed this information till now