Featureless_Bug

joined 1 year ago
[–] Featureless_Bug@alien.top 1 points 11 months ago

I mean, Falcon 40B can easily be trained with LoRA on 2x A100 (even LLaMA 70B can be trained on just 2x A100 that way). But maybe Accelerate is doing something stupid - in my experience, both DeepSpeed and Accelerate are very slow and require way too much memory compared to a manual GPU distribution strategy.
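
For reference, a manual split can be as simple as handing transformers an explicit `device_map` instead of letting Accelerate/DeepSpeed decide the sharding. This is a minimal sketch, assuming the `tiiuae/falcon-40b` checkpoint, 4-bit loading via bitsandbytes, and Falcon's usual module names (60 decoder blocks, fused `query_key_value` attention projection) - verify those against `model.named_modules()` for your exact checkpoint:

```python
# Sketch: LoRA fine-tuning setup for Falcon-40B split manually across 2x A100.
# Assumes transformers, peft, and bitsandbytes are installed; module names and
# layer count follow the Falcon-40B architecture but should be double-checked.
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model

model_id = "tiiuae/falcon-40b"

# Manual device map: first half of the 60 decoder blocks on GPU 0,
# second half plus embeddings' counterpart (final norm + LM head) on GPU 1.
device_map = {"transformer.word_embeddings": 0, "transformer.ln_f": 1, "lm_head": 1}
for i in range(60):
    device_map[f"transformer.h.{i}"] = 0 if i < 30 else 1

model = AutoModelForCausalLM.from_pretrained(
    model_id,
    device_map=device_map,
    quantization_config=BitsAndBytesConfig(
        load_in_4bit=True, bnb_4bit_compute_dtype=torch.bfloat16
    ),
)

lora_config = LoraConfig(
    r=16, lora_alpha=32, lora_dropout=0.05,
    target_modules=["query_key_value"],  # Falcon's fused QKV projection
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only the LoRA adapters are trainable
```

The LoRA hyperparameters above are placeholders, not a recommendation - the point is just that the model fits on two cards once you place the blocks yourself.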

[–] Featureless_Bug@alien.top 1 points 1 year ago (1 children)

A MacBook Air is not money well spent. Get a workstation with a 4090, and rent an A100 in the cloud with the money you saved.