eeeehhh

joined 1 year ago

[D] How large an LLM can I train from scratch on a single A100 GPU with 80Gb memory? (alien.top)

submitted 1 year ago by eeeehhh@alien.top to c/machinelearning@academy.garden

4 comments fedilink

I have access to a single 80Gb A100 GPU and would like to train an LLM with GPT-like architecture from scratch. Does anyone know how to calculate the maximum model size.