What sort of vram is needed to run a 4bit 70B model?
I've tried training LoRas on my own writing with mixed success.
What sort of vram is needed to run a 4bit 70B model?