What model are you going to run that can accept 100GB of context?
Ruin-Capable
joined 1 year ago
I wonder if you could get it running on two Mac Studio Ultras with 192GB of RAM each. With fewer nodes you'd reduce the communication overhead quite a bit.
Ooh... now I've got another model to play with. :D