Smart-Egg-2568


Ollama with a 13B model is running slowly on a cloud server that has a 32 GB Nvidia Tesla V100S. Do I need to change my configuration to properly utilize the GPU memory?
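
For reference, here's a minimal sketch of how I'm checking whether the model is actually resident in GPU memory. It assumes a stock Ollama install serving its default local API on port 11434 and the `/api/ps` endpoint; the field names and percentage math are my reading of the API's JSON output, so treat them as illustrative:

```python
import json
import urllib.request

# Ask Ollama's local API which models are currently loaded in memory.
# /api/ps reports, per model, the total resident size and how much of
# it sits in VRAM. Assumes the default host/port; adjust if yours differs.
with urllib.request.urlopen("http://localhost:11434/api/ps") as resp:
    status = json.load(resp)

for model in status.get("models", []):
    size = model.get("size", 0)            # total bytes resident
    size_vram = model.get("size_vram", 0)  # bytes resident on the GPU
    pct = 100 * size_vram / size if size else 0
    print(f"{model['name']}: {pct:.0f}% of {size / 1e9:.1f} GB in VRAM")
```

If `size_vram` comes back well below the model size, my understanding is that some layers are spilling to the CPU, which the `num_gpu` option (or the equivalent Modelfile parameter) is supposed to control. Is that the right knob to be turning here?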