ggerganov

ggerganov@alien.top 1 points 11 months ago

I just wrote a post today about serving 7B models with `llama.cpp` from cheap AWS instances - might be useful: