I wish there was a 13b model which can just fit in on my GPU with quant
Are there any recommendations on which LLM to use for writing WebAssembly Code?
its model average on the openllm leaderboard is 51.
I wish there was a 13b model which can just fit in on my GPU with quant