What's the tok/s for each of those models on that system?
Edit: also, if you don't mind my asking, how much context are you able to use before inference degrades?
What's the tok/s for each of those models on that system?
Edit: also, if you don't mind my asking, how much context are you able to use before inference degrades?