With the abundance of models, most developers and users have to select a small subset of available models for their own evaluation, and that selection has to be based on already published data about the models' performance. At that stage, picking the models with, for example, the highest MMLU score is one way to go about it.
shibe5
I noticed this problem in llama.cpp too. I suspect that something required for Mistral models, e.g. sliding window attention, may not be implemented. To confirm that, one could compare outputs from PyTorch with the other software. I tried to do this, but the PyTorch model runs out of system RAM with a ~15k-token prompt.
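For anyone unfamiliar with what sliding window attention changes: each token attends only to the last `window` positions instead of the whole prefix, so an implementation without it effectively uses a different (full causal) mask. A minimal sketch of the mask, assuming the common convention that the window includes the current token (window size and indexing here are illustrative, not Mistral's exact code):

```python
def sliding_window_mask(n: int, window: int) -> list[list[int]]:
    """Causal sliding-window attention mask for a sequence of length n.

    mask[i][j] == 1 means query position i may attend to key position j.
    A token attends only to itself and the (window - 1) tokens before it.
    """
    return [
        [1 if 0 <= i - j < window else 0 for j in range(n)]
        for i in range(n)
    ]

# With n=5, window=2, position 4 sees only positions 3 and 4,
# whereas full causal attention would let it see 0..4.
mask = sliding_window_mask(5, 2)
```

If a backend silently falls back to full causal attention, short prompts often look fine and the divergence only shows up once the prompt exceeds the window, which would match problems appearing at ~15k tokens.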
That would include models trained on a calculator.
My own web UI for experimenting.