LienniTa

joined 1 year ago
[–] LienniTa@alien.top 1 points 11 months ago

Goliath 120B would fit in 64 GB of RAM, though. It doesn't have the repetition problem...
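Very roughly, the arithmetic behind "fits in 64 GB" looks like this — a back-of-the-envelope sketch in Python, where the bits-per-weight figures are my assumed approximations for common GGUF quants, not exact file sizes:

```python
# Rough GGUF size estimate for a ~118B-parameter model (Goliath 120B is
# about 118B params). Bits-per-weight values below are approximations;
# real GGUF files vary with the quant mix and metadata.
PARAMS = 118e9

quants = {
    "Q2_K": 2.6,    # assumed effective bits per weight
    "Q3_K_M": 3.9,
    "Q4_K_M": 4.8,
    "Q5_K_M": 5.7,
}

for name, bpw in quants.items():
    gb = PARAMS * bpw / 8 / 1e9
    verdict = "fits" if gb < 64 else "does not fit"
    print(f"{name}: ~{gb:.0f} GB -> {verdict} in 64 GB RAM (before KV cache)")
```

By this estimate only the 2–3-bit quants leave headroom in 64 GB once you account for the KV cache and the OS.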

[–] LienniTa@alien.top 1 points 11 months ago

📚 Training Data: We've amalgamated multiple public datasets to ensure a comprehensive and diverse training base. This approach equips Rocket-3B with a wide-ranging understanding and response capability.

We've amalgamated multiple public benchmark answers to ensure a contaminated and diverse training base.

[–] LienniTa@alien.top 1 points 11 months ago

Goliath 120B and good character cards. You'll have to tune parameters like min-p, repetition penalty, and temperature, though.
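For reference, a minimal sketch of what that tuning looks like, assuming llama-cpp-python (a recent build that exposes min_p); the model path and the exact values are placeholders to illustrate which knobs to turn:

```python
from llama_cpp import Llama

# Placeholder path; point it at your local Goliath GGUF file.
llm = Llama(model_path="./goliath-120b.Q2_K.gguf", n_ctx=4096)

out = llm(
    "### Instruction: ...",  # your character-card prompt goes here
    max_tokens=256,
    temperature=1.1,     # higher = more creative, more risk of drift
    min_p=0.05,          # drop tokens below 5% of the top token's probability
    repeat_penalty=1.1,  # mild penalty against verbatim repetition
)
print(out["choices"][0]["text"])
```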

[–] LienniTa@alien.top 1 points 11 months ago

GGUF Goliath will give you the best answers but will be very slow. You can offload around 40 layers to VRAM and your RAM will still be the speed bottleneck, but I think 2 t/s is possible on a 2-bit quant.
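A minimal sketch of that split, assuming a GPU-enabled (CUDA or Metal) llama-cpp-python build; the path, layer count, and prompt are placeholders:

```python
import time
from llama_cpp import Llama

llm = Llama(
    model_path="./goliath-120b.Q2_K.gguf",  # placeholder 2-bit quant
    n_gpu_layers=40,  # offload ~40 layers to VRAM; the rest run on CPU
    n_ctx=4096,
)

# Time a short generation; with the remaining layers on CPU, RAM
# bandwidth bounds throughput, so ~2 tokens/s is a plausible ceiling.
start = time.time()
out = llm("Hello, how are you?", max_tokens=64)
n_tokens = out["usage"]["completion_tokens"]
print(f"{n_tokens / (time.time() - start):.1f} tokens/s")
```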

[–] LienniTa@alien.top 1 points 11 months ago

Yeah, people are praising 7B and 13B models here and there, but... they just hallucinate! Goliath 120B, on the other hand, no matter how terrible its initial idea was, is just really good in normal conversations. I'm trying to love the much-praised OpenHermes 2.5 and other Mistral finetunes, but they are just better next-token predictors, unlike larger models, which are actually able to reason.