vatsadev

joined 1 year ago
[–] vatsadev@alien.top 1 points 11 months ago (3 children)


IMPORTANT: this isn't a new pretrained model, it's another Mistral fine-tune with DPO, but on SlimOrca instead of UltraChat.

I would use OpenHermes instead; it's been tried much more widely and has proven solid.

[–] vatsadev@alien.top 1 points 11 months ago

Sad, and we thought HF was the harbor of unaligned models, but maybe I'm missing the whole story. Hopefully they don't kill models for saying "Taiwan good" or something.

[–] vatsadev@alien.top 1 points 11 months ago

Open source -> Mistral Instruct worked great for me; Zephyr alpha was crazy aligned, while beta was better.

Closed source -> Inflection's Pi is smooth! Praying for API access.

[–] vatsadev@alien.top 1 points 11 months ago

There's Fuyu-8B, but it has no commercial license.

It can really cover the "GPT-4 reads websites" use case and things like that, and it's helpful with complex charts too. Other than that, LLaVA is your best hope.

[–] vatsadev@alien.top 1 points 11 months ago

Detecting gender - This is basically MNIST but for gender. You could start from an MNIST-style setup: if your data is all the same size, scale up the model and train; you can experiment quickly and get ~50% accuracy (a minimal sketch of that kind of classifier is below).

Detecting elements - Slightly more complicated: for recognizing body features you would need a segmentation model, so I would look into that.
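
A minimal sketch of the classifier half of this (my illustration, not anything from the thread; the 64x64 face-crop inputs, layer sizes, and two-class head are all assumptions):

```python
import torch
import torch.nn as nn

class SmallCNN(nn.Module):
    """Tiny MNIST-style CNN scaled up to RGB inputs for a two-class task."""
    def __init__(self, num_classes: int = 2):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(3, 16, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),   # 64x64 -> 32x32
            nn.Conv2d(16, 32, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),  # 32x32 -> 16x16
        )
        self.classifier = nn.Linear(32 * 16 * 16, num_classes)

    def forward(self, x):
        return self.classifier(self.features(x).flatten(1))

model = SmallCNN()
dummy = torch.randn(8, 3, 64, 64)  # batch of resized crops (placeholder data)
logits = model(dummy)              # shape: (8, 2)
```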

[–] vatsadev@alien.top 1 points 11 months ago

"I want to chat with a PDF, I don't care for my LLM to speak French, be able to write Python or know that Benjamin Franklin wrote a paper on flatuence (all things RWKV v5 World 1.5B knows)."

This is prime RAG territory: bring snippets in and make the model use them. That said, the more knowledge the model has, the better it gets for your use case too, since it knows more on its own. A minimal sketch of the snippet-stuffing idea is below.

Also, nice choice using RWKV v5; how's it working for you?
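
Purely illustrative sketch of the snippet-stuffing idea mentioned above: the keyword-overlap retriever is a stand-in for a real embedding search, and the LLM call at the end is hypothetical.

```python
def retrieve(question: str, snippets: list[str], k: int = 3) -> list[str]:
    """Return the k snippets with the most word overlap with the question (toy retriever)."""
    q_words = set(question.lower().split())
    return sorted(snippets, key=lambda s: -len(q_words & set(s.lower().split())))[:k]

def build_prompt(question: str, snippets: list[str]) -> str:
    """Stuff the retrieved snippets into the prompt so the model answers from them."""
    context = "\n\n".join(retrieve(question, snippets))
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}\nAnswer:"
    )

# prompt = build_prompt("What does section 3 say about fees?", pdf_snippets)
# completion = your_llm(prompt)  # e.g. an RWKV v5 World 1.5B runner (hypothetical)
```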

[–] vatsadev@alien.top 1 points 11 months ago

There are GGUFs; check TheBloke or greensky.
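
If you want to try one locally, here's a hedged sketch using llama-cpp-python; the file name is a placeholder for whichever quant you actually download from TheBloke's (or greensky's) repo.

```python
from llama_cpp import Llama

# Load a downloaded GGUF quant (path/quant level are placeholders)
llm = Llama(model_path="./model.Q4_K_M.gguf", n_ctx=4096)

out = llm("Q: What is DPO?\nA:", max_tokens=128)
print(out["choices"][0]["text"])
```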

[–] vatsadev@alien.top 1 points 1 year ago

This is a Google API error, with absolutely nothing to do with ML.

You would probably have better luck on the PaLM or LangChain GitHub.

[–] vatsadev@alien.top 1 points 1 year ago

Well, the model is trained on RefinedWeb, which is about 3.5T tokens, so a little below Chinchilla-optimal for 180B (quick back-of-the-envelope check after the list). Also, the models in the Falcon series feel more and more undertrained as they scale:

  • The 1B model was good, and is still good several newer generations later
  • The 7B was capable pre-Llama 2
  • The 40B and 180B were never as good
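
The back-of-the-envelope check, using the usual ~20-tokens-per-parameter Chinchilla rule of thumb (the ratio is an approximation, not something from the comment):

```python
params = 180e9            # Falcon-180B
trained_tokens = 3.5e12   # RefinedWeb size, per the comment above
optimal_tokens = 20 * params  # Chinchilla rule of thumb: ~20 tokens per parameter

print(f"optimal ≈ {optimal_tokens / 1e12:.1f}T tokens, trained ≈ {trained_tokens / 1e12:.1f}T tokens")
# -> 3.5T is just under the ~3.6T Chinchilla-optimal budget for 180B
```
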
[–] vatsadev@alien.top 1 points 1 year ago

RWKV 1.5B; it's SOTA for its size, outperforms TinyLlama, and uses no extra VRAM to fit its whole context length in the browser (rough math below).
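
Rough illustration of why that is (my numbers, not official specs): a transformer's KV cache grows with sequence length, while a recurrent model like RWKV carries a fixed-size state. The layer count, model width, and head size below are assumptions for a ~1.5B model.

```python
layers, d_model, head_dim, ctx_len, bytes_fp16 = 24, 2048, 64, 4096, 2

kv_cache = 2 * layers * ctx_len * d_model * bytes_fp16  # keys + values, grows with ctx_len
rwkv_state = layers * d_model * head_dim * bytes_fp16   # fixed-size recurrent state (rough)

print(f"KV cache @ {ctx_len} tokens: ~{kv_cache / 2**20:.0f} MiB")
print(f"RWKV-style state:           ~{rwkv_state / 2**20:.1f} MiB (independent of context length)")
```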

[–] vatsadev@alien.top 1 points 1 year ago

Well, the 5 million was just an example of the OP stuff out there.
