LocalLLaMA

14 readers

1 users here now

Community to discuss about Llama, the family of large language models created by Meta AI.

founded 2 years ago

MODERATORS

communick@poweruser.forum

Models Megathread #2 - What models are you currently using? (alien.top)

submitted 2 years ago by Technical_Leather949@alien.top to c/localllama@poweruser.forum

56 comments fedilink hide all child comments

As requested, this is the subreddit's second megathread for model discussion. This thread will now be hosted at least once a month to keep the discussion updated and help reduce identical posts.

I also saw that we hit 80,000 members recently! Thanks to every member for joining and making this happen.

Welcome to the r/LocalLLaMA Models Megathread

What models are you currently using and why? Do you use 7B, 13B, 33B, 34B, or 70B? Share any and all recommendations you have!

Examples of popular categories:

Assistant chatting
Chatting
Coding
Language-specific
Misc. professional use
Role-playing
Storytelling
Visual instruction

Have feedback or suggestions for other discussion topics? All suggestions are appreciated and can be sent to modmail.

^(P.S. LocalLLaMA is looking for someone who can manage Discord. If you have experience modding Discord servers, your help would be welcome. Send a message if interested.)

Previous Thread | New Models

you are viewing a single comment's thread
view the rest of the comments

[–] mcmoose1900@alien.top 1 points 2 years ago

In addition to what others said, exl2 is very sensitive to the quantization dataset, which it uses to choose where to assign those "variable" bits.

Most online quants use wikitext. But I believe if you quantize models yourself on your own chats, you can get better results, especially below 4bpw.