this post was submitted on 14 Nov 2023

LocalLLaMA

Community for discussing Llama, the family of large language models created by Meta AI.

[–] Combinatorilliance@alien.top 1 points 1 year ago (1 children)

I believe these are TheBloke's GGUF quants if anyone's interested: https://huggingface.co/TheBloke/Nous-Capybara-34B-GGUF

[–] WolframRavenwolf@alien.top 1 points 1 year ago (1 children)

Also note this important issue that affects this and all other Yi-based models:

BOS token as 1 seriously hurts these GGUF Yi models

[–] a_beautiful_rhind@alien.top 1 points 1 year ago (1 children)

So we can just skip the BOS token on all these models?

[–] ambient_temp_xeno@alien.top 1 points 1 year ago

I ran this:

gguf-py/scripts/gguf-set-metadata.py some-yi-model.gguf tokenizer.ggml.bos_token_id 144

and the outputs have changed a lot since yesterday.
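
If anyone wants to apply the same change to several Yi GGUF files, here is a rough Python sketch that just builds the command from the comment above and makes a backup copy first. The script path, the metadata key, and the value 144 are taken from that comment; the helper names (build_set_bos_command, backup_model) are made up for illustration, and I'm assuming the script is run from a llama.cpp checkout and rewrites the file it is given, so keeping a backup seems prudent.

```python
import shutil
import sys
from pathlib import Path

# Both values come from the comment above; the path is assumed to be
# relative to the root of a llama.cpp checkout.
SCRIPT = "gguf-py/scripts/gguf-set-metadata.py"
BOS_KEY = "tokenizer.ggml.bos_token_id"


def build_set_bos_command(model_path, bos_token_id, script=SCRIPT):
    """Return the argv list for the metadata edit without running it."""
    if bos_token_id < 0:
        raise ValueError("token id must be non-negative")
    return [sys.executable, script, str(model_path), BOS_KEY, str(bos_token_id)]


def backup_model(model_path):
    """Copy the model next to itself as <name>.bak and return the backup path."""
    src = Path(model_path)
    dst = src.with_name(src.name + ".bak")
    shutil.copy2(src, dst)
    return dst


if __name__ == "__main__":
    cmd = build_set_bos_command("some-yi-model.gguf", 144)
    # Only prints the command; nothing is modified here.
    print(" ".join(cmd))
```

To actually apply it you would call backup_model("some-yi-model.gguf") and then pass the list to subprocess.run(cmd, check=True). I left that out so the sketch has no side effects when you run it.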