this post was submitted on 14 Nov 2023
        
      
      1 points (100.0% liked)
      LocalLLaMA
    11 readers
  
      
      4 users here now
      Community to discuss about Llama, the family of large language models created by Meta AI.
        founded 2 years ago
      
      MODERATORS
      
    you are viewing a single comment's thread
view the rest of the comments
    view the rest of the comments
I believe these are TheBloke's GGUF quants if anyone's interested: https://huggingface.co/TheBloke/Nous-Capybara-34B-GGUF
Also note this important issue that affects this and all other Yi-based models:
BOS token as 1 seriously hurts these GGUF Yi models
So we can just skip BOS token on all these models?
I did the gguf-py/scripts/gguf-set-metadata.py some-yi-model.gguf tokenizer.ggml.bos_token_id 144
and it's changed the outputs a lot from yesterday.