LocalLLaMA

3 readers

1 users here now

Community to discuss about Llama, the family of large language models created by Meta AI.

founded 1 year ago

MODERATORS

communick@poweruser.forum

7B models keep repeating/glitching after certain number of tokens (alien.top)

submitted 11 months ago by GustavoToyota@alien.top to c/localllama@poweruser.forum

4 comments fedilink hide all child comments

I'm using ollama and I have a RTX 3060 TI. Using only 7B models.

I tested with Mistral 7B, Mistral-OpenOrca and Zephyr, they all had the same problem where they kept repeating or speaking randomly after some amount of chatting.

What could it be? Temperature? VRAM? ollama?

you are viewing a single comment's thread
view the rest of the comments

[–] LienniTa@alien.top 1 points 11 months ago

goliath 120b would fit in 64 ram, tho. It doesnt have repeating problem...

permalink
fedilink
source