LocalLLaMA

3 readers

1 users here now

Community to discuss about Llama, the family of large language models created by Meta AI.

founded 1 year ago

MODERATORS

communick@poweruser.forum

The Problem with LLMs for chat or roleplay (alien.top)

submitted 1 year ago by tammmu@alien.top to c/localllama@poweruser.forum

19 comments fedilink hide all child comments

I've been using self-hosted LLM models for roleplay purposes. But these are the worst problems I face every time, no matter what model and parameter preset I use.

I'm using :

Pygmalion 13B AWQ

Mistral 7B AWQ

SynthIA 13B AWQ [Favourite]

WizardLM 7B AWQ

It messes up with who's who. Often starts to behave like the user.
It writes in third person perspective or Narrative.
Sometimes, generates the exact same reply (exactly same to same text) back to back even though new inputs were given.
It starts to generate more of a dialogue or screenplay script instead of creating a normal conversation.

Anyone has any solutions for these?

you are viewing a single comment's thread
view the rest of the comments

[–] Gnodax@alien.top 1 points 1 year ago

That all sounds like the typical symptoms when you feed too much generated content back into the context buffer. Limit the dynamic part of your context buffer to about 1k tokens. At least that's been my experience using 13B models as chatbots. With exllama you just add "-l 1280". Other systems should offer similar functionality.

If you want to get fancy, you can fill the rest of the context with whatever backstory you want.