durden111111

joined 10 months ago
[–] durden111111@alien.top 1 points 10 months ago

Nice. From my tests it seems to be about the same as LLaVA v1.5 13B and BakLLaVA. I'm starting to suspect that the CLIP-Large vision encoder all of these multimodal LLMs are using is holding them back.

[–] durden111111@alien.top 1 points 10 months ago (5 children)

I found it to be worse than OpenHermes 2.5. It just gives shorter, more robotic responses.

[–] durden111111@alien.top 1 points 10 months ago

OpenHermes 2.5 still feels significantly better, imo.

[–] durden111111@alien.top 1 points 10 months ago (2 children)

Hopefully we get GGUFs soon

[–] durden111111@alien.top 1 points 10 months ago

Text Generation WebUI for general inference.

llama.cpp server for multimodal.
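For multimodal use, the llama.cpp server takes the language model GGUF plus a separate multimodal projector file via `--mmproj`. A minimal sketch of the invocation (the file names here are placeholders, not specific models from this thread):

```shell
# Launch the llama.cpp server with a LLaVA-style multimodal projector.
# -m      : path to the language model GGUF (placeholder name)
# --mmproj: path to the vision projector GGUF (placeholder name)
# -c      : context size in tokens
# -ngl    : number of layers to offload to the GPU
./server \
  -m models/llava-v1.5-13b.Q5_K_M.gguf \
  --mmproj models/mmproj-model-f16.gguf \
  -c 4096 \
  -ngl 35
```

Once running, the built-in web UI lets you attach an image alongside the text prompt.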

[–] durden111111@alien.top 1 points 10 months ago

I'm wondering too. OpenHermes 2.5 works fine for me in Oobabooga, but it just stops outputting tokens once it reaches 4k context, despite everything being set for 8k (I'm running the GGUF offloaded to GPU).
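One thing worth double-checking with GGUF loaders is that the context size is applied at model load time, not just in the generation settings. A sketch of explicitly requesting 8k context when loading with llama.cpp directly (the model filename is a placeholder):

```shell
# Load the GGUF with an 8192-token context window set at load time.
# If -c is left at its default, generation can cut off at the smaller
# default context even though the UI sliders say 8k.
./server \
  -m models/openhermes-2.5-mistral-7b.Q5_K_M.gguf \
  -c 8192 \
  -ngl 35
```

In text-generation-webui the equivalent is the `n_ctx` field on the model load tab; changing it requires reloading the model.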
