overview for yahma

🚀 Launching SauerkrautLM-7b-HerO: A New Era in German Language Modeling! in c/localllama@poweruser.forum

[–] yahma@alien.top 1 points 11 months ago (2 children)

Has anyone tested this yet? We have a use case for our European partners in German-speaking countries, and I would like to know what other people's experiences have been.

🚀 Launching SauerkrautLM-7b-HerO: A New Era in German Language Modeling! in c/localllama@poweruser.forum

[–] yahma@alien.top 1 points 11 months ago (1 children)

Very exciting for multi-lingual models. I really hope this one performs as well as the benchmarks suggest.

Could multiple 7b models outperform 70b models? in c/localllama@poweruser.forum

[–] yahma@alien.top 1 points 11 months ago (1 children)

Yes. This is known as Mixture of Experts (MoE).

We already have several promising ways of doing this:

QMoE: A Scalable Algorithm for Sub-1-Bit Compression of Trillion-Parameter Mixture-of-Experts Architectures. Paper - GitHub
S-LoRA: Serving thousands of concurrent adapters.
LoRAX: Serve hundreds of concurrent adapters.
LMoE: Simple method of dynamically loading LoRAs.
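
For anyone unfamiliar with the idea, here is a rough sketch of top-k routing, the gating step that MoE-style setups rely on. It is a toy under stated assumptions, not how any of the projects listed above implement it: the "experts" are plain Python functions on a number, and `route_top_k`, `moe_forward`, and the `gate` callable are hypothetical names made up for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def route_top_k(gate_scores, k):
    """Pick the k highest-scoring experts and normalize their weights.

    Returns a list of (expert_index, weight) pairs whose weights sum to 1.
    """
    ranked = sorted(range(len(gate_scores)),
                    key=lambda i: gate_scores[i],
                    reverse=True)
    top = ranked[:k]
    weights = softmax([gate_scores[i] for i in top])
    return list(zip(top, weights))


def moe_forward(x, experts, gate, k=2):
    """Run only the selected experts and mix their outputs by gate weight."""
    routes = route_top_k(gate(x), k)
    return sum(weight * experts[i](x) for i, weight in routes)


# Toy stand-ins (assumptions): four "experts" and a fixed gate.
experts = [lambda x: x, lambda x: 2 * x, lambda x: 4 * x, lambda x: 0.0]
gate = lambda x: [0.0, 2.0, 2.0, -1.0]

result = moe_forward(1.0, experts, gate, k=2)
```

Only the selected experts are evaluated, which is why several small experts can be cheaper to run than one large dense model. In a real system the gate is learned and the experts are networks or adapters.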

NeuralChat 7B: Intel’s Chat Model Trained with DPO in c/localllama@poweruser.forum

[–] yahma@alien.top 1 points 11 months ago (3 children)

But are the short responses more correct?

Break the Sequential Dependency of LLM Inference Using Lookahead Decoding in c/localllama@poweruser.forum

[–] yahma@alien.top 1 points 11 months ago

Game changer! Would love to see this incorporated into ExLlama, AutoGPTQ, and llama.cpp.

Cheapest site for hosting custom LLM models? in c/localllama@poweruser.forum

[–] yahma@alien.top 1 points 11 months ago

The startup time makes Replicate nearly unusable for me. Only popular models stay in memory. Less-used models shut down, and you need to wait for them to start up before the first inference.

Tesla P40 cards - what cooling solutions work well? in c/localllama@poweruser.forum

[–] yahma@alien.top 1 points 11 months ago (1 children)

Do you have a link to the STL for the 3D print?

OpenAI brings Sam Altman back as CEO in c/localllama@poweruser.forum

[–] yahma@alien.top 1 points 11 months ago

Today I learned there is a large group of "EAs" (Effective Altruists) made up of millionaires and people in high positions of power.

These people believe it is their duty to act as the 'gatekeepers' of AI and prevent regular people from having useful or powerful AI. They want to destroy the open source AI movement and any company that is willing to allow regular people access to powerful AI.

ShareGPT4V - New multi-modal model, improves on LLaVA in c/localllama@poweruser.forum

[–] yahma@alien.top 1 points 11 months ago (1 children)

Would love to use this for handling remote security camera footage.

I tried LLaVA with little success. Has anyone successfully applied any of the open vision models to security camera footage?

Run an OpenAI powered startup. What’s the best alternative to GPT 3.5 with function calling that I can run in the cloud? in c/localllama@poweruser.forum

[–] yahma@alien.top 1 points 11 months ago (3 children)

None of the open models perform function calling as well as OpenAI...

ORCA 2 Released open source! in c/localllama@poweruser.forum

[–] yahma@alien.top 1 points 11 months ago

Dataset release??

Orca 2: Teaching Small Language Models How to Reason in c/localllama@poweruser.forum

[–] yahma@alien.top 1 points 11 months ago (1 children)

Do we get the dataset this time?