[–] Dangerous_Injury_101@alien.top 1 points 1 year ago (3 children)

Perhaps we should have like a hundred different 7B models for different categories (history, arts, science, etc.), and then above that a new layer with a generic LLM that routes the question to the correct category, and finally the matching 7B model gets loaded into your VRAM? :D With the fastest NVMe (not sure if DirectStorage would help, probably not?) maybe the waiting wouldn't be too terrible, unless every one of your questions lands in a different category.
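
Just to make the idea concrete, here's a rough sketch of that "router + per-category 7B experts" setup. Everything in it is a placeholder assumption: the model paths are made up, the keyword match stands in for a real router LLM, and the load step is a stub where you'd actually call llama.cpp / transformers.

```python
# Rough sketch of the router-over-category-experts idea.
# All names and paths are illustrative placeholders, not a real setup.

from functools import lru_cache

# Hypothetical mapping from category to a 7B model stored on NVMe.
CATEGORY_MODELS = {
    "history": "/models/history-7b.gguf",
    "arts": "/models/arts-7b.gguf",
    "science": "/models/science-7b.gguf",
}

def route(question: str) -> str:
    """Pick a category for the question.
    A real router would be a small generic LLM; this keyword
    match is just a stand-in so the sketch runs."""
    q = question.lower()
    if any(w in q for w in ("war", "empire", "century")):
        return "history"
    if any(w in q for w in ("painting", "music", "novel")):
        return "arts"
    return "science"

@lru_cache(maxsize=1)  # keep only the most recent expert "in VRAM"
def load_expert(model_path: str):
    """Placeholder for loading a 7B model from NVMe into VRAM.
    Swapping only happens when the category changes, which is
    where fast storage would matter."""
    print(f"loading {model_path} ...")
    return object()  # stand-in for the loaded model

def answer(question: str) -> None:
    category = route(question)
    model = load_expert(CATEGORY_MODELS[category])
    print(f"[{category}] would run the question through {model!r}")

if __name__ == "__main__":
    answer("Who painted the Mona Lisa?")      # loads the arts expert
    answer("Why did the Roman empire fall?")   # evicts it, loads history
```

If your questions stay in one category you only pay the load cost once; the painful case is exactly the one above, where every question forces a swap.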