this post was submitted on 01 Nov 2023
1 points (100.0% liked)

LocalLLaMA

1 readers
1 users here now

Community to discuss about Llama, the family of large language models created by Meta AI.

founded 10 months ago
MODERATORS
 

I'm curious if there's an ideal setup or pipeline that you can get an LLM to listen and "learn" from you if you just feed it info everyday like a personal diary? Would be interested to see how the model recalls or processes details of my life. Would you just use a web ui like oogabooga to feed info and adapt the model?

top 7 comments
sorted by: hot top controversial new old
[–] Dazzling_Ad1507@alien.top 1 points 10 months ago (1 children)

I might be wrong, but this sounds a lot like a liquid neural network, able to adapt and change on demand.

[–] opi098514@alien.top 1 points 10 months ago

Is….. that a thing? I need that in my life.

[–] No_Afternoon_4260@alien.top 1 points 10 months ago

May be some kind of memgpt..

[–] MordyOfTheMooMoo@alien.top 1 points 10 months ago

Do you have a specific use case or need in mind? If you want it to remember things, you wouldn't necessarily 'feed it into an LLM' but if you want it to produce output more like how you'd speak, then fine-tuning would probably be appropriate.

Depending on what you wanna do, it will have different design requirements.

In general, I'd ask what's the desired goal first.

[–] Severin_Suveren@alien.top 1 points 10 months ago

You will need to feed the model with the conversation log every time you query it, and as such you'd be limited by the context length on the model.

With a 100k context model you'd be able to keep a chat log of about 70-100 000 words, which is about the length of a normal book.

[–] orrorin6@alien.top 1 points 10 months ago (1 children)

I am working on this exact product and the way I am approaching it is having a database with different levels of abstraction for each day.

[–] TwosTooToToosTwo@alien.top 1 points 9 months ago

I am working on this exact product and the way I am approaching it is having a database with different levels of abstraction for each day.

Couldn't you just timestamp each interaction (input and output?)