this post was submitted on 01 Nov 2023

LocalLLaMA

Community to discuss Llama, the family of large language models created by Meta AI.

I'm curious whether there's an ideal setup or pipeline for getting an LLM to listen to you and "learn" from you if you feed it info every day, like a personal diary. I'd be interested to see how the model recalls or processes details of my life. Would you just use a web UI like oobabooga to feed in info and adapt the model?

top 7 comments
[–] Dazzling_Ad1507@alien.top 1 points 1 year ago (1 children)

I might be wrong, but this sounds a lot like a liquid neural network, able to adapt and change on demand.

[–] opi098514@alien.top 1 points 1 year ago

Is….. that a thing? I need that in my life.

[–] No_Afternoon_4260@alien.top 1 points 1 year ago

Maybe some kind of MemGPT.

[–] MordyOfTheMooMoo@alien.top 1 points 1 year ago

Do you have a specific use case or need in mind? If you want it to remember things, you wouldn't necessarily 'feed it into an LLM' but if you want it to produce output more like how you'd speak, then fine-tuning would probably be appropriate.

Depending on what you wanna do, it will have different design requirements.

In general, I'd ask what's the desired goal first.

[–] Severin_Suveren@alien.top 1 points 1 year ago

You will need to feed the model with the conversation log every time you query it, and as such you'd be limited by the context length on the model.

With a 100k-context model you'd be able to keep a chat log of roughly 70,000 to 100,000 words, depending on the tokenizer, which is about the length of a normal book.
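
For illustration, here is a minimal Python sketch of that idea: the diary log is prepended to every query, and the oldest entries are dropped once the context budget runs out. The names `estimate_tokens` and `build_prompt` are hypothetical, and the "4 tokens per 3 words" ratio is only a rough assumption; a real setup would count tokens with the model's own tokenizer.

```python
def estimate_tokens(text: str) -> int:
    # Rough heuristic (an assumption, not a real tokenizer):
    # about 4 tokens for every 3 whitespace-separated words, rounded up.
    words = len(text.split())
    return (words * 4 + 2) // 3


def build_prompt(entries: list[str], question: str, max_tokens: int) -> str:
    """Join as many of the most recent entries as fit, then the question."""
    budget = max_tokens - estimate_tokens(question)
    kept: list[str] = []
    # Walk from newest to oldest so recent entries survive trimming.
    for entry in reversed(entries):
        cost = estimate_tokens(entry)
        if cost > budget:
            break
        kept.append(entry)
        budget -= cost
    kept.reverse()  # restore chronological order
    return "\n".join(kept + [question])
```

Something like `build_prompt(diary_entries, "What did I do last weekend?", max_tokens=100_000)` would then produce the text sent to the model on each query. Anything that falls outside the budget is simply invisible to the model, which is the limitation described above.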

[–] orrorin6@alien.top 1 points 1 year ago (1 children)

I am working on this exact product and the way I am approaching it is having a database with different levels of abstraction for each day.
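
One possible reading of "different levels of abstraction for each day", sketched with Python's built-in `sqlite3`. This is not the actual design of the product mentioned above; the table name `memories`, the three levels, and the helper names are all assumptions made up for illustration. In practice the summaries would presumably be written by an LLM, but here they are passed in as plain strings.

```python
import sqlite3

# Hypothetical abstraction levels: raw diary text, a daily summary, a one-line gist.
RAW, SUMMARY, GIST = 0, 1, 2


def open_db(path: str = ":memory:") -> sqlite3.Connection:
    """Open (or create) the database and make sure the table exists."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS memories ("
        "day TEXT NOT NULL, level INTEGER NOT NULL, content TEXT NOT NULL)"
    )
    return conn


def add_memory(conn: sqlite3.Connection, day: str, level: int, content: str) -> None:
    """Store one piece of text for a given day at a given abstraction level."""
    conn.execute(
        "INSERT INTO memories (day, level, content) VALUES (?, ?, ?)",
        (day, level, content),
    )
    conn.commit()


def recall(conn: sqlite3.Connection, day: str, level: int) -> list[str]:
    """Return everything stored for that day at that level, in insertion order."""
    rows = conn.execute(
        "SELECT content FROM memories WHERE day = ? AND level = ? ORDER BY rowid",
        (day, level),
    ).fetchall()
    return [row[0] for row in rows]
```

With a layout like this, a query about last week could pull the `SUMMARY` rows for those days, while a query about one specific day could pull its `RAW` rows, which keeps the prompt short.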

[–] TwosTooToToosTwo@alien.top 1 points 11 months ago

> I am working on this exact product and the way I am approaching it is having a database with different levels of abstraction for each day.

Couldn't you just timestamp each interaction (input and output?)
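
A minimal sketch of the timestamp approach in Python, assuming an in-memory list as the log. The `Interaction` class and both helper names are hypothetical; a real setup would persist the log somewhere.

```python
from dataclasses import dataclass
from datetime import datetime, timezone
from typing import Optional


@dataclass
class Interaction:
    when: datetime  # timezone-aware timestamp
    role: str       # "user" for input, "assistant" for output
    text: str


def log_interaction(
    log: list[Interaction], role: str, text: str, when: Optional[datetime] = None
) -> Interaction:
    """Append one input or output to the log, stamped with the current UTC time by default."""
    entry = Interaction(when or datetime.now(timezone.utc), role, text)
    log.append(entry)
    return entry


def between(log: list[Interaction], start: datetime, end: datetime) -> list[Interaction]:
    """Return interactions with start <= when < end, in log order."""
    return [entry for entry in log if start <= entry.when < end]
```

Filtering with `between` then gives you "everything from a given day" without any extra structure, though you would still need to fit the result into the model's context.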