This post was submitted on 09 Nov 2023

LocalLLaMA


Community to discuss Llama, the family of large language models created by Meta AI.

top 11 comments
[–] Dorialexandre@alien.top 1 points 10 months ago

Link to the live demo for MonadGPT, with generous GPU support from HuggingFace: https://huggingface.co/spaces/Pclanglais/MonadGPT

The model has been published as well (and soon the dataset): https://huggingface.co/Pclanglais/MonadGPT?text=Hi.
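
For anyone who wants to try the checkpoint locally, here is a minimal sketch using transformers. It assumes the repo ships a chat template (as a Mistral-Hermes finetune it presumably speaks ChatML); the prompt and sampling settings are illustrative, not the author's:

```python
# Hedged sketch: chatting with the released MonadGPT checkpoint locally.
# Assumes the repo provides a chat template (likely ChatML, given the
# Mistral-Hermes base); generation settings are illustrative guesses.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Pclanglais/MonadGPT"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

messages = [{"role": "user", "content": "What are the planets of the solar system?"}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=256, do_sample=True, temperature=0.7)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```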

[–] vec1nu@alien.top 1 points 10 months ago

Which frontend is that?

[–] buzzyness@alien.top 1 points 10 months ago

Very cool, there could be lots of applications for this approach (from an archival standpoint), maybe museums? What are your thoughts on finetuning vs. asking Llama to chat in the style of a 17th century astronomy book?

[–] Dorialexandre@alien.top 1 points 10 months ago

Well, that was actually my original motivation for finetuning. Even GPT-4 is not so good with a proper prompt: the text feels fake and/or struggles to maintain cultural consistency. I think finetuning works better for this task, as there are too many directives to give otherwise, and it helps relieve the model of anachronistic RLHF behavior.

As for applications, I mostly think about education, especially if the model is properly connected to a RAG database. It can be a very interesting way to get immersed in a time period, on any kind of topic.
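
To make that RAG idea concrete, here is a hedged sketch of retrieving a period excerpt and prepending it to the question. The corpus snippets, embedding model, and prompt wiring below are illustrative placeholders, not the actual setup:

```python
# Hedged sketch of the RAG idea: embed a small corpus of period excerpts,
# retrieve the closest one to the user's question, and prepend it as context.
# Corpus contents and the embedding model choice are illustrative only.
from sentence_transformers import SentenceTransformer, util

corpus = [
    "Of the seven planets and their motions ...",  # placeholder excerpts
    "A discourse concerning comets, their nature and portents ...",
]
embedder = SentenceTransformer("all-MiniLM-L6-v2")
corpus_emb = embedder.encode(corpus, convert_to_tensor=True)

query = "How did astronomers explain comets?"
query_emb = embedder.encode(query, convert_to_tensor=True)
hit = util.semantic_search(query_emb, corpus_emb, top_k=1)[0][0]

prompt = f"Context: {corpus[hit['corpus_id']]}\n\nQuestion: {query}"
# `prompt` would then be sent to MonadGPT as the user message.
```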

[–] unamednational@alien.top 1 points 10 months ago

It would be awesome in a classroom. If kids could ask George Washington what exactly happened, I think they'd care more. Plus, they could tell him to go f himself for infinite amusement.

[–] ReMeDyIII@alien.top 1 points 10 months ago

Did we use to spell "we" as "wee"?

[–] oKatanaa@alien.top 1 points 10 months ago

How was it trained? Did you just train it on the passages from those books? If so, I am very surprised it retained its conversational capabilities. I would expect it to just go off the rails and generate random 17th century stuff.

[–] FPham@alien.top 1 points 10 months ago

Interestingly, if you tell OpenHermes-Mistral 2.5 in the system prompt that it is from the 17th century and uses archaic language, it will also say there are seven planets.

> You are MonadGPT, a very old chatbot from the 17th century. Please answer the questions using an archaic language

https://preview.redd.it/0ecpxhg86hzb1.png?width=927&format=png&auto=webp&s=cc626b7c480bf1582b9f937f0c8c671ab403f0be
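
For reference, a sketch of reproducing this experiment, passing the quoted system prompt through the model's ChatML chat template. The repo id is my assumption for the checkpoint the comment refers to:

```python
# Sketch of the experiment above: give OpenHermes-2.5-Mistral-7B the quoted
# 17th-century system prompt via its chat template and ask about the planets.
# The repo id is an assumption for the model the commenter means.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "teknium/OpenHermes-2.5-Mistral-7B"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

messages = [
    {"role": "system", "content": (
        "You are MonadGPT, a very old chatbot from the 17th century. "
        "Please answer the questions using an archaic language"
    )},
    {"role": "user", "content": "How many planets are there?"},
]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```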

[–] Dorialexandre@alien.top 1 points 10 months ago

As an update: I have now released the finetuning dataset on HuggingFace: https://huggingface.co/datasets/Pclanglais/MonadGPT

In total: 10,797 excerpts in early modern English, French, and Latin, with synthetic questions generated by Mistral-Hermes.
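
A quick way to inspect it, with the caveat that the exact column names are an assumption to check against the dataset card:

```python
# Peek at the released finetuning dataset. The split name is the usual default;
# column names are not guaranteed, so print the schema before relying on them.
from datasets import load_dataset

ds = load_dataset("Pclanglais/MonadGPT", split="train")
print(ds)     # schema and row count (10,797 excerpts per the author)
print(ds[0])  # one excerpt with its synthetic question
```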