spacedragon13

joined 10 months ago
 

My boss is a semi famous author in a niche academic field. I have thousands of pages of text coming from books, transcripts, and more.

Is there a straightforward path to creating a corpus to augment Bert or Llama or another llm? End goal being able to chat with this ai that is now trained on his life's work.

Is there anything specific to understand in terms of preparing the corpus? Do I need key value pairs where I write a ton of examples questions and responses?