this post was submitted on 28 Nov 2023
1 points (100.0% liked)

Machine Learning

1 readers
1 users here now

Community Rules:

founded 1 year ago
MODERATORS
 

Hello there,

I'm not sure if this is the right sub to ask this question, so please tell me if there's a better sub to ask this question.

I'm a student and me and my team have the assignment to make a chatbot for our university. We need to make a chatbot that can help other students find information about their course. We will get our data from manuals of multiple universtity websites (as pdf). This data will be turned into Q&A data using ChatGPT 4.

However, we are struggling to find a pre-trained LLM that fits our assignment. We've researched T5, BERT and GPT-2 but our teacher was surprised those were the models we researched, since there are more popular and newer models. Our chatbot must be in Dutch, but we can translate so the LLM doesn't need to be trained on Dutch data. The LLM can't be too big, because we don't have the hardware for very large models.

We are currently looking at openchat and falcon, both with 7B parameters. Are these good options or does anyone have any tips for better LLMs?

you are viewing a single comment's thread
view the rest of the comments
[–] Main_Path_4051@alien.top 1 points 11 months ago

This is a kind of RAG augmented project or llama project with embeddings . Not really complicated. have a look at langchain too, altough this may not be very efficient, that is a first approach.