overview for Temporary-Size7310

Do you think there's a market for local LLMs? in c/localllama@poweruser.forum

[–] Temporary-Size7310@alien.top 1 points 2 years ago

Absolutely (I'm a data scientist too). I own a new ML/DL/AI company and it's going pretty well.

There is so much room to build things even with current models: RAG pipelines on private data, prototyping locally and then training or fine-tuning on instances, real-world applications that mix in other fields like CV or cinematography with SD, combining liquid NNs with LLMs, and so on.

Can't handle efficiently RAG with large PDF in c/localllama@poweruser.forum

[–] Temporary-Size7310@alien.top 1 points 2 years ago

You can find the method here:

https://medium.aiplanet.com/advanced-rag-providing-broader-context-to-llms-using-parentdocumentretriever-cc627762305a

30,000 AI models in c/localllama@poweruser.forum

[–] Temporary-Size7310@alien.top 1 points 2 years ago

There are tons of fine-tuned models, and maybe 6-7 quantized versions of each base and fine-tuned model: open source, commercially usable, uncensored, for RAG, for photo description, for TTS, for CV, with checkpoint updates, and so on.

On the contrary, fortunately there are enough people and enough diversity to adapt to different hardware and objectives without paying a fortune to train or fine-tune models.

E.g., if your needs are commercial, with a model that speaks fluent Spanish, is small enough for fast inference with many clients, is censored, and runs 100% on your local server handling confidential data, there is almost no choice.

Can't handle efficiently RAG with large PDF in c/localllama@poweruser.forum

[–] Temporary-Size7310@alien.top 1 points 2 years ago (2 children)

Some updates:

- I switched to Jina-small-en-v2; the base model crashed due to lack of RAM under WSL2
- Built a parent retriever (chunk 2000) that feeds a child retriever (chunk 400), 0 overlap (will share the method)
- Still using the SciPhi model, but this time with the right template (as indicated by TheBloke): using the model's own prompt template rather than the Alpaca prompt resolved the hallucination problem
- Set Oobabooga text generation to instruct mode by default, loader ExLlamav2_HF

I now get a solid 90% success rate with the PDF. I'll share the code once it is cleaned up and optimized. Thank you all for the help 😊
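For readers who want the idea behind the parent/child setup before the code is shared, here is a minimal sketch in plain Python. It is not the method from the update or the LangChain `ParentDocumentRetriever`: the function names are hypothetical, chunks are cut by character count, and a bag-of-words cosine score stands in for the real embedding model. Only the sizes (2000 and 400, 0 overlap) come from the update above.

```python
import math
from collections import Counter

PARENT_SIZE = 2000  # characters per parent chunk (from the update above)
CHILD_SIZE = 400    # characters per child chunk (from the update above)


def split_fixed(text, size):
    """Cut text into consecutive chunks of `size` characters, no overlap."""
    return [text[i:i + size] for i in range(0, len(text), size)]


def build_index(document):
    """Return (parents, children); each child is (parent_id, child_text)."""
    parents = split_fixed(document, PARENT_SIZE)
    children = []
    for parent_id, parent in enumerate(parents):
        for child in split_fixed(parent, CHILD_SIZE):
            children.append((parent_id, child))
    return parents, children


def bag_of_words(text):
    """Toy stand-in for an embedding: lowercase word counts."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two word-count vectors."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve_parents(query, parents, children, k=2):
    """Rank the small child chunks, then return their larger parents."""
    q = bag_of_words(query)
    ranked = sorted(children,
                    key=lambda c: cosine(q, bag_of_words(c[1])),
                    reverse=True)
    parent_ids = []
    for parent_id, _ in ranked:
        if parent_id not in parent_ids:
            parent_ids.append(parent_id)
        if len(parent_ids) == k:
            break
    return [parents[i] for i in parent_ids]
```

The point of the design is that the small chunks are what gets matched against the question, while the larger parent chunk is what gets passed to the LLM, so the model sees more surrounding context than the matched snippet alone.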

Can't handle efficiently RAG with large PDF (alien.top)

submitted 2 years ago by Temporary-Size7310@alien.top to c/localllama@poweruser.forum

Hi,
I have issues when doing Q&A on my PDF:

- The answers are not consistent
- I tried many models (embedding/LLM)
- I tried many methods
- I get about 20% hallucination, especially with "president" and "mickael jackson"

ex1 (this one is correct):

> question: How much net income for Amazon in 2020, 2021 and 2022 ?

> answer: In 2020, Amazon made a net income of $21.331 billion, in 2021 $33.364 billion and in 2022 the company made a loss of $2.722 billion.

ex2 (this one is incorrect):

> question: How much operating expenses for AWS in 2020, 2021 and 2022 ?

> answer: The operating expenses of AWS in 2020, 2021, and 2022 were $444.943 billion, $501.735 billion, and $567.608 billion respectively.

It always calculates the expenses of the entire company instead of AWS alone. I tried GPT-4 and it handles this correctly.

- PDF: Amazon 2022 annual report, Form 10-K (88 pages)
- Embedding: all-MiniLM-L12-v2
- Text splitter: chunk_size = 1000, overlap = 20
- VectorDB: Chroma
- LLM: SciPhi-Self-RAG-Mistral-7B-32k-8.0bpw-h6-exl2 via Oobabooga (OpenAI extension) with temperature 0.2, Alpaca instruction template
- LangChain: RetrievalQA, chain_type = stuff, retriever = vectDB.as_retriever()
- RTX 3090

If anyone has solved this issue, could you please help me? :)
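To make the splitter setting above concrete, here is a small sketch of what chunk_size = 1000 with overlap = 20 does to a document. It assumes a plain sliding window over characters; the LangChain splitter used in the setup may also break on separators such as newlines, so real chunk boundaries can differ. The function name `split_with_overlap` is hypothetical.

```python
def split_with_overlap(text, chunk_size=1000, overlap=20):
    """Sliding-window split: each chunk repeats the last `overlap`
    characters of the previous chunk."""
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must be >= 0 and smaller than chunk_size")
    step = chunk_size - overlap
    chunks = []
    start = 0
    while start < len(text):
        chunks.append(text[start:start + chunk_size])
        # Stop once the current chunk reaches the end of the text,
        # so no trailing chunk made only of overlap is emitted.
        if start + chunk_size >= len(text):
            break
        start += step
    return chunks


if __name__ == "__main__":
    sample = "".join(str(i % 10) for i in range(2500))
    for i, chunk in enumerate(split_with_overlap(sample)):
        print(i, len(chunk))
```

With only 20 characters shared between neighbours, a financial table that spans more than one chunk can end up with its segment label (for example the AWS row heading) in one chunk and the figures in the next, which is one possible reason a retriever returns company-wide totals instead of segment numbers.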
