this post was submitted on 17 Nov 2023

LocalLLaMA


Community to discuss about Llama, the family of large language models created by Meta AI.

founded 10 months ago

Prompt like:

Extract the company names from the texts below and return as an array

-- ["Google", "Meta", "Microsoft"]
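A minimal sketch of parsing such a response, assuming the model follows the prompt and returns a bracketed JSON-style array somewhere in its reply (the function name is my own):

```python
import json
import re

def parse_company_array(response: str) -> list[str]:
    """Pull a JSON array of strings out of a model reply, tolerating extra text."""
    match = re.search(r"\[.*\]", response, re.DOTALL)  # grab the bracketed span
    if not match:
        return []
    try:
        items = json.loads(match.group(0))
    except json.JSONDecodeError:
        return []
    return [s.strip() for s in items if isinstance(s, str)]

print(parse_company_array('Sure! -- ["Google", "Meta", "Microsoft"]'))
# → ['Google', 'Meta', 'Microsoft']
```

Falling back to an empty list on malformed output keeps downstream code simple, at the cost of silently dropping replies where the model ignored the format.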

top 16 comments
[–] LoSboccacc@alien.top 1 points 10 months ago

not to be an ass but what's wrong with extracting the keywords and then going .split() ?

[–] xelldev13@alien.top 1 points 10 months ago (1 children)

You can do this with a NER model like BERT; it's much faster, but it only does entity recognition

[–] name_is_unimportant@alien.top 1 points 10 months ago

Yeah Named Entity Recognition with BERT works very well, provided that you have a good dataset. Another limitation is that it can only handle 512 tokens
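The 512-token limit is usually worked around by sliding an overlapping window over the input so entities near chunk boundaries aren't cut in half. A rough sketch (using a plain token list as a stand-in for real subword tokens; parameter names are my own):

```python
def chunk_tokens(tokens: list, max_len: int = 512, stride: int = 128):
    """Yield overlapping windows of at most max_len tokens.

    Consecutive windows overlap by `stride` tokens so an entity split by one
    window boundary is seen whole in the neighbouring window.
    """
    if len(tokens) <= max_len:
        yield tokens
        return
    step = max_len - stride
    for start in range(0, len(tokens), step):
        yield tokens[start:start + max_len]
        if start + max_len >= len(tokens):
            break
```

Predictions from overlapping regions then need to be deduplicated when merging results back together.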

[–] platinums99@alien.top 1 points 10 months ago

er Javascript?

[–] DreamGenX@alien.top 1 points 10 months ago

On top of what others said, make sure to include a few-shot examples in your prompt, and consider using constrained decoding (ensuring you get valid JSON for whatever schema you provide; see pointers on how to do it with llama.cpp).
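For llama.cpp, constrained decoding is done with a GBNF grammar. A simplified sketch of one that forces the output to be a JSON array of strings (it ignores escape sequences inside strings, so a real grammar would need more):

```
root   ::= "[" ws (string (ws "," ws string)*)? ws "]"
string ::= "\"" [^"]* "\""
ws     ::= [ \t\n]*
```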

For few-shotting chat models, append fake previous turns, like:

System: 
User: 
Assistant: 
...
User: 
Assistant: 
User: 
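The fake-turns template above can be sketched as an OpenAI-style messages list (the example texts and field contents here are my own invention):

```python
# Fake previous turns that act as few-shot examples for the extraction task.
FEW_SHOT = [
    {"role": "system", "content": "Extract the company names and return a JSON array."},
    {"role": "user", "content": "Google and Meta announced a partnership."},
    {"role": "assistant", "content": '["Google", "Meta"]'},
    {"role": "user", "content": "Microsoft acquired GitHub in 2018."},
    {"role": "assistant", "content": '["Microsoft", "GitHub"]'},
]

def build_messages(text: str) -> list[dict]:
    """Append the real query after the fake example turns."""
    return FEW_SHOT + [{"role": "user", "content": text}]
```

The model sees its "own" earlier answers in the exact target format, which tends to anchor the output format more strongly than instructions alone.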
[–] fvillena@alien.top 1 points 10 months ago

That task is called Named Entity Recognition, and you can do it without training data using our library (it works with any LLM that exposes an OpenAI-compatible API endpoint): https://github.com/plncmm/llmner

[–] noellarkin@alien.top 1 points 10 months ago

By keywords you mean entities? You don't need anything as heavy as a 7B LLM for that. I use https://www.textrazor.com/plans (free up to 500 requests per day).

[–] BrainSlugs83@alien.top 1 points 10 months ago (1 children)

Why do you need an LLM for this? Just use any NER model. It will be blazing fast and run locally.

[–] LPN64@alien.top 1 points 10 months ago

Because let's say you train your BERT model to do this: you end up with a specific, limited set of classes trained on a specific type of document.

It will work on Wikipedia articles but not on transcripts from your local police station.

Using an LLM lets the task inherit the LLM's broad knowledge.

[–] swagonflyyyy@alien.top 1 points 10 months ago

Huggingface transformers has such models available.

[–] _omid_@alien.top 1 points 10 months ago

I use mistral-7b-openorca.q8_0, and this is my prompt:


system:You are a helpful machine. Always answer with the THREE most important keywords from the information provided to you between BEGININPUT and ENDINPUT. Here is an example:\\nUser: BEGININPUT A tree is planted for each contract. Your contribution will be invested 100% sustainably! ENDINPUT\\nassistant: [contract, tree, sustainable]\\nuser:
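Note that the `[contract, tree, sustainable]` format this prompt asks for is not valid JSON (the items are unquoted), so a reply in that shape needs a hand-rolled parser rather than `json.loads`. A sketch (function name is my own):

```python
def parse_keywords(reply: str) -> list[str]:
    """Parse a '[contract, tree, sustainable]'-style reply with unquoted items."""
    inner = reply.strip().strip("[]")
    return [k.strip() for k in inner.split(",") if k.strip()]
```

Asking the model for quoted strings instead would let you reuse a standard JSON parser.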
[–] BeneficialLeader4357@alien.top 1 points 10 months ago

IMHO - I think Zephyr beats Mistral here.

[–] laveshnk@alien.top 1 points 10 months ago

NLTK / Beautiful Soup should have some tools for this. I guess it's NER.

For the record, I wouldn't advise using an LLM for this task, unless you can afford to waste VRAM.

[–] PrometheusZer0@alien.top 1 points 10 months ago

Seems like a job well suited for spaCy?

[–] AsliReddington@alien.top 1 points 10 months ago

Yeah man just use langchain+pydantic class/guidance lib by MS with Mistral Instruct or Zephyr & you're golden