this post was submitted on 08 Nov 2023
1 points (100.0% liked)

Machine Learning

1 readers
1 users here now

Community Rules:

founded 11 months ago
MODERATORS
 

What is the best methods to detect harmful content such as racial abuse in tweets?

I'm thinking about a research project in which I try various methods and compare their accuracy. Am I right in thinking that Naive Bayes, Logistic Regression, Support Vector Machine, LSTM and BERT would be some of the best methods?

top 3 comments
sorted by: hot top controversial new old
[–] tail-recursion@alien.top 1 points 10 months ago

You could try an open source LLM like Llama 2. You could probably use Langchain tools to give it a tool to tag when a tweet has harmful content.

[–] Inevitable_Brain_785@alien.top 1 points 10 months ago

Cohere AI allows you to create classification models by fine-tuning our transformer based embedding models for your use case. It's free to train a classification finetune so give it a slot. To the best of my knowledge this is near state of the art

[–] fulowa@alien.top 1 points 10 months ago