this post was submitted on 02 Feb 2024
It's interesting, right?
I'm thinking the architecture of the fediverse makes it particularly vulnerable to these sorts of attacks.
I'm pretty sure I've also spotted bots circle jerking on some subjects, which makes me think there are a few different sources.
Very interesting indeed.
I'm starting to report, block and ban accounts that use abusive language so they can't be viewed on my instance, but from a systemic standpoint we should find a design solution to make this work.
Reddit had karma for this reason among others. People needed to make helpful contributions to prove they are able to function in the group.
For many reasons this is not implemented in the fediverse but a design solution would be good.
If I were designing an anti-troll/bot system I'd implement a few things. Let's call any bad actor on here a bot/troll, or "broll" for ease.
Very good ideas! Any idea if something like this already exists? If not, shall we work on something? I have some experience in python if that helps.
PieFed is an open source Lemmy alternative (written in Python) that makes good use of karma/reputation, as shown in this video:
https://mastodon.nzoss.nz/system/media_attachments/files/111/648/646/494/228/522/original/02cb1b5182a1f9b6.mp4
Try the demo site at https://piefed.social and check out https://join.piefed.social. Also see https://piefed.social/c/piefed_meta for recent feature announcements.
Thank you very much for sharing, I'll keep an eye on it.
I'm not searching for another thing to start but a way to make the current thing work. But thanks.
To be fair, PieFed uses Lemmy communities and comments, it's almost just another interface.
The reputation is indeed interesting, example in this thread with warnings "low reputation, beware!": https://piefed.social/post/27070#post_replies
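For illustration only, here is a minimal sketch of how a warning like that could be derived from vote counts. This is not PieFed's actual code: the `Account` type, the `reputation` formula, and the threshold of -10 are all assumptions made up for the example.

```python
from dataclasses import dataclass

# Hypothetical threshold: accounts scoring below this get a warning label.
LOW_REPUTATION_THRESHOLD = -10


@dataclass
class Account:
    """Illustrative account record; the field names are assumptions."""
    name: str
    upvotes_received: int = 0
    downvotes_received: int = 0


def reputation(account: Account) -> int:
    """Assumed formula: upvotes received minus downvotes received."""
    return account.upvotes_received - account.downvotes_received


def warning_label(account: Account) -> str:
    """Return a warning for low-reputation accounts, else an empty string."""
    if reputation(account) < LOW_REPUTATION_THRESHOLD:
        return "low reputation, beware!"
    return ""
```

A brand new account starts at zero under this assumed formula, so it would not be flagged until it has collected a meaningful number of downvotes.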
Ah! Understood. Thanks for clarifying.
Thanks but not sure what's currently implemented or even what the code base is written in 😅.
I might have a poke around and see if there's any low hanging fruit.
Call me crazy, but with a 5B IPO about to start I'd be shocked if Reddit wasn't paying some troll farms to brigade the fediverse, and it'd be a shame if spez wins.
That's an interesting idea! Thank you very much for mentioning it!
We can absolutely write a bot in Python and could try to use it like that. I already made a Discord bot so this shouldn't be brutally hard.
Awesome 👍 I'm more C#/Java, Angular if there's anything I can contribute.
Well, I do know some C# but not enough for it to be functional.
You could hit me up on GitHub or PM here to get a repo set up somewhere and go from there.
What do you think?
I'm currently on holidays but that sounds great when I return. I might even get started early. Can you PM the details?
The other person we came across is now trying to somehow discredit me. Whatever their plan is. Jeez.
Sure, I'll send you a PM. Have a nice vacation.
Thanks. Yeah that was odd
If this LLM-detection function ever results in false positives, this system will be banning innocent people.
Also there are many, many cases where a person openly displays results from an LLM, without it being in any way antisocial.
The odds against someone coming up with the same sentence as an LLM within any common-sense span of time far exceed the odds against winning the lottery or getting struck by lightning.
Your second point is straight up nonsense. This platform is for humans to interact. The use of bots is inherently deceptive.
Fascinating to have someone argue for them. I think the backend logs will be pretty illuminating.
I don't see what a person "coming up with the same sentence as an LLM" has to do with this, unless the LLM detection is based on direct string comparison.
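To make the distinction concrete, here is a rough sketch contrasting direct string comparison with a fuzzy similarity check. Nothing in this thread says how the proposed detector would actually work; the function names and the 0.9 cutoff are assumptions for illustration. It uses `difflib.SequenceMatcher` from the Python standard library.

```python
from difflib import SequenceMatcher


def exact_match(comment: str, llm_output: str) -> bool:
    """Direct string comparison after trimming whitespace and lowercasing."""
    return comment.strip().lower() == llm_output.strip().lower()


def similarity(comment: str, llm_output: str) -> float:
    """Similarity ratio in [0, 1] from difflib; 1.0 means identical."""
    return SequenceMatcher(None, comment.lower(), llm_output.lower()).ratio()


def looks_generated(comment: str, llm_output: str, cutoff: float = 0.9) -> bool:
    """Hypothetical rule: flag a comment that is nearly identical to LLM output."""
    return similarity(comment, llm_output) >= cutoff
```

Either check would also fire on a comment that openly quotes LLM output, so a detector built this way would need some extra signal to separate disclosed use from deceptive use.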
Nope. I can say:
That is not deceptive. But it would be detected by this system and result in them being banned. Because you guys are gung-ho to build a powerful head-cracking machine and didn’t think of an obvious edge case.
You're wrong, you don't have the technical knowledge to understand why, and I can't be bothered to explain it.
Relax, it won't affect that case.