Surge in fake citations uncovered by audit of 2.5 million biomedical science papers
(www.nature.com)
Ironically, this is something (language processing) that LLMs might actually be reasonably good at flagging, with a bit of work.
Would they, though? Evaluation demands comprehension, and can current LLMs reason at that level? They're stochastic character stream generators. Maybe a symbolic AI or some future generation of deep learning engine could manage it. LLMs do a sometimes acceptable job at some tasks, but I'm skeptical that this task is well suited to this generation of AI.
Hence "flag", as in flag for a human double-check. I expect they could be trained to a fairly high hit rate, but the output will still be probabilistic (and prone to hallucination).
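To make the "flag, then human check" idea concrete, here is a rough Python sketch. Everything in it is an assumption for illustration: the `Citation` class, the `score` field (a stand-in for whatever probability a trained model might output), the `triage` function, and the 0.5 threshold are all hypothetical, and the entries are made-up placeholders rather than real data.

```python
from dataclasses import dataclass


@dataclass
class Citation:
    text: str
    # Hypothetical model-estimated probability that the citation is fabricated.
    score: float


def triage(citations, threshold=0.5):
    """Split citations into (flagged, passed) by score.

    The model only flags; a person makes the final call on anything flagged.
    """
    flagged = [c for c in citations if c.score >= threshold]
    passed = [c for c in citations if c.score < threshold]
    return flagged, passed


# Made-up placeholder entries and scores, purely for illustration.
refs = [
    Citation("Reference A", 0.92),
    Citation("Reference B", 0.07),
    Citation("Reference C", 0.55),
]

flagged, passed = triage(refs, threshold=0.5)
for c in flagged:
    print(f"needs human check: {c.text} (score {c.score:.2f})")
```

Lowering the threshold sends more citations to a human and misses fewer fakes; raising it does the opposite. Either way the scores stay probabilistic, so nothing here replaces the human check.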