Wikipedia Editors Adopt ‘Speedy Deletion’ Policy for AI Slop Articles | 404 Media

⁨146⁩ ⁨likes⁩

Submitted ⁨⁨6⁩ ⁨months⁩ ago⁩ by ⁨theangriestbird@beehaw.org⁩ to ⁨technology@beehaw.org⁩

https://www.404media.co/wikipedia-editors-adopt-speedy-deletion-policy-for-ai-slop-articles/

source

Comments

Sort:hotnew top

theangriestbird@beehaw.org ⁨6⁩ ⁨months⁩ ago
I think the how is the most interesting part here.

The solution Wikipedians came up with is to allow the speedy deletion of clearly AI-generated articles that broadly meet two conditions. The first is if the article includes “communication intended for the user.” This refers to language in the article that is clearly an LLM responding to a user prompt, like "Here is your Wikipedia article on…,” “Up to my last training update …,” and "as a large language model.” This is a clear tell that the article was generated by an LLM, and a method we’ve previously used to identify AI-generated social media posts and scientific papers.

The other condition that would make an AI-generated article eligible for speedy deletion is if its citations are clearly wrong, another type of error LLMs are prone to. This can include both the inclusion of external links for books, articles, or scientific papers that don’t exist and don’t resolve, or links that lead to completely unrelated content. Wikipedia’s new policy gives the example of “a paper on a beetle species being cited for a computer science article.”

source
- hansolo@lemmy.today ⁨6⁩ ⁨months⁩ ago
  JHFC
  
  source
- jarfil@beehaw.org ⁨6⁩ ⁨months⁩ ago
  Sounds fair. If someone doesn’t even try to clean up a generated article, then nuke it.
  
  Only issue might be… that creating an automated cleanup tool to remove those triggers, wouldn’t be all that difficult.
  
  source
  - ranandtoldthat@beehaw.org ⁨6⁩ ⁨months⁩ ago
    Speedy deletion is for deletions that require zero discussion, so it needs to be very simple and clear. For less sloppy genai there may need to be a discussion (unless it falls under different speedy deletion criteria.
    
    Sometimes those discussions are very straightforward, but they allow for dissenting voices. But for “almost obvious” cases not a lot of effort is spent on them.
    
    source
    -> View More Comments
kehet@sopuli.xyz ⁨6⁩ ⁨months⁩ ago
I don’t understand why ppl do this. Is this malicious? Do they think they are somehow helping?

source
- smeg@feddit.uk ⁨6⁩ ⁨months⁩ ago
  I saw a similar story about how open source software projects (I think it was curl) have cancelled their bug bounty programme because it’s being overrun with LLM-generated reports and they don’t have enough volunteers to verify them all. The relevant bit is that while many were doing it for the financial reward, some do it for reputation and some genuinely do think they’re helping by adding info they think is missing but not realising that what they’re posting is unreliable.
  
  source