this post was submitted on 11 Oct 2024
708 points (99.4% liked)

Technology

58682 readers
4005 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS
 

Wikipedia has a new initiative called WikiProject AI Cleanup. It is a task force of volunteers currently combing through Wikipedia articles, editing or removing false information that appears to have been posted by people using generative AI.

Ilyas Lebleu, a founding member of the cleanup crew, told 404 Media that the crisis began when Wikipedia editors and users began seeing passages that were unmistakably written by a chatbot of some kind.

you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 237 points 5 days ago (10 children)

Further proof that humanity neither deserves nor is capable of having nice things.

Who would set up an AI bot to shit all over the one remaining useful thing on the Internet, and why?

I'm sure the answer is either 'for the lulz' or 'late-stage capitalism', but still: historically humans aren't usually burning down libraries on purpose.

[–] [email protected] 1 points 13 hours ago

Its because there's no accountability for cybercrimes. If humans always had a button to burn down libraries, I'm sure they would have. Instead they had to put themselves in harms way to do such things.

People do things cause they can, and fucking with Wikipedia is apparently simple.

[–] [email protected] 115 points 5 days ago

State actors could be interested in doing that. Same with the internet archive attacks.

[–] [email protected] 98 points 5 days ago (1 children)

historically humans aren't usually burning down libraries on purpose.

How on earth have you come to this conclusion.

[–] [email protected] 36 points 5 days ago (1 children)

To be fair, it's usually to effect cultural genocide. It's not average people burning libraries, it's usually some kind of authoritarian regime.

[–] [email protected] 34 points 5 days ago* (last edited 5 days ago)

* looks around and gestures broadly in agreement*

[–] [email protected] 13 points 4 days ago

historically humans aren't usually burning down libraries on purpose.

Sometimes they are, Baghdad springs to mind, I'm sure there are other examples. And this library is online so there's less chance of getting caught with a can of petrol and a box of matches.

Then there's every authoritarian regime that tries to ban or burn specific types of books. What we're seeing here could be more like that - an attempt to muddy the waters or introduce misinformation on certain topics.

[–] [email protected] 18 points 5 days ago

Florida says hello. A bunch of other places too, sadly:-(.

[–] [email protected] 9 points 5 days ago

Because basement losers can't conquer and raze libraries to the ground.

The internet has shown that assumed anonymity result in people fucking with other people's lives for the hell of it. Viruses, trolling, etc. This is just the next stage of it because of a new easy to use tool.

[–] [email protected] 4 points 4 days ago (2 children)

Yeah but the other thing about humanity is it's mostly harmless. Edits can be reverted, articles can be locked. Wikipedia will be fine.

[–] [email protected] 14 points 4 days ago* (last edited 4 days ago)

Edits can be reverted, articles can be locked.

Sure, but the vandalism has to be identified first. And that takes time and effort.

[–] [email protected] -4 points 4 days ago* (last edited 4 days ago)

Wikipedia relies on sources, and humans choosing the sources like newspapers. And those newspapers are more and more inside a "bubble" that rejects any evidence or reporting presented by a competing bubble.

Right now wikipedia is covering up one of the greatest acts of mass murder of our times, because the newspapers are covering it up, or rejecting evidence because it's by the "enemy". Part of this is a defensive posture against AI bots and enemy disinformation.

[–] [email protected] 4 points 5 days ago

It's not about on purpose but usually most people don't care about what's not in their interest. Today interests are usually quite shallow what tiktok shows quite well. Libraries do require money for operating. Even internet archive and wikipedia

[–] [email protected] 3 points 5 days ago
[–] [email protected] 1 points 5 days ago* (last edited 5 days ago) (1 children)

Maybe a strange way of activism that is trying to poison new AI models 🤔

Which would not work, since all tech giants have already archived preAI internet

[–] [email protected] 8 points 5 days ago (3 children)

Ah, so the AI version of the chewbacca defense.

I have to wonder if intentionally shitting on LLMs with plausible nonsense is effective.

Like, you watch for certain user agents and change what data you actually send the bot vs what a real human might see.

[–] [email protected] 2 points 4 days ago

I suspect it would be difficult to generate enough data to intentionally change a dataset. There are certainly little holes, like the glue pizza thing, but finding and exploiting them would be difficult and noticing you and blocking you as a data source would be easy.

[–] [email protected] 1 points 4 days ago

I have to wonder if intentionally shitting on LLMs with plausible nonsense is effective.

I don't think so. The volume of data is too large for it to make much of a difference, and a scraper can just mimic a human user agent and work that way.

You'd have to change so much data consistently across so many different places that it would be near-impossible for a single human effort.

[–] [email protected] 0 points 5 days ago

I never told that I think it is smart…