this post was submitted on 17 Feb 2024
1059 points (98.8% liked)

Technology

[–] [email protected] 24 points 10 months ago (1 children)

It will get trained on some comment posts.

Let reddit die. Join Lemmy or /kbin. https://join-lemmy.org/ https://kbin.pub/

[–] [email protected] 11 points 10 months ago (6 children)

And what's to stop instance owners from selling their data?

[–] [email protected] 12 points 10 months ago* (last edited 10 months ago) (1 children)

The eggs are not all in one basket. Less data to sell.

[–] [email protected] 11 points 10 months ago

Thanks to federation, the copies of the eggs are. You can’t stop one instance from selling data sourced from federated content until it’s too late.

[–] [email protected] 8 points 10 months ago

You can't put a price tag on it. Nothing is stopping anyone from scraping all of the data for free.

[–] [email protected] 8 points 10 months ago

The only thing stopping them is the fact that anyone who wants the data can just utilize the federation protocol to take any data they want, and there's not a lot anyone can do about it. You can't sell something that's trivial to get for free.

If the question you're really asking is "what's stopping content on Lemmy/Mastodon/etc from being used to train an LLM?" the answer is, nothing.

[–] [email protected] 6 points 10 months ago (2 children)

A mass user exodus to one of the many other identical instances. Also, data brokers probably aren't interested in going after each instance, because no single instance has enough data to make it worthwhile. Yet again, the fediverse proves its resistance to enshittification.

[–] [email protected] 4 points 10 months ago

Lmao, if it gets as big as Reddit then it's worth scraping. It's not the fediverse making it less worthwhile, just the size.

[–] [email protected] -1 points 10 months ago (1 children)

Yes, it's not worth running an instance! So let's all run one! LOL. It's so worth it. Fuck reddit.

[–] [email protected] 1 points 10 months ago

I wish they had evil lawyers looking after this kind of stuff and sold strictly opt-in data to AI corps. Free for FOSS, though.