this post was submitted on 15 Mar 2024
492 points (95.4% liked)

Technology

59378 readers
4188 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 1 points 8 months ago (1 children)

Is this bot a closed system which is being used for profit? No, you know exactly what its source is (the single article it is condensing) and even has a handy link about how it is open source at the end of every single post.

[–] [email protected] 0 points 8 months ago (1 children)

It copied all of its text from the article, and it allows me to get all the information from it I want without providing that publisher with traffic or ad revenue. That's not fair use.

I do like the bot, and personally I'd rather it stay, but no matter how you look at it this isn't "fair use" of the article.

[–] [email protected] 0 points 8 months ago (1 children)

Interesting take. In all of the defences of LLMs using copyrighted material it's very often highlighted that "fair use" allows exactly such summaries of larger texts.

In reality, "fair use" is ruled on a case by case basis, so it's impossible to judge whether something is or not without it going to court.

[–] [email protected] 0 points 8 months ago

We're not making legislation here, so we don't have that level of burden of proof. But either way, when it comes to factors of fair use that every authority on the matter will list, it violates almost all of them.

It's non-commercial, and it's using facts rather than using a more creative work, so it's got that going for it... But it's

  • composed of 100% copied material

  • it's not transformative

  • it's substituting the original work

  • it uses officially published work

  • it specifically copies the "heart" of the work

  • it bypasses all of the ads and impacts their traffic/metrics so it has a financial impact on them.

It's pretty obvious that there is no argument here. The factors that are violated the hardest and most undisputably are the ones that most authorities on the matter (including the one I linked) agree are the most important.