this post was submitted on 20 Sep 2023
556 points (95.6% liked)

Technology

59246 readers
3330 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 54 points 1 year ago (56 children)

The authors added that OpenAI’s LLMs could result in derivative work “that is based on, mimics, summarizes, or paraphrases” their books, which could harm their market.

Ok, so why not wait until those hypothetical violations occur and then sue?

[–] [email protected] -4 points 1 year ago (17 children)

Because that is far harder to prove than showing OpenAI used his IP without permission.

In my opinion, it should not be allowed to train a generative model on data without permission of the rights holder. So at the very least, OpenAI should publish (references to) the training data they used so far, and probably restrict the dataset to public domain--and opt-in works for future models.

[–] [email protected] 2 points 1 year ago (1 children)

Assuming that books used for GPT training were indeed purchased, not pirated, and since "AI training" was not prohibited at the time of the purchase, the engineers had every right to use them. Maybe authors in the future could prohibit "AI training" but for the books purchased before they do, "AI training" is a fair usage.

[–] [email protected] 2 points 1 year ago* (last edited 1 year ago)

I think we'll find our whether or not that is true will be decided in a trial like this.

load more comments (15 replies)
load more comments (53 replies)