this post was submitted on 02 Nov 2023
218 points (100.0% liked)

Technology

[–] [email protected] 5 points 1 year ago (1 children)

A low number of bytes per image doesn't necessarily mean there's no copying of the original data. There are examples of some images being "compressed" (lossily) by Stable Diffusion; in that case the images were specifically sought out, but I think it does show that overfitting is an issue, even if the model is small enough that it can't overfit on every image.
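
To make the bytes-per-image point concrete, here is a rough Python sketch. The numbers are illustrative round figures I'm assuming for the example (a model of about 4 GB, a training set of about 2 billion images, 10 KB per lossy copy); none of them come from this thread or the linked article, and the function names are made up for illustration.

```python
def average_bytes_per_image(model_bytes: int, num_images: int) -> float:
    """Average model capacity per training image, in bytes."""
    if model_bytes <= 0 or num_images <= 0:
        raise ValueError("model_bytes and num_images must be positive")
    return model_bytes / num_images


def images_storable(budget_bytes: int, bytes_per_copy: int) -> int:
    """How many lossy copies fit in a given slice of the model's capacity."""
    if budget_bytes < 0 or bytes_per_copy <= 0:
        raise ValueError("budget_bytes must be >= 0 and bytes_per_copy > 0")
    return budget_bytes // bytes_per_copy


# Assumed, illustrative values (not measurements):
MODEL_BYTES = 4 * 10**9      # roughly a 4 GB model
NUM_IMAGES = 2 * 10**9       # roughly 2 billion training images

# The average is tiny, far too small to hold any real image...
print(average_bytes_per_image(MODEL_BYTES, NUM_IMAGES))  # 2.0

# ...but an average says nothing about how capacity is distributed.
# If just 1% of the model (40 MB here) ended up memorizing images at
# an assumed 10 KB per lossy copy, that would be room for thousands.
print(images_storable(40_000_000, 10_000))  # 4000
```

So the average only rules out the model storing every image; it can't rule out a small number of images being stored fairly faithfully.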

[–] [email protected] 1 points 1 year ago

Overfitting is an issue for the images that were overfit. But note that, in that article, those images mostly appeared many times in the data set.

People who own the rights to one of those images have a valid argument. Everyone else doesn't.