Child abuse images removed from AI image-generator training source, researchers say

girlfreddy@lemmy.ca · 3 months ago

Child abuse images removed from AI image-generator training source, researchers say

Iapar@feddit.org · 3 months ago

Complete failure of everyone involved that it was in there in the first place.

istanbullu@lemmy.ml · 3 months ago

These datasets have billions of images in them (The Laion database have 5 billion images!). There is no way a human can go through them to check for bad content.

Iapar@feddit.org · 3 months ago

Then don’t just use it? Or use a program? There a multiple ways to not do something stupid and none of them occurred to them because it is more important to them to be at the top of the shitpile.

istanbullu@lemmy.ml · 2 months ago

The dataset sizes needed for machine learning rule out any kind of human verification. It’s just not possible to manually check billions of images.

Iapar@feddit.org · 2 months ago

Oh, that makes it okay then.

istanbullu@lemmy.ml · 2 months ago

How would you check 5 billion images?

Iapar@feddit.org · 2 months ago

Mu.

I wouldn’t use a amount of images I couldn’t check. I wouldn’t use images from unchecked sources. I wouldn’t make money from sexual exploited children.

And I think people that don’t see the most obvious solution to that are fucked in the head.

istanbullu@lemmy.ml · 2 months ago

That won’t work. Models of this kind need billions of images or they are trash.