Dropsitenews published a list of websites Facebook uses to train its AI on. Multiple Lemmy instances are on the list as noticed by user BlueAEther
Hexbear is on there too. Also Facebook is very interested in people uploading their massive dongs to lemmynsfw.
Full article here.
Link to the full leaked list download: Meta leaked list pdf
Participating in a public forum that has no technical way of preventing data from being used by a particular class of actor does not preclude having an opinion that a particular class of actor should have rules about what data they are allowed to use.
People can have whatever opinions they want to have. In this case that opinion flies in the face of obvious reality and I’m pointing that out.
It’s like trying to drive your car across the Atlantic ocean and then griping about how the car failed to stay above the water because you really thought it should be able to handle that.
It doesn’t matter how many pithy analogies you make. You need to recognize the difference between “I know they’re scraping this website because they can” and “I don’t think they should be allowed to scrape this website”. You’re arguing that they’re incompatible when they’re not.
As I said, people can have whatever opinion they want. Reality is under no obligation to respect those opinions.
Analogies are merely explanatory.
If you understand, then you should be able to understand that your “they were dressed like they wanted it” level argument bullshit is completely unnecessary.
Ah, the “people who disagree with me are supporting rape” argument, how classy.
It’s not that they’re “dressed like they wanted it.” The ActivityPub protocol explicitly and deliberately does this. If you post a comment on a Fediverse community then by design that comment is going to be broadcast to every instance with a subscription and displayed in public to anyone who wants to see it. That’s what the protocol is for. There should be no misunderstanding or misinterpretation here.
Ok? That doesn’t mean that everyone has to agree that AI companies should be allowed to train on the data. Are you seriously so dense you can’t distinguish between technology and social issues?
Ps: I very obviously didn’t say you support rape, but drew the very obvious comparison to what you’re saying. Use your head for 2 seconds.
It means that people should not be surprised that AI companies are training on their data. They’re deliberately putting their content out into the world where AI trainers can read it in an uncontrolled manner, and reading it is all that’s needed for AI training.
There have already been a number of lawsuits about AI training and thus far nothing seems to indicate that it’s something that copyright restricts. If you know of any cases that have established otherwise I suppose feel free to link them, but until then there’s nothing illegal going on here.
If you just want to be angry about it then I suppose there’s nothing stopping you on that count. Go ahead.
It isn’t about what is currently legal under the law! People can discuss how they would prefer society works, and should! This is what was happening in this thread and that’s why you trying to shove your “well actually this system is federated and it’s not illegal” is pointless and unwanted. You’re not bringing anything to the conversation because you can’t even tell what the conversation is about, apparently.