Leaked list shows Facebook training their AI on multiple Lemmy instances

cm0002@lemmy.world · 4 months ago

Leaked list shows Facebook training their AI on multiple Lemmy instances

FaceDeer@fedia.io · 4 months ago

Sure, you can look for mitigations. In the course of looking for mitigations, wouldn’t it be nice if someone let you know that the idea you’d come up with as a mitigation was not going to work?

_‌_反いじめ戦隊@ani.social · 4 months ago

Then let’s look for another! Whta do you propose?

FaceDeer@fedia.io · 4 months ago

I’ve given my suggestion in other comments in this thread. In short: if you don’t want your comments to be seen by all, then don’t post them on a public forum that uses an open protocol specifically designed to broadcast your comments to everyone who cares to listen. Perhaps use some closed-off forum instead, preferably run by a large and litigious company that guards its possessions jealously.

_‌_反いじめ戦隊@ani.social · 4 months ago

Ok, so get illegally scraped and copyright violated, got it boss.

FaceDeer@fedia.io · 4 months ago

Got any citations about this being illegal? If it is then the whole ActivityPub protocol is in trouble.

_‌_反いじめ戦隊@ani.social · 4 months ago

[Insert any copyright law you’re domicile to] Consult your lawyer about copyright violations of federated content. I am not yours to violate.

FaceDeer@fedia.io · 4 months ago

Have there been any relevant lawsuits you could point me to? Vaguely waving in the air and declaring “copyright” is not helpful.

_‌_反いじめ戦隊@ani.social · 4 months ago

I hope you’re just acting foolish for being ignorant on law.

You have yet ask me for a license to read my posts.

Get your lawyer b4 you’re sued.

FaceDeer@fedia.io · 4 months ago

That article’s proposal is incompatible with how the Fediverse works. It proposes licensing models for viewing, printing, and copying, but all of this hinges on the content being delivered in a protected format that enforces those restrictions. It describes using encrypted “software envelopes” that check with a central server for authorization before allowing access. If content is freely accessible without technical restrictions, then legally, it’s considered published and available to the public.

I am never going to ask you for a license to read your posts. Go ahead, sue me.

qaz@lemmy.world · edit-2 4 months ago

They’re just using very simple scrapers that don’t have any knowledge about how the site operates. The simplest counter would probably be using Anubis on the web interface.

I wouldn’t mind waiting 2-3 seconds when first loading the site and mobile apps would remain unaffected since they use the API.