- cross-posted to:
- [email protected]
130
- cross-posted to:
- [email protected]
:pona_plush: #FediPact :pona_plush: (@[email protected])
cyberpunk.lol# INSTANCES KNOWN TO HAVE BEEN SCRAPED BY META INCLUDE:
• mastodon.social
• mastodon.online
• tech.lgbt
• hackers.town
• chaos.social
• mastodon.org.uk
• mastodont.cat
• mastodon.de
• mastodon.xyz
• mastodon.coffee
• mastodon.cloud
• mastodon.scot
• mastodonapp.uk
• mastodon.green
• mastodon.ml
• mastodon.au
• mastodon.eus
• mastodonczech.cz
• mastodon.sdf.org
• mstdn.social
• troet.cafe
• techhub.social
• tchncs.de
• kolektiva.social
• mamot.fr
• defcon.social
• meow.social
• social.linux.pizza
• ioc.exchange
• eldritch.cafe
• yiff.life
• furry.engineer
• infosec.exchange
• blahaj.zone
• woof.group
• union.place
• queer.party
• sakurajima.moe
• pawb.social
• digipres.club
• journa.host
• corteximplant.net
• corteximplant.com
• octodon.social
• bitbang.social
• jorts.horse
• tenforward.social
• pnw.zone
• spore.social
• hear-me.social
• neuromatch.social
• vt.social
• cosocial.ca
• chitter.xyz
• tooter.social
• cloudisland.nz
• social.seattle.wa.us
• masto.es
• nobigtech.es
• mastodon.gal
• masto.host
• toot.community
• pony.social
• climatejustice.global
• pleroma.envs.net
• indiepocalypse.social
• anarchism.space
• disroot.org
• dragonscave.space
• toot.bike
• fuzzies.wtf
• norden.social
• beige.party
• ohai.social
• freeradical.zone
• metalhead.club
• treehouse.systems
• icosahedron.website
• sunbeam.city
• sunny.garden
• zeroes.ca
• ursal.zone
• chaosfem.tw
• mas.to
• mathstodon.xyz
• rubber.social
• todon.nl
• cupoftea.social
• nerdculture.de
• toad.social
there're definitely more, i just did ctrl+f when i thought of an instance name so i definitely missed some. will be editing this list to add them as i think of them
#FediPact #meta #threads
That doesnt necessarily mean that training AI on this data is legal. Especially when multiple of these instances had legal documents in place specifically forbidding this kind of use.
There are some lawsuits in motion about this and the early signs are that it is indeed legal. For example, in Kadrey et al v. Meta the judge issued a summary judgment that training an AI on books was “highly transformative” and fell under fair use, and similarly in Bartz, Graeber and Johnson v. Anthropic the judge ruled that training an AI on books was fair use. I always expected this would be the case since an AI model does not literally contain the training material it was trained on, it learns patterns from the training material but that’s not the same as the literal expression of the training material. Since the training material isn’t being copied there’s nothing for copyright to restrict here.