SoA day of action following allegations of Meta’s mass theft of authors’ work - eviltoast

cross-posted from: https://lemmy.dbzer0.com/post/41302017

https://societyofauthors.org/2025/04/01/soa-day-of-action-following-allegations-of-metas-mass-theft-of-authors-work/

The SoA is organising a day of protest against Meta following revelations of pirated books being used to train their large language models

On Thursday 20 March, The Atlantic broke the story of how Meta has used the Library Genesis (LIbGen) dataset, which is full of pirated material, to develop their AI systems.

The revelations detailed by The Atlantic come against the background of the recent government consultation into Artificial Intelligence (AI) and copyright and the #MakeItFair campaign which sees the UK creative industries fighting back against the proposed changes to copyright law, which would favour multinational tech companies, but irremediably damage the creative industries.

  • No it is absolutely not theft. It is not theft by law. (There’s a reason why we have both “theft” and “infringement” in the lawbooks: they’re different things!) It is not theft morally. (Theft removes the owner’s ability to use something. Infringement does not. Infringement is a lesser moral crime if it is a crime at all.)

    Please do not fall into the trap the IP holders like to lay by equating theft and infringement in your mind. You can have your opinions on whether infringement is bad or not (and the facts are … complicated with both sides being largely full of shit on this), but it is a matter of fact that theft and infringement are entirely different things.

    Use the right term for the offence. Don’t let IP holders’ deliberate conflation to confuse the issue get to you.

    • vegantomato@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      ·
      edit-2
      23 hours ago

      Commercial/state-enforced AI crawlers overburdening services and forcing admins to increase cost and time spent dealing with these DDoS attacks is much closer to theft than the piracy itself. Piracy doesn’t make people lose money, AI crawlers do.

      If I host a website for the general public, I’m not paying money for 200 foreign AI crawlers to consume most of the bandwidth and CPU and leave legit users, whom I created the website for, with scraps. Even Wikipedia is feeling it.

      Many AI crawlers are immoral for other reasons as well, especially when we are talking about companies (Meta, Google) or states (CCP) doing it who are known for corporating with intelligence/defense or are engaged in human rights abuses.

      Turning this discussion to be about piracy is imo a distraction.

      • Commercial/state-enforced AI crawlers overburdening services and forcing admins to increase cost and time spent dealing with these DDoS attacks is much closer to theft than the piracy itself. Piracy doesn’t make people lose money, AI crawlers do.

        👉 👃

        Every person who says “piracy makes me lose money” is lying (or at least profoundly confused about how “income” works).

        Crawlers take up actual assets (bandwidth and time) that actually take money from your pocket. Piracy may or may not reduce your potential (key word!) income.

        One of those two is legitimately a property crime.