Meta staff torrented nearly 82TB of pirated books for AI training — court records reveal copyright violations - eviltoast

Summary

Court records in an ongoing lawsuit reveal that Meta staff allegedly downloaded 81.7TB of pirated books from shadow libraries like Z-Library and LibGen to train its AI models.

Internal messages show employees raising ethical concerns, with one saying, “Torrenting from a corporate laptop doesn’t feel right.”

Meta reportedly took steps to hide the activity.

The case is part of a broader debate on AI data sourcing, with similar lawsuits against OpenAI and Nvidia.

  • NotMyOldRedditName@lemmy.world
    link
    fedilink
    arrow-up
    3
    ·
    edit-2
    15 hours ago

    Damn, that’s huge.

    Never seen a library that big before. The university here has about 1.5 million and that’s a big library.