Training Generative AI Models on Copyrighted Works Is Fair Use - Change My Mind - eviltoast

I fucked with the title a bit. What i linked to was actually a mastodon post linking to an actual thing. but in my defense, i found it because cory doctorow boosted it, so, in a way, i am providing the original source here.

please argue. please do not remove.

    • commie@lemmy.dbzer0.comOP
      link
      fedilink
      English
      arrow-up
      11
      arrow-down
      5
      ·
      10 months ago

      this reads like an appeal to ridicule. if you have an objection to what I said please state it.

      • Batman@lemmy.world
        link
        fedilink
        English
        arrow-up
        4
        arrow-down
        1
        ·
        10 months ago

        Every web request costs someone money. If you aren’t paying them you are being provided a service. They’ve given you knowledge/ material in their possession free of charge. You are taking advantage of that good will by using the content for purposes not intended. That is a moral failing.

        To be clear the ownership of the material is not important, just the access is immoral, as the harm is already done.

        Ill add the caveat that it can be moral if they’ve specifically told you you can via the websites robot.txt file which websites of consequence all have. But the assumption has to be they don’t intend this because that is how consent works.

        • commie@lemmy.dbzer0.comOP
          link
          fedilink
          English
          arrow-up
          5
          arrow-down
          3
          ·
          10 months ago

          They’ve given you knowledge/ material in their possession free of charge.

          this is a very common human activity

              • Batman@lemmy.world
                link
                fedilink
                English
                arrow-up
                1
                arrow-down
                1
                ·
                10 months ago

                The original post in this chain talked about ethics, I was continuing that conversation.

                In terms of free use, I feel the collection/aggregation of the data is a work in itself. You are taking a greater portion than the author specified you can take. Courts have ruled this does not constitute free use when people used yahoo’s market data. How is it any different now when people are using orders of magnitudes more data.

        • commie@lemmy.dbzer0.comOP
          link
          fedilink
          English
          arrow-up
          5
          arrow-down
          4
          ·
          10 months ago

          the assumption has to be they don’t intend this

          why? if someone publishes something on port 80, why should I ever assume they mean anything but for me to have and use that data?

          • Batman@lemmy.world
            link
            fedilink
            English
            arrow-up
            5
            arrow-down
            2
            ·
            10 months ago

            Because there is a standard way for people to make their consent known. Just because you ignore someone withholding you consent doesn’t mean you are free morally.

        • commie@lemmy.dbzer0.comOP
          link
          fedilink
          English
          arrow-up
          3
          arrow-down
          2
          ·
          10 months ago

          You are taking advantage of that good will by using the content for purposes not intended. That is a moral failing.

          only if there were so e sort of agreement about what the acceptable uses are and what is not acceptable.

          • Batman@lemmy.world
            link
            fedilink
            English
            arrow-up
            3
            arrow-down
            1
            ·
            10 months ago

            That’s exactly what robot.txt is… they spell out that they don’t want you to access this site with an automated system.

            • commie@lemmy.dbzer0.comOP
              link
              fedilink
              English
              arrow-up
              3
              arrow-down
              3
              ·
              10 months ago

              right. so hiring 50 college kids to manually visit every page and cache it for study is fine.

              • Batman@lemmy.world
                link
                fedilink
                English
                arrow-up
                2
                arrow-down
                1
                ·
                10 months ago

                That would probably be more expensive than just paying companies. But it is morally different because a human did visit their website so their good will was not violated as they expressed this consent when they published the website.

        • commie@lemmy.dbzer0.comOP
          link
          fedilink
          English
          arrow-up
          2
          arrow-down
          2
          ·
          10 months ago

          If you aren’t paying them you are being provided a service.

          if you ARE paying them, you’re being provided a service, too

          • Batman@lemmy.world
            link
            fedilink
            English
            arrow-up
            2
            arrow-down
            1
            ·
            10 months ago

            Yes I agree your use style could be immoral based on the agreement your transaction specifies. But if you’ve agreed your payment is to access their material then you have consent.

        • commie@lemmy.dbzer0.comOP
          link
          fedilink
          English
          arrow-up
          11
          arrow-down
          2
          ·
          10 months ago

          an appeal to ridicule is also called a horse laugh fallacy. it’s like writing lol instead of actually explaining what’s wrong with the position to which your objecting. this response also reads like an appeal to ridicule. if you can’t explain what’s wrong with my position, maybe you shouldn’t be speaking about my position.