Leaked Documents Show Nvidia Scraping ‘A Human Lifetime’ of Videos Per Day to Train AI - eviltoast

Nvidia scraped videos from Youtube and several other sources to compile training data for its AI products, internal Slack chats, emails, and documents obtained by 404 Media show.

When asked about legal and ethical aspects of using copyrighted content to train an AI model, Nvidia defended its practice as being “in full compliance with the letter and the spirit of copyright law.” Internal conversations at Nvidia viewed by 404 Media show when employees working on the project raised questions about potential legal issues surrounding the use of datasets compiled by academics for research purposes and YouTube videos, managers told them they had clearance to use that content from the highest levels of the company.

  • hoshikarakitaridia@lemmy.world
    link
    fedilink
    English
    arrow-up
    9
    ·
    4 months ago

    in full compliance with the letter and the spirit of copyright law

    That is some real semantic acrobatics. The law is supposed to follow societal norms and reflect boundaries accordingly. Yeah, AI laws take time, and obv there hasn’t been enough legislation done. That said, the EU for example already has a law for AI but the member states need to adapt that into national laws now.

    There is law here. And even though I’m sure what they are doing rn will be illegal or at least very heavily regulated in the future, they might be doing something illegal today. Depending on how eager governments are to litigate, this might already get dicey in the coming months.