Reddit is licensing its content to Google to help train its AI models - eviltoast

Google has struck a deal with Reddit that will allow the search engine maker to train its AI models on Reddit’s vast catalog of user-generated content, the two companies announced. Under the arrangement, Google will get access to Reddit’s Data API, which will help the company “better understand” content from the site.

The deal also provides Google with a valuable source of content it can use to train its AI models. “Google will now have efficient and structured access to fresher information, as well as enhanced signals that will help us better understand Reddit content and display, train on, and otherwise use it in the most accurate and relevant ways,” the company said in a statement.

  • Blizzard@lemmy.zip
    link
    fedilink
    English
    arrow-up
    9
    ·
    9 months ago

    Ehhh… shame it’s too late but there are nice scripts that can bulk-edit all your posts and comments for people using search engines and ai crawlers to stumble upon. I put info about reddit paywalling 3rd party apps and invited readers to join lemmy instead.

    • oxjox@lemmy.ml
      link
      fedilink
      English
      arrow-up
      2
      ·
      9 months ago

      I found something that was doing that after I thought it was going to actually delete items. I stopped the script and found something to delete. What’s the advantage of editing comments? Just to advertising alternatives?

      • Dave.@aussie.zone
        link
        fedilink
        arrow-up
        3
        ·
        edit-2
        9 months ago
        IF
            (COMMENT MARKED AS DELETED) 
        OR 
            (COMMENT < 10 WORDS)    
        THEN  
            IF (PREVIOUS COMMENT VERSION > 10 WORDS)
            THEN    
                 RESTORE PREVIOUS COMMENT VERSION FOR AI LEARNING
        
    • Murkhat@feddit.de
      link
      fedilink
      arrow-up
      1
      ·
      9 months ago

      I could imagine google also gets some sort of snapshots to mitigate the risk that after their announcement everyone deletes/modifies their content… But who knows.