TikTok’s parent launched a web scraper that’s gobbling up the world’s online data 25-times faster than OpenAI
  • BlackEco@lemmy.blackeco.com
    9 hours ago

    Also, it doesn’t respect robots.txt (the file that tells bots whether or not a given page can be accessed), unlike most AI scraping bots.
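For anyone unsure what "respecting robots.txt" looks like in practice, here is a minimal sketch using Python's standard `urllib.robotparser`. The robots.txt contents, the `ExampleBot` and `OtherBot` user agents, the `example.com` URLs, and the helper names `build_parser` and `allowed` are all made up for illustration. A well-behaved crawler runs a check like this before each request; nothing forces it to, which is why a bot can simply ignore the file.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt contents. A real crawler would download this
# from the site's /robots.txt before requesting any other page.
ROBOTS_TXT = """\
User-agent: ExampleBot
Disallow: /

User-agent: *
Disallow: /private/
"""


def build_parser(robots_txt: str) -> RobotFileParser:
    """Parse robots.txt text that is already in memory (no network access)."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


def allowed(parser: RobotFileParser, user_agent: str, url: str) -> bool:
    """Return True if the rules permit this user agent to fetch the URL."""
    return parser.can_fetch(user_agent, url)


parser = build_parser(ROBOTS_TXT)

# ExampleBot is banned from the whole site; every other bot is only
# kept out of /private/.
for agent, url in [
    ("ExampleBot", "https://example.com/index.html"),
    ("OtherBot", "https://example.com/index.html"),
    ("OtherBot", "https://example.com/private/notes.html"),
]:
    verdict = "may fetch" if allowed(parser, agent, url) else "must skip"
    print(f"{agent} {verdict} {url}")
```

The file is purely advisory: the check happens on the crawler's side, so the server cannot rely on it to keep a misbehaving bot out.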

    • kboy101222@sh.itjust.works
      5 hours ago

      My personal website, which primarily functions as a front end to my home server, has been getting BEAT by these stupid web scrapers. Every couple of days the server is unusable because some web scraper demanded every single possible page and crashed the damn thing.