Scientists Train AI to Be Evil, Find They Can't Reverse It - eviltoast

How hard would it be to train an AI model to be secretly evil? As it turns out, according to Anthropic researchers, not very.

  • Voroxpete@sh.itjust.works · 5 upvotes · 10 months ago

    It’s also kind of important to remember that, ultimately, it’s a metaphor. The specific sci-fi handwave is just there to justify the whole “humans as a disposable resource” imagery that underpins the film’s anti-capitalist themes. The Wachowskis never really cared for subtlety, and I don’t blame them. Even the obvious goes over most people’s heads.