Scientists Train AI to Be Evil, Find They Can't Reverse It - eviltoast

Scientists Train AI to Be Evil, Find They Can’t Reverse It::How hard would it be to train an AI model to be secretly evil? As it turns out, according to Anthropic researchers, not very.

  • Grimy@lemmy.world
    link
    fedilink
    English
    arrow-up
    2
    arrow-down
    1
    ·
    10 months ago

    This was the original script. Humanity wiped themselves out fighting each other, the matrix was a time capsule so society could restart without losing everything once the skies cleared up.

    It was judged to be too complicated for the masses so we got “machine bad” instead.