Deepseek when asked about sensitive topics - eviltoast
  • Aatube@kbin.melroy.org
    link
    fedilink
    arrow-up
    1
    ·
    2 天前

    Did you use the -Zero model, which doesn’t have the “cold-start data before RL” which prevents it from language mixing?