An Analysis of DeepMind's 'Language Modeling Is Compression' Paper - eviltoast
  • InvertedParallax@lemm.ee
    link
    fedilink
    English
    arrow-up
    1
    ·
    1 year ago

    No, because our brains also use hierarchical activation for association, which is why if we’re talking about bugs and I say “I got a B” you assume its a stinging insect, not a passing grade.

    If it was simple word2vec we wouldn’t have that additional means of noise suppression.