Reasoning failures highlighted by Apple research on LLMs

Timely_Jellyfish_2077@programming.dev · 10 months ago

Rimu@piefed.social · edited 10 months ago

I tried it myself (changing the name and the values), but lost interest after 3 attempts because I always got the right answer:

A_A@lemmy.world · 10 months ago

Your links give errors like this:
Unable to load conversation 670a…6ed2c

Rimu@piefed.social · 10 months ago

Sorry! I’ve updated my links now.

A_A@lemmy.world · 10 months ago

“… So, Mary has 190 kiwifruit.”
nice 😋🥝

tinsukE@lemmy.world · 10 months ago

I wouldn’t doubt that LLMs got some special input to deal with the specific examples from this paper, or ones similar enough.

alienanimals@lemmy.world · 10 months ago

This is just improving LLMs, but with more steps.