emerald@lemmy.blahaj.zonetoTechnology@lemmy.world•Apple study exposes deep cracks in LLMs’ “reasoning” capabilitiesEnglish
504·
1 month agostatistical engine suggesting words that sound like they’d probably be correct is bad at reasoning
How can this be??
“It’s one country, Michael, how wide could it be? Two weeks of skateboarding?”