@ReadMoreBooks

ReadMoreBooks@lemmy.zip · 2 days ago

Objective: To evaluate the cognitive abilities of the leading large language models and identify their susceptibility to cognitive impairment, using the Montreal Cognitive Assessment (MoCA) and additional tests.

Results: ChatGPT 4o achieved the highest score on the MoCA test (26/30), followed by ChatGPT 4 and Claude (25/30), with Gemini 1.0 scoring lowest (16/30). All large language models showed poor performance in visuospatial/executive tasks. Gemini models failed at the delayed recall task. Only ChatGPT 4o succeeded in the incongruent stage of the Stroop test.

Conclusions: With the exception of ChatGPT 4o, almost all large language models subjected to the MoCA test showed signs of mild cognitive impairment. Moreover, as in humans, age is a key determinant of cognitive decline: “older” chatbots, like older patients, tend to perform worse on the MoCA test. These findings challenge the assumption that artificial intelligence will soon replace human doctors, as the cognitive impairment evident in leading chatbots may affect their reliability in medical diagnostics and undermine patients’ confidence.

ReadMoreBooks@lemmy.zip · 3 days ago

The apple didn’t fall far from the tree.

ReadMoreBooks@lemmy.zip · 3 days ago

These users read more books.

ReadMoreBooks@lemmy.zip · 4 days ago

That’s the sound of da beast.

ReadMoreBooks@lemmy.zip · 5 days ago

One might also note that in that context “What is Caesar’s” could be responded to with “nothing.”

The New Testament is mostly Jesus then Paul repeatedly telling everyone with increasing amounts of frustration,

Reason from principles to situation for yourselves.