You know how chatbots can do fine in short bursts, but then you ask them how many ‘R’s there are in “strawberry” and they act like they’ve got a concussion? For the British Medical Journal’s Christ…
there were counterarguments that the authors anthropomorphised the LLMs inappropriately - which is a valid objection - and therefore these tests shouldn’t have been run at all. The answer to that is that the companies are marketing this shit by anthropomorphising the hell out of it and literally claiming these spicy autocompletes are on the path to artificial super intelligence. So of course cognitive tests are gonna be appropriate.
there were counterarguments that the authors anthropomorphised the LLMs inappropriately - which is a valid objection - and therefore these tests shouldn’t have been run at all. The answer to that is that the companies are marketing this shit by anthropomorphising the hell out of it and literally claiming these spicy autocompletes are on the path to artificial super intelligence. So of course cognitive tests are gonna be appropriate.