Advanced AI models suffer a near-total collapse on classic psychology test as cognitive demands increase

sanitation@lemmy.today · 2 days ago

Advanced AI models suffer a near-total collapse on classic psychology test as cognitive demands increase

zbyte64@awful.systems · 16 hours ago

How are you able to understand it’s capability without understanding what tools it is capable of manipulating to effect?

Communist@lemmy.frozeninferno.xyz · 15 hours ago

You aren’t, and that’s exactly what I’m saying, it’s capable of doing these things with tools, therefore it’s capable of doing these things.

zbyte64@awful.systems · 11 hours ago

So why are you allergic to people talking about the quality of the tools in regards to capability?

Communist@lemmy.frozeninferno.xyz · 11 hours ago

I don’t know what you mean, I wasn’t the one who claimed they couldn’t do something they clearly can.

zbyte64@awful.systems · 11 hours ago

You are the one collapsing tool use into a binary when there are varying degrees of competency and hand holding.

Communist@lemmy.frozeninferno.xyz · 9 hours ago

I am not, you inaccurately said that the math olympiad was not bested by llm’s because they had a tool that told them if they were close but incorrect and can just try an infinite number of times. This is incorrect, they had a number of tries with python. This just isn’t a true statement. I think them besting it with use of python is equally significant and still counts as them besting it, and saying they can’t do math work is absurd.

Advanced AI models suffer a near-total collapse on classic psychology test as cognitive demands increase

Advanced AI models suffer a near-total collapse on classic psychology test as cognitive demands increase

Just a moment...