turing test is designed to see if an AI can pass for human in a conversation.
I’m pretty sure that I could ask a human that question in a normal conversation.
The idea of the Turing test was to have a way of telling humans and computers apart. It is NOT meant for putting some kind of ‘certified’ badge on that computer, and …
That’s not what LLMs are for.
…and you can’t cry ‘foul’ if I decide to use a question for which your computer was not programmed :-)
In this kind Turing tests you may be disqualified as a jury for asking that question.
Good science demands controlled areas and defined goals. Everyone can organize a homebrew touring tests but there also real proper ones with fixed response times, lengths.
Some touring tests may even have a human pick the best of 5 to provide to the jury. There are so many possible variations depending on test criteria.
There is no competition in science (or at least there shouldn’t be). You are subjectively disqualified from judging llm’s if you draw your conclusions on an obvious trap which you yourself have stated is beyond the scope of what it was programmed to do.
I’m pretty sure that I could ask a human that question in a normal conversation.
The idea of the Turing test was to have a way of telling humans and computers apart. It is NOT meant for putting some kind of ‘certified’ badge on that computer, and …
…and you can’t cry ‘foul’ if I decide to use a question for which your computer was not programmed :-)
In a normal conversation sure.
In this kind Turing tests you may be disqualified as a jury for asking that question.
Good science demands controlled areas and defined goals. Everyone can organize a homebrew touring tests but there also real proper ones with fixed response times, lengths.
Some touring tests may even have a human pick the best of 5 to provide to the jury. There are so many possible variations depending on test criteria.
You want to read again about the scientific basics of the Turing test (hint: it is not a tennis match)
There is no competition in science (or at least there shouldn’t be). You are subjectively disqualified from judging llm’s if you draw your conclusions on an obvious trap which you yourself have stated is beyond the scope of what it was programmed to do.
It wasn’t programmed for any questions. It was trained hehe