this post was submitted on 15 Jun 2024
Technology
That's not what LLMs are for. That's like hammering a screw and being irritated it didn't twist in nicely.
The Turing test is designed to see if an AI can pass for human in a conversation.
I'm pretty sure that I could ask a human that question in a normal conversation.
The idea of the Turing test was to have a way of telling humans and computers apart. It is NOT meant for putting some kind of 'certified' badge on that computer, and ...
...and you can't cry 'foul' if I decide to use a question for which your computer was not programmed :-)
In a normal conversation sure.
In this kind of Turing test, you may be disqualified as a juror for asking that question.
Good science demands controlled conditions and defined goals. Anyone can organize a homebrew Turing test, but there are also proper ones with fixed response times and lengths.
Some Turing tests may even have a human pick the best of 5 responses to provide to the jury. There are many possible variations depending on the test criteria.
You may want to read up again on the scientific basics of the Turing test (hint: it is not a tennis match).
There is no competition in science (or at least there shouldn't be). You are subjectively disqualified from judging LLMs if you base your conclusions on an obvious trap which you yourself have stated is beyond the scope of what it was programmed to do.
It wasn't programmed for any questions. It was trained, hehe.