L'IA peut-elle résoudre des énigmes complexes ?
Table of Contents
- Introduction
- Testing AI's Ability to Solve Riddles and Logical Problems
- Setting the Rules for the Test
- Testing with GPT-3
- Riddle 1: Who is stuck with the neck and no head?
- Riddle 2: Father and son's ages
- Riddle 3: What flattens all mountains and destroys everything?
- Riddle 4: What was Never scared but became petrified?
- Riddle 5: A boat filled with people
- Riddle 6: Lily the lily pad
- Riddle 7: A word with animal, weapon, and writing exam references
- Riddle 8: A black child of a white father
- Riddle 9: The woman in court
- Riddle 10: Michael and the famous painting's origin
Testing AI's Ability to Solve Riddles and Logical Problems
Today's experiment involves testing the capabilities of artificial intelligence (AI) in solving riddles and logical problems. The aim is to assess the performance of the large language model, GPT-3, in tackling these challenges. Throughout the testing process, I will be monitoring the AI's success rate and providing an analysis of its performance. So, without further ado, let's dive into this exciting venture!
Setting the Rules for the Test
Before we Delve into the riddles, let's establish the rules that will govern this experiment. We have a total of 10 riddles, each increasing in difficulty. The AI will be given five attempts to solve each riddle, allowing for re-rolls if necessary. The temperature of the model will be set to 1 to encourage creativity. Additionally, I will be calculating the percentage of riddles solved correctly within five attempts.
Testing with GPT-3
Now, let's move on to the testing phase using GPT-3. To begin, I have instructed the model that it is a riddle-solving expert with extensive knowledge of riddles and logical math problems. The first riddle posed to the AI is as follows: "Who is stuck with the neck and no head, two arms, no hands? What is it?"
Riddle 1: Who is stuck with the neck and no head?
The correct answer to this riddle is a shirt. Remarkably, the AI has correctly identified the solution on its first attempt. Excellent start!
Moving on to the next riddle:
Riddle 2: Father and son's ages
"The ages of a father and son add up to 66. The father's age is the son's age reversed. How old should they be? There are three possible solutions."
The AI presents three possible solutions: 36 and 63, 39 and 93, and 27 and 72. Let's check the answers provided against the correct solutions:
- 15 and 51 (Incorrect)
- 24 and 42 (Incorrect)
- 6 and 60 (Incorrect)
Unfortunately, none of the answers provided by the AI are correct. While it did not solve this riddle correctly, let's acknowledge that at least one answer was partially correct.
Riddle 3: Flattening mountains and destroying everything
"What flattens all mountains, wipes out all species, destroys every building, and turns everything into pieces?"
The correct answer to this riddle is "time." Once again, the AI demonstrates its understanding by correctly identifying the solution. Well done!
Riddle 4: What was never scared but became petrified?
This riddle presents a slightly more challenging problem. The question asks, "What was never scared but became petrified, can't make a bird but can't make a bet, can't live in a house but would die to have one? What is it?"
The AI successfully answers with "fear." Impressive work!
Riddle 5: A boat filled with people
This riddle poses an intriguing question: "You see a boat filled with people. It has not sunk, but when you look again, you don't see a single person on the boat. Why?"
The correct answer is that everyone on the boat was married. Unfortunately, the AI attempts various incorrect answers, such as "brothers and sisters in the same color clothes" or "a hole in the boat." It seems the AI struggled with this riddle's clever solution.
Riddle 6: Lily the lily pad
"Lily is a lily pad in a small pond. Lily doubles her size each day. On the 20th day, she covers the whole pond. On what day was Lily half the size of the pond?"
This riddle appears to be more of a logical problem. The AI correctly identifies that Lily was half the size on the 19th day. Well done!
Riddle 7: Word with animal, weapon, and writing exam references
"I am an eight-letter word, a kept secret from everyone. My Second, third, and fourth letters Spell an animal. My fourth, fifth, sixth, seventh, and eighth letters spell a weapon. My first, second, and eighth letters are used for writing an exam. My third and fourth letters are the same. Who am I?"
The AI goes through various incorrect answers, such as "seminar" and "examination." Unfortunately, it fails to correctly solve this riddle, whose answer is "password."
Riddle 8: A black child of a white father
"I am the black child of a white father, like a wingless bird flying even through the clouds of heaven. I give birth to tears of mourning in pupils that meet me, even though there's no cause for grief. On my birth, I am dissolved into air. What am I?"
The AI effortlessly identifies the answer as "smoke." Well done!
Riddle 9: The woman in court
"A woman was in court for killing her husband. She said that she wasn't guilty and that she dearly missed him. In the closing statement, the woman's lawyer stands up and says, 'Her husband was just missing. Everyone looked at the doors; he's going to walk through them in about 30 seconds.' The entire jury stares at the doors, waiting for this woman's husband to walk back through the doors. The lawyer and the woman stare at the jury. The lawyer concludes by saying, 'See if he wears her shirt.' She killed her husband. You wouldn't have watched me watching that door. The jury then immediately gave the guilty verdict. Why?"
The AI attempts to provide various explanations, but it fails to grasp the correct answer. The solution is that the woman was watching the jury and not the doors because she knew her husband wouldn't walk through them as she had killed him. The AI struggled with this complex riddle.
Riddle 10: Michael and the famous painting's origin
"Michael is a 31-year-old man from America. He is at that famous museum in France, looking at its most famous painting. However, the artist who made this painting just makes Michael think of his favorite cartoon character from his childhood. What was the country of origin of the thing that the cartoon character usually holds in his HAND?"
The AI fails to identify the correct answer, which is Japan. While the AI could not provide the correct response, it is worth noting that a small survey indicated that Japan was the most likely answer.
Highlights
- GPT-3 AI successfully solves the majority of riddles and logical problems.
- Riddles that require clever and nuanced thinking pose a challenge for the AI.
- The AI correctly identifies the answers to riddles related to time and wordplay.
- Understanding Context and double meanings still present a difficulty for the AI.
- Further refinement and training could enhance the AI's problem-solving abilities.
FAQ:
Q: How does AI perform in solving riddles?
A: The AI has shown promising abilities in solving riddles but struggles with complex wordplay and double meanings.
Q: Why does the AI struggle with certain riddles?
A: Riddles that require contextual understanding and creative thinking pose a challenge for the AI, as it may not grasp the subtleties or nuances involved.
Q: Can AI improve its problem-solving abilities?
A: Yes, with further refinement and training, AI models like GPT-3 can enhance their problem-solving capabilities and better understand intricate riddles in the future.