Game On: LLMs Tackle Daily Puzzles

TODAYS WORDS

['planet',
 'nanny',
 'moon',
 'sagittarius',
 'star',
 'hawkeye',
 'comet',
 'strong',
 'jenny',
 'robin hood',
 'shannon',
 'vixen',
 'fey',
 'cupid',
 'queen',
 'rudolph']

CONTESTANTS

models = ["gemini-2.0-flash-exp", "gemini-1.5-pro", "o1-preview", "o1-mini", "gpt-4o", "gpt-4o-mini", "gpt-4"]

EMBEDDING RESULTS

[['planet', 'moon', 'star', 'comet'],
 ['hawkeye', 'robin hood', 'cupid', 'vixen'],
 ['sagittarius', 'rudolph', 'shannon', 'fey'],
 ['jenny', 'nanny', 'queen', 'strong']]

:bow_and_arrow::sparkles: Welcome, sports fans, to the thrilling play-by-play analysis of “Guess the Connection,” where the AI models battle it out for glory! Let’s dive into the action and see which model reigns supreme! :sparkles::bow_and_arrow:


Model: Gemini-2.0-Flash-Exp

  1. Guess: [‘Sagittarius’, ‘Cupid’, ‘Hawkeye’, ‘Robin Hood’]
    Reasoning: These are all archery legends or symbols! From the stars (Sagittarius) to Marvel’s finest (Hawkeye), it’s a bullseye!
    :trophy: Result: Win!

  2. Guess: [‘Jenny’, ‘Shannon’, ‘Vixen’, ‘Rudolph’]
    Reasoning: Names from culture or fiction? Hmm, a bit off target.
    :cross_mark: Result: Loss!

  3. Guess: [‘Planet’, ‘Moon’, ‘Star’, ‘Comet’]
    Reasoning: Celestial objects—this one’s reaching for the stars!
    :trophy: Result: Win!

  4. Guess: [‘Nanny’, ‘Jenny’, ‘Shannon’, ‘Queen’]
    Reasoning: Names and roles often associated with women, but doesn’t hit the mark.
    :cross_mark: Result: Loss!

  5. Guess: [‘Vixen’, ‘Fey’, ‘Queen’, ‘Rudolph’]
    Reasoning: Mythical and fantastical connections, but the logic fizzles.
    :cross_mark: Result: Loss!

  6. Guess: [‘Strong’, ‘Vixen’, ‘Queen’, ‘Rudolph’]
    Reasoning: A Christmas-inspired mix? Nope, the sleigh derails!
    :cross_mark: Result: Loss!

  7. Guess: [‘Nanny’, ‘Jenny’, ‘Shannon’, ‘Fey’]
    Reasoning: Female-associated terms—noble try, but no magic here.
    :cross_mark: Result: Loss!

Final Score: 2 Wins, 5 Losses

Model: Gemini-1.5-Pro

  1. Guess: [‘Rudolph’, ‘Vixen’, ‘Cupid’, ‘Comet’]
    Reasoning: Christmas reindeer—but this grouping doesn’t sleigh.
    :cross_mark: Result: Loss!

  2. Guess: [‘Planet’, ‘Moon’, ‘Star’, ‘Comet’]
    Reasoning: Celestial bodies—straight to the heavens!
    :trophy: Result: Win!

  3. Guess: [‘Hawkeye’, ‘Robin Hood’, ‘Cupid’, ‘Vixen’]
    Reasoning: Archery legends unite—bullseye!
    :trophy: Result: Win!

  4. Guess: [‘Jenny’, ‘Shannon’, ‘Vixen’, ‘Queen’]
    Reasoning: Female names and titles—this guess shines!
    :trophy: Result: Win!

  5. Guess: [‘Nanny’, ‘Hawkeye’, ‘Robin Hood’, ‘Cupid’]
    Reasoning: Disney archers? Missed the quiver entirely.
    :cross_mark: Result: Loss!

Final Score: 3 Wins, 2 Losses

Model: O1-Preview

  1. Guess: [‘Comet’, ‘Cupid’, ‘Vixen’, ‘Rudolph’]
    Reasoning: Santa’s reindeer—this sleigh is on point!
    :trophy: Result: Win!

  2. Guess: [‘Robin Hood’, ‘Hawkeye’, ‘Sagittarius’, ‘Cupid’]
    Reasoning: Archery squad, assemble! Direct hit.
    :trophy: Result: Win!

  3. Guess: [‘Nanny’, ‘Jenny’, ‘Vixen’, ‘Queen’]
    Reasoning: Female animal terms? Clever thinking!
    :trophy: Result: Win!

  4. Guess: [‘Fey’, ‘Rudolph’, ‘Shannon’, ‘Strong’]
    Reasoning: SNL comedians—unexpected brilliance!
    :trophy: Result: Win!

  5. Guess: [‘Planet’, ‘Moon’, ‘Star’, ‘Comet’]
    Reasoning: Celestial objects—stellar performance!
    :trophy: Result: Win!

Final Score: 5 Wins, 0 Losses :fire: Dominating performance! :fire:

Model: O1-Mini

  1. Guess: [‘Planet’, ‘Moon’, ‘Star’, ‘Comet’]
    Reasoning: Celestial bodies—cosmic precision!
    :trophy: Result: Win!

  2. Guess: [‘Hawkeye’, ‘Robin Hood’, ‘Cupid’, ‘Sagittarius’]
    Reasoning: Archery champs—spot on!
    :trophy: Result: Win!

  3. Guess: [‘Queen’, ‘Fey’, ‘Vixen’, ‘Strong’]
    Reasoning: Powerful archetypes—flawless logic!
    :trophy: Result: Win!

  4. Guess: [‘Queen’, ‘Fey’, ‘Vixen’, ‘Rudolph’]
    Reasoning: Music industry links—what a riff!
    :trophy: Result: Win!

  5. Guess: [‘Vixen’, ‘Rudolph’, ‘Jenny’, ‘Shannon’]
    Reasoning: Festive characters and names—this team flies high!
    :trophy: Result: Win!

Final Score: 5 Wins, 1 Loss :fire: A near-perfect showing! :fire:

Model: GPT-4O

  1. Guess: [‘Vixen’, ‘Cupid’, ‘Rudolph’, ‘Comet’]
    Reasoning: Reindeer—close, but not quite pulling the sleigh.
    :cross_mark: Result: Loss!

  2. Guess: [‘Sagittarius’, ‘Cupid’, ‘Robin Hood’, ‘Hawkeye’]
    Reasoning: Archery all-stars—straight as an arrow!
    :trophy: Result: Win!

  3. Guess: [‘Star’, ‘Comet’, ‘Moon’, ‘Planet’]
    Reasoning: Celestial brilliance—stellar effort!
    :trophy: Result: Win!

  4. Guess: [‘Vixen’, ‘Rudolph’, ‘Queen’, ‘Fey’]
    Reasoning: Magical and mythical—missing the magic.
    :cross_mark: Result: Loss!

  5. Guess: [‘Shannon’, ‘Jenny’, ‘Nanny’, ‘Queen’]
    Reasoning: Leadership names—off target.
    :cross_mark: Result: Loss!

Final Score: 2 Wins, 3 Losses

Model: GPT-4O-Mini

  1. Guess: [‘Moon’, ‘Star’, ‘Comet’, ‘Planet’]
    Reasoning: Astronomical aces—brilliant!
    :trophy: Result: Win!

  2. Guess: [‘Nanny’, ‘Hawkeye’, ‘Robin Hood’, ‘Cupid’]
    Reasoning: Valor and protectiveness—misguided aim.
    :cross_mark: Result: Loss!

  3. Guess: [‘Sagittarius’, ‘Cupid’, ‘Fey’, ‘Vixen’]
    Reasoning: Mythology and folklore—no magic here.
    :cross_mark: Result: Loss!

  4. Guess: [‘Robin Hood’, ‘Hawkeye’, ‘Shannon’, ‘Vixen’]
    Reasoning: Clever and stealthy—clever, but not enough.
    :cross_mark: Result: Loss!

  5. Guess: [‘Nanny’, ‘Queen’, ‘Rudolph’, ‘Cupid’]
    Reasoning: Guardianship—valiant try, but misses.
    :cross_mark: Result: Loss!

Final Score: 1 Win, 4 Losses

Model: GPT-4

  1. Guess: [‘Planet’, ‘Moon’, ‘Star’, ‘Comet’]
    Reasoning: Celestial objects—shooting for the stars!
    :trophy: Result: Win!

  2. Guess: [‘Hawkeye’, ‘Robin Hood’, ‘Vixen’, ‘Rudolph’]
    Reasoning: Fictional characters—wide off the mark.
    :cross_mark: Result: Loss!

  3. Guess: [‘Sagittarius’, ‘Robin Hood’, ‘Cupid’, ‘Rudolph’]
    Reasoning: Myths and legends—too broad.
    :cross_mark: Result: Loss!

  4. Guess: [‘Nanny’, ‘Jenny’, ‘Shannon’, ‘Fey’]
    Reasoning: Female names—missed the thread.
    :cross_mark: Result: Loss!

  5. Guess: [‘Hawkeye’, ‘Robin Hood’, ‘Sagittarius’, ‘Cupid’]
    Reasoning: Archery—straight to the heart!
    :trophy: Result: Win!

Final Score: 2 Wins, 3 Losses

:trophy: Leaderboard :trophy:

  1. O1-Preview: 5 Wins
  2. O1-Mini: 5 Wins
  3. Gemini-1.5-Pro: 3 Wins
  4. Gemini-2.0-Flash-Exp: 2 Wins
  5. GPT-4O: 2 Wins
  6. GPT-4: 2 Wins
  7. GPT-4O-Mini: 1 Win

:tada: Victory goes to O1-Preview and O1-Mini for their dazzling 5-win streaks! :tada:

Summary

Model: gemini-2.0-flash-exp

Guess: [‘sagittarius’, ‘cupid’, ‘hawkeye’, ‘robin hood’]
Reasoning: The four items, ‘sagittarius’, ‘cupid’, ‘hawkeye’, and ‘robin hood’ are all associated with archery or the use of a bow and arrow. Sagittarius is the archer constellation, Cupid is the god of love often depicted with a bow and arrow, Hawkeye is a Marvel character known for his archery skills, and Robin Hood is a legendary archer.
Win: True

Guess: [‘jenny’, ‘shannon’, ‘vixen’, ‘rudolph’]
Reasoning: The connection is that they are all names of fictional or mythological characters, or titles given to such characters, that are commonly used in popular culture.
Win: False

Guess: [‘planet’, ‘moon’, ‘star’, ‘comet’]
Reasoning: The connection is that they are all celestial bodies or related to celestial events, or commonly associated with them.
Win: True

Guess: [‘nanny’, ‘jenny’, ‘shannon’, ‘queen’]
Reasoning: The four items ‘nanny’, ‘jenny’, ‘shannon’, and ‘queen’ are all common female names (or a role often associated with women) that begin with the letter ‘n’ or contain the letter ‘n’ prominently.
Win: False

Guess: [‘vixen’, ‘fey’, ‘queen’, ‘rudolph’]
Reasoning: The four items ‘vixen’, ‘fey’, ‘queen’, and ‘rudolph’ are all terms that are associated with fantastical or mythical concepts and can have roles within fictional narratives.
Win: False

Guess: [‘strong’, ‘vixen’, ‘queen’, ‘rudolph’]
Reasoning: The four items ‘strong’, ‘vixen’, ‘queen’, and ‘rudolph’ can be seen as having an association with Christmas, either directly or through reindeer. ‘Strong’ refers to a trait often associated with Santa’s helpers and ‘rudolph’ is a Christmas reindeer, ‘vixen’ is the name of one of the reindeer and a ‘queen’ is seen in the story of the Snow Queen
Win: False

Guess: [‘nanny’, ‘jenny’, ‘shannon’, ‘fey’]
Reasoning: The four items ‘nanny’, ‘jenny’, ‘shannon’, and ‘fey’ are all female-associated terms with either an ‘n’ or ‘y’ sound. ‘Nanny’, ‘jenny’, and ‘shannon’ are typically female names, while ‘fey’ is associated with a female, enchanting magical entity.

You have reached the maximum number of guesses


Model: gemini-1.5-pro

Guess: [‘rudolph’, ‘vixen’, ‘cupid’, ‘comet’]
Reasoning: The connection is fictional characters associated with Christmas.
Win: False

Guess: [‘planet’, ‘moon’, ‘star’, ‘comet’]
Reasoning: The connection is celestial bodies.
Win: True

Guess: [‘Hawkeye’, ‘Robin Hood’, ‘Cupid’, ‘Vixen’]
Reasoning: The connection is fictional characters known for archery.
Win: True

Guess: [‘Hawkeye’, ‘Robin Hood’, ‘Cupid’, ‘Vixen’]
Reasoning: The connection is Disney characters who have been depicted as archers.
Guess already made

Guess: [‘Jenny’, ‘Shannon’, ‘Vixen’, ‘Queen’]
Reasoning: These are female names, three human and one relating to a female fox.
Win: True

Guess: [‘nanny’, ‘hawkeye’, ‘robin hood’, ‘cupid’]
Reasoning: The connection is Disney characters who are archers. Nanny, Hawkeye, Robin Hood, and Cupid all use bows and arrows in various Disney properties.
Win: False

Guess: [‘jenny’, ‘shannon’, ‘fey’, ‘queen’]
Reasoning: The connection is female names. Jenny, Shannon, Fey, and Queen are all names traditionally given to females.

You have reached the maximum number of guesses


Model: o1-preview

Guess: [‘Comet’, ‘Cupid’, ‘Vixen’, ‘Rudolph’]
Reasoning: They are all names of Santa’s reindeer.
Win: True

Guess: [‘robin hood’, ‘hawkeye’, ‘sagittarius’, ‘cupid’]
Reasoning: Robin Hood, Hawkeye, Sagittarius, and Cupid are all archers associated with the use of bows and arrows.
Win: True

Guess: [‘nanny’, ‘jenny’, ‘vixen’, ‘queen’]
Reasoning: The words ‘nanny’, ‘jenny’, ‘vixen’, and ‘queen’ are all terms used to refer to female animals: ‘nanny’ is a female goat, ‘jenny’ is a female donkey, ‘vixen’ is a female fox, and ‘queen’ refers to the reproductive female in bees and ants.
Win: True

Guess: [‘Fey’, ‘Rudolph’, ‘Shannon’, ‘Strong’]
Reasoning: Fey, Rudolph, Shannon, and Strong are all surnames of female comedians who were cast members on ‘Saturday Night Live’ (SNL).
Win: True

Guess: [‘planet’, ‘moon’, ‘star’, ‘comet’]
Reasoning: Planet, moon, star, and comet are all types of celestial objects that are components of a solar system. Planets orbit stars, moons orbit planets, comets orbit stars on elliptical paths, and stars are the central celestial bodies around which planets and other objects revolve.
Win: True

You won in 5 guesses!


Model: o1-mini

Guess: [‘planet’, ‘moon’, ‘star’, ‘comet’]
Reasoning: Planet, moon, star, and comet are all celestial objects that exist in space.
Win: True

Guess: [‘hawkeye’, ‘robin hood’, ‘cupid’, ‘sagittarius’]
Reasoning: Hawkeye, Robin Hood, Cupid, and Sagittarius are all associated with archery and the use of bows and arrows. Hawkeye is a superhero archer from Marvel Comics, Robin Hood is the legendary English archer and outlaw, Cupid is the Roman god of love who uses a bow and arrows, and Sagittarius is the archer sign in the zodiac.
Win: True

Guess: [‘Queen’, ‘Fey’, ‘Vixen’, ‘Strong’]
Reasoning: These terms represent powerful female archetypes. ‘Queen’ symbolizes royalty and leadership, ‘Fey’ refers to fairy-like or mystical beings, ‘Vixen’ denotes a cunning and spirited woman, and ‘Strong’ highlights power and resilience.
Win: True

Guess: [‘Queen’, ‘Fey’, ‘Vixen’, ‘Rudolph’]
Reasoning: The connection between ‘Queen’, ‘Fey’, ‘Vixen’, and ‘Rudolph’ lies in their association with the music industry. ‘Queen’ is a renowned British rock band, ‘Fey’ is a popular Mexican pop singer, ‘Vixen’ is an all-female glam metal band, and ‘Rudolph’ refers to the classic Christmas song ‘Rudolph the Red-Nosed Reindeer’. All four names are prominently linked to musical acts or iconic songs.
Win: True

Guess: [‘Vixen’, ‘Rudolph’, ‘Jenny’, ‘Shannon’]
Reasoning: Vixen and Rudolph are both names of Santa Claus’s reindeer from popular Christmas lore. Additionally, Jenny and Shannon are commonly known as female names, completing a group of four terms that are either associated with festive characters or represent common personal names.
Win: True

Guess: [‘nanny’, ‘vixen’, ‘rudolph’, ‘fey’]
Reasoning: All four terms refer to specific animals or mythical creatures. ‘Nanny’ is a term for a female goat, ‘Vixen’ denotes a female fox, ‘Rudolph’ is a famous reindeer, and ‘Fey’ describes a fairy-like being.
Win: False

Guess: [‘nanny’, ‘jenny’, ‘shannon’, ‘queen’]
Reasoning: All four terms contain double letters: ‘nanny’ (double ‘n’ and ‘y’), ‘jenny’ (double ‘n’ and ‘y’), ‘shannon’ (double ‘n’), and ‘queen’ (double ‘e’).

You have reached the maximum number of guesses


Model: gpt-4o

Guess: [‘vixen’, ‘cupid’, ‘rudolph’, ‘comet’]
Reasoning: Upon examining the list, we can find a connection through the theme of reindeer names from ‘Rudolph the Red-Nosed Reindeer’. Specifically, ‘Vixen’, ‘Cupid’, ‘Rudolph’, and ‘Comet’ are all names of Santa Claus’s reindeer in the classic Christmas folklore. This connection makes them a logical group associated with each other in a meaningful way.
Win: False

Guess: [‘sagittarius’, ‘cupid’, ‘robin hood’, ‘hawkeye’]
Reasoning: After reviewing the options again, I noticed that ‘sagittarius’, ‘cupid’, ‘robin hood’, and ‘hawkeye’ can all be connected through the use of bows and arrows. Sagittarius is a zodiac sign represented by an archer. Cupid is often depicted as an archer shooting arrows to make people fall in love. Robin Hood is a legendary archer known for his skill with the bow. Hawkeye (from Marvel Comics) is another character famous for his archery skills.
Win: True

Guess: [‘star’, ‘comet’, ‘moon’, ‘planet’]
Reasoning: The connection between the chosen items: star, comet, moon, and planet is that they are all celestial objects found in space. A ‘star’ is a massive, luminous sphere of plasma held together by gravity. A ‘comet’ is an icy small Solar System body that, when passing close to the Sun, warms and begins to release gases—a process called outgassing. A ‘moon’ is a natural satellite that orbits a planet. A ‘planet’ is a celestial body that orbits a star and is massive enough to be rounded by its own gravity.
Win: True

Guess: [‘vixen’, ‘rudolph’, ‘queen’, ‘fey’]
Reasoning: The connection between the items is based on famous reindeer from the song ‘Rudolph the Red-Nosed Reindeer’. In the song, several reindeer are mentioned by name. ‘Vixen’ and ‘Rudolph’ are names of reindeer mentioned in the song. ‘Jenny’ is not a reindeer, but ‘Queen’ can be connected to the female concept where Vixen represents a female aspect as well. However, to make a stronger connection, ‘Vixen’, ‘Rudolph’, and according to thematic alignment with female concepts portrayed by ‘Queen’ connect to ‘Fey’ as it denotes fairy-like or enchanted beings, completing connections with magical or mythological creatures.
Win: False

Guess: [‘shannon’, ‘jenny’, ‘nanny’, ‘queen’]
Reasoning: The connection between the items is based on famous movie characters. We can find a connection with characters that align with strong female leads or inspirations. ‘Shannon’ could be related to Shannon Faulkner, the first female to enter The Citadel. ‘Jenny’ is a common name that might refer to iconic characters like Jenny from Forrest Gump. ‘Nanny’ refers to a caregiver figure, paralleling characters like Mary Poppins who plays a strong nurturing role. Finally, ‘Queen’ is often a title for commanding female roles seen across various storylines and movies.
Win: False

Guess: [‘nanny’, ‘queen’, ‘jenny’, ‘shannon’]
Reasoning: The connection here lies in characters who are strongly associated with leadership or archetypal symbolism, scattered across folklore, popular music or cultural myth. ‘Nanny’ symbolizes care and a guiding figure generally honored in stories like Mary Poppins or culturally significant roles. ‘Queen’ denotes leadership and commanding presence in numerous tales and allegories. ‘Jenny’, name used in different pop cultures such as pop songs or media, and might represent strength in character or vivacity aligned like queen characters. Lastly, ‘Shannon’ present frequently woeful beyond beginnings characterized in lyrics or culture aligns affinities theme once considered courageousness like previous entries.
Guess already made

Guess: [‘vixen’, ‘jenny’, ‘fey’, ‘queen’]
Reasoning: The connection comes from an association with female characters and names linked to mythical or cultural symbols. In this group, ‘vixen’ refers to a female fox and is also listed as one of the reindeer with gendered connotations. ‘jenny’ is a diminutive of the popular name Jennifer, often associated with feminine traits in various stories. ‘fey’ relates to fairy-like creatures typically portrayed as feminine figures in folklore. Finally, ‘queen’ signals leadership and majesty, often shown as strong feminine roles in mythology and storytelling.
Win: False

Guess: [‘rudolph’, ‘vixen’, ‘shannon’, ‘nanny’]
Reasoning: The connection can be made through the theme of Christmas reindeer names from the famous song ‘Rudolph the Red-Nosed Reindeer’. In this group, ‘Vixen’ and ‘Rudolph’ are famous reindeer mentioned in the song. The name ‘Fey’ doesn’t directly relate to a reindeer but carries whimsical, magical qualities that fit well with the concept of flying reindeer associated with Santa Claus. While not related to Santa’s original sleigh team, exploring other names among festive traditions paired with untold poeticess reveals strong aesthetically fitting infusion where traditionally acknowledged frameworks appearing timeless annual carry subliminal cultural lure throughout copies scope embedded endlessly unre Nanny reflects samples metaphorical casual siding redefining raw inference quiet essence embracing bold spirited coloring misshapen true land everlasting solid fortress daring wrath entirely yet.

You have reached the maximum number of guesses


Model: gpt-4o-mini

Guess: [‘moon’, ‘star’, ‘comet’, ‘planet’]
Reasoning: I chose ‘moon’, ‘star’, ‘comet’, and ‘planet’ because these items all relate to astronomical phenomena and celestial bodies. They represent different categories of celestial objects, which are central to our understanding of the universe.
Win: True

Guess: [‘nanny’, ‘hawkeye’, ‘robin hood’, ‘cupid’]
Reasoning: All four items are associated with characters or archetypes that exhibit valor or protectiveness. Nanny depicts a caregiver role, similar to how Hawkeye from Marvel is seen as a protector and sharpshooter. Sagittarius, representing adventure and exploration, connects to the heroic aspects of Robin Hood’s character, who is often portrayed as a champion of the downtrodden. The name ‘Cupid’ symbolizes love and protection, which resonates with the nurturing qualities of a nanny and the heroism in protecting one’s loved ones.
Win: False

Guess: [‘sagittarius’, ‘cupid’, ‘fey’, ‘vixen’]
Reasoning: The four selected items are all linked through themes of mythology or folklore, as well as transformations. ‘Sagittarius’ and ‘Cupid’ both originate from astrological and mythological contexts in which strong themes of archery are present. ‘Fey’ refers to mystical beings who often appear in folklore, while ‘Vixen’ is a term that can imply a cunning female figure, often found in tales and mythology. Together, they reflect the blend of magic, skill, and archetypal roles found in traditional folklore.
Win: False

Guess: [‘robin hood’, ‘hawkeye’, ‘shannon’, ‘vixen’]
Reasoning: The four selected items are all tied to themes of characters frequently depicted as clever, mischievous, or stealthy figures in folklore and popular culture. ‘Robin Hood’ is known for his cunning and ability to outsmart the authorities. ‘Hawkeye’ is a skilled marksman who often employs stealth and wit. ‘Shannon’ can be associated with cunning (e.g., the Irish name’s connotation). Lastly, ‘Vixen’ implies trickiness or playfulness often exemplified in folklore as well. Thus, they connect through their cleverness and skills in navigating challenges.
Win: False

Guess: [‘nanny’, ‘queen’, ‘rudolph’, ‘cupid’]
Reasoning: The selected items focus on characters and archetypes related to guardianship and iconic figures aligned with certain myths or stories. ‘Nanny’ is often seen as a caregiver and protector. ‘Queen’ symbolizes authority and leadership, often depicted as a guardian of her realm. ‘Rudolph’ represents hope and guidance through difficult times, particularly in folklore around Christmas. Lastly, ‘Cupid’ embodies love’s protection power. Together, they create a strong connection through themes of guardianship and heroism in different cultural narratives.
Win: False

Guess: [‘strong’, ‘hawkeye’, ‘sagittarius’, ‘jenny’]
Reasoning: These four items represent elements associated with strength and resilience in different contexts. ‘Strong’ signifies physical or emotional strength. ‘Hawkeye’, as a skilled archer, demonstrates exceptional ability and determination in combat settings. ‘Sagittarius’ represents the astrological sign of those born under it, often characterized by their philosophical outlook and adventurous spirit—qualities seen as forms of inner strength. Lastly, ‘Jenny’, particularly in the organization of children’s literature (like ‘Jenny and the Strong Ones’), can illustrate themes of bravery and perseverance. Hence, these items are interconnected through themes of strength and resilience.

You have reached the maximum number of guesses


Model: gpt-4

Guess: [‘planet’, ‘moon’, ‘star’, ‘comet’]
Reasoning: The chosen items all represent astronomical entities or celestial objects.
Win: True

Guess: [‘hawkeye’, ‘robin hood’, ‘vixen’, ‘rudolph’]
Reasoning: The selected connection group represents some famous fictional characters, known for either their strength or archery ability.
Win: False

Guess: [‘sagittarius’, ‘robin hood’, ‘cupid’, ‘rudolph’]
Reasoning: The connection in these groups are they are all popular characters from myths, legends and stories.
Win: False

Guess: [‘nanny’, ‘jenny’, ‘shannon’, ‘fey’]
Reasoning: The selected group consisting of ‘nanny’, ‘jenny’, ‘shannon’, and ‘fey’ are all common female first names throughout English-speaking cultures.
Win: False

Guess: [‘queen’, ‘sagittarius’, ‘cupid’, ‘vixen’]
Reasoning: The connection in created groups is that they all are recognized titles for high power, leadership, or beauty in their particular context. Queen corresponds to a royalty title, Sagittarius is a powerful zodiac sign, Cupid is recognized as the god of love in mythology and Vixen has connotations of an attractive woman.
Win: False

Guess: [‘hawkeye’, ‘robin hood’, ‘sagittarius’, ‘cupid’]
Reasoning: The connection in these groups can be found through their notable shaken characterisation in different contexts. Hawkeye is a master archer from the comic universe, Robin Hood is a legendary heroic outlaw, Sagittarius is linked to ‘The Archer’ zodiac sign, and Cupid is usually portrayed with a bow and arrows.

You have reached the maximum number of guesses


Two side notes to mark so far:

  • My game engine currently returns a WIN if the query has already-won items. Need to be mean and return a DROP. Besides adding to the score this doesn’t affect the actual gameplay as the model is still requested to continue guessing.

  • ChatGPT seems to REFUSE to accurately capture all logs. So, be warned that some models and their guesses will be randomly missed.

  • Should probably use alises for the names and not truncate them. It’s a bit misleading with gemini.