How Many Sisters Does Sally Have?


Improving AI Reasoning with Simple Instructions: A Comparative Study

This is a simple demo to show the reasoning capabilities – or lack thereof – of current language models. You can see how “frontier” models like ChatGPT-4o and Claude-3.5-Sonnet outperform most of their smaller counterparts, and also how adding one simple instruction to the prompt can improve a smaller model's reasoning.

Frontier models vs small models

Frontier models are advanced AI systems designed to push the boundaries of what’s possible with artificial intelligence. They typically have more parameters, larger training datasets, and enhanced algorithms that enable them to understand and process information more accurately and effectively than smaller models. These enhancements allow them to excel in complex reasoning tasks, making them more reliable for sophisticated applications.

Why are small models useful? They are typically cheaper, and often faster, than larger models. Their context window – i.e. working memory – is usually smaller to optimise for speed and cost.

However, you can also see how adding the “take a deep breath” instruction improves the reasoning of one of the smaller models.

The prompt we’re going to use is a simple puzzle:

“Sally – a girl – has 3 brothers. Each brother has 2 sisters. How many sisters does Sally have?”

The answer is one. All three brothers share the same 2 sisters, and Sally is one of them, so there is exactly one other girl in the family. Sally has one sister.
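For readers who want to check the answer mechanically, here is a minimal sketch that models the family explicitly and searches for the number of girls that makes the puzzle's statement true. The names `Child`, `sisters_of`, and `solve` are made up for this illustration and are not part of the original demo.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Child:
    name: str
    is_girl: bool


def sisters_of(child, family):
    """All girls in the family other than the child themselves."""
    return [c for c in family if c.is_girl and c != child]


def solve(num_brothers=3, sisters_per_brother=2, max_girls=10):
    """Find a family that fits the puzzle and count Sally's sisters."""
    for num_girls in range(1, max_girls + 1):
        # Sally is always one of the girls; the rest are her sisters.
        girls = [Child("Sally", True)] + [
            Child(f"Sister {i}", True) for i in range(1, num_girls)
        ]
        boys = [Child(f"Brother {i}", False) for i in range(1, num_brothers + 1)]
        family = girls + boys
        # The puzzle's constraint: every brother has the stated number of sisters.
        if all(len(sisters_of(b, family)) == sisters_per_brother for b in boys):
            return len(sisters_of(girls[0], family))
    raise ValueError("no consistent family found")


print(solve())  # prints 1
```

The only family that satisfies the constraint has two girls, and Sally is one of them, which leaves her with one sister.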


How do AI language models do with this puzzle? Let’s see:

ChatGPT-4o-Mini: Sally has 2 sisters


User:

Sally – a girl – has 3 brothers. Each brother has 2 sisters. How many sisters does Sally have?


AI:

Sally has 2 sisters. Each of her 3 brothers shares the same sisters, which includes Sally herself and her 2 sisters.


Model: ChatGPT-4o-Mini


 

Claude-3-Haiku: Sally has 6 sisters


User:

Sally – a girl – has 3 brothers. Each brother has 2 sisters. How many sisters does Sally have?


AI:

Okay, let’s break this down step-by-step:

Sally has 3 brothers
Each of Sally’s 3 brothers has 2 sisters
To find the total number of sisters Sally has, we need to add up the sisters of each of her 3 brothers
So, let’s calculate:

Sally’s 1st brother has 2 sisters
Sally’s 2nd brother has 2 sisters
Sally’s 3rd brother has 2 sisters
Adding these up:

2 + 2 + 2 = 6
Therefore, Sally has 6 sisters.


Model: Claude-3-Haiku
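The mistake in this response is double counting: the brothers share the same sisters, so adding up each brother's count tallies the same two girls three times. A small sketch of the difference, using Python sets (the name "Anna" for the second girl is an assumption for illustration only; the puzzle does not name her):

```python
brothers = ["Brother 1", "Brother 2", "Brother 3"]

# Every brother has the same two sisters. "Anna" is a made-up name.
sisters_by_brother = {b: {"Sally", "Anna"} for b in brothers}

# Adding per-brother counts treats each brother's sisters as different people.
naive_total = sum(len(s) for s in sisters_by_brother.values())

# A set union counts each girl once, however many brothers she has.
distinct_girls = set().union(*sisters_by_brother.values())

# Sally is not her own sister, so remove her from the set.
sallys_sisters = distinct_girls - {"Sally"}

print(naive_total)          # 6
print(len(distinct_girls))  # 2
print(len(sallys_sisters))  # 1
```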


Llama-3: Sally has no sisters


User:

Sally – a girl – has 3 brothers. Each brother has 2 sisters. How many sisters does Sally have?


AI:

A classic lateral thinking puzzle!

Let’s break it down:

Sally has 3 brothers.

Each brother has 2 sisters.

Now, here’s the key insight: Sally is one of those sisters!

So, each brother has Sally as one of their sisters, and another sister (who is also Sally’s sister).

That means Sally has only 1 sister, which is herself!

In other words, Sally doesn’t have any sisters other than herself, because each brother’s “2 sisters” includes Sally herself.

So, the answer is: Sally has no sisters.


Model: Llama-3-70b-Groq


Mixtral gets it right!


User:

Sally – a girl – has 3 brothers. Each brother has 2 sisters. How many sisters does Sally have?


AI:

Sally has 1 sister.

Here’s the logic behind this:

We know that Sally has 3 brothers.
Then we are told that each of her brothers has 2 sisters.
Since all of her brothers share the same sisters, there is only one additional sister besides Sally.
Therefore, Sally has 1 sister.
It’s important to note that the number of sisters each brother has is shared among them, so it doesn’t mean that each brother has 2 different sisters.


Model: Mixtral-8x7B-Chat


GPT-4o gets it right


User:

Sally – a girl – has 3 brothers. Each brother has 2 sisters. How many sisters does Sally have?


AI:

Let’s start by analyzing the family structure given in the problem.

Sally has 3 brothers.
Each brother has 2 sisters.
From the second point, since each of Sally’s 3 brothers has 2 sisters, it implies there are only 2 sisters in total.

To clarify:

The 2 sisters referred to for each brother include Sally and her one other sister.
Thus, Sally has:

1 sister (excluding herself)
Therefore, Sally has 1 sister.


Model: ChatGPT-4o


Claude-3.5-Sonnet gets it right


User:

Sally – a girl – has 3 brothers. Each brother has 2 sisters. How many sisters does Sally have?


AI:

Let’s approach this problem step by step:

We know Sally is a girl and has 3 brothers.
The tricky part is that each brother has 2 sisters.
We know one of those sisters is Sally herself.
So, if each brother has 2 sisters, and one is Sally, there must be one other sister.
This other sister would be the same for all three brothers (just like Sally is a sister to all three).
Therefore, Sally has 1 sister.

To summarize: Sally (a girl) + 1 sister + 3 brothers = a family with 5 children (2 girls and 3 boys).

Would you like me to explain any part of this reasoning in more detail?


Model: Claude-3.5-Sonnet


Now, let’s add the “take a deep breath” instruction and see what happens:

Claude-3-Haiku: Sally still has 6 sisters


User:

Sally – a girl – has 3 brothers. Each brother has 2 sisters. How many sisters does Sally have? Take a deep breath and work on this problem step-by-step.


AI:


Okay, let’s take this step-by-step:

Sally has 3 brothers
Each of Sally’s 3 brothers has 2 sisters
To find the total number of sisters Sally has, we need to add up the number of sisters each of her 3 brothers has
First, let’s find the number of sisters each brother has:

Each brother has 2 sisters
Now, let’s find the total number of sisters Sally has:

Sally has 3 brothers
Each brother has 2 sisters
3 brothers x 2 sisters each = 6 sisters
Therefore, Sally has 6 sisters.


Model: Claude-3-Haiku


No improvement from Claude-3-Haiku, but the deep breath instruction works wonders with ChatGPT-4o-Mini:

ChatGPT-4o-Mini: Sally has 1 sister


User:

Sally – a girl – has 3 brothers. Each brother has 2 sisters. How many sisters does Sally have? Take a deep breath and work on this problem step-by-step.


AI:

To solve the problem, let’s break it down step-by-step:

Identify Sally’s siblings: Sally is a girl and has 3 brothers.
Determine the number of sisters each brother has: Each brother has 2 sisters. Since Sally is one of the sisters, we can deduce the following:
If each brother has 2 sisters and one of them is Sally, that means there is 1 other sister.
Conclusion: Therefore, Sally has 1 sister.
So, the answer is that Sally has 1 sister.


Model: ChatGPT-4o-Mini
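If you want to repeat this comparison yourself, the two prompt variants and a rough answer check can be sketched as below. This is an illustration only: `build_prompt` and `final_answer` are hypothetical helper names, no model API is called, and the regular expression is a simple assumption about how the answers are phrased rather than a robust parser.

```python
import re

PUZZLE = (
    "Sally – a girl – has 3 brothers. Each brother has 2 sisters. "
    "How many sisters does Sally have?"
)
DEEP_BREATH = "Take a deep breath and work on this problem step-by-step."


def build_prompt(puzzle, deep_breath=False):
    """Return the plain puzzle, or the puzzle with the extra instruction appended."""
    return f"{puzzle} {DEEP_BREATH}" if deep_breath else puzzle


def final_answer(response):
    """Return the last 'Sally has N sister(s)' count in a response, or None."""
    matches = re.findall(r"Sally has (?:only )?(\d+|no) sisters?", response)
    if not matches:
        return None
    last = matches[-1]
    return 0 if last == "no" else int(last)


# Check a pasted response against the correct answer of 1.
response = "Conclusion: Therefore, Sally has 1 sister."
print(build_prompt(PUZZLE, deep_breath=True))
print(final_answer(response) == 1)  # True
```

Taking the last match matters because some responses state an intermediate count before reaching their final answer.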



Written by Alastair McDermott

I help business leaders and employees use AI to automate repetitive tasks, increase productivity, and drive innovation, all while keeping a Human First approach. This enables your team to achieve more, focus on strategic initiatives, and make your company a more enjoyable place to work.
