Same strawberry reasoning mistake found in o1 preview version but in different context

Arash10 · September 15, 2024, 9:55am

The reason behind the “strawberry” mistake was that the model grouped the double “rr” and counted it as a single “r”. The same kind of grouping error can show up in business forecasting. When analyzing patterns in time series data, consecutive data points that look similar can be mistakenly treated as one trend, instead of being recognized as separate but related occurrences.

I tested this with sales data from a retail store over a 10-day period that included weekends and a special promotion. Based on the weekend sales, I asked two questions: “What is the average weekend sales?” and “Can you predict the estimated total sales after 10 weekends?”

The key point is that one weekend day (Day 5, a Sunday) included a promotion. That day should be left out of the weekend average, because it is an outlier that skews the forecast. However, the model did not exclude the promotion day. It calculated the average with the outlier included, which led to an inaccurate forecast.

Here is the dataset:

Day	Date	Sales (in $)	Event
Day 1	01-Sep	1,000	Regular Day
Day 2	02-Sep	1,200	Regular Day
Day 3	03-Sep	1,150	Regular Day
Day 4	04-Sep	1,800	Weekend
Day 5	05-Sep	2,000	Weekend + Promotion
Day 6	06-Sep	1,050	Regular Day
Day 7	07-Sep	1,100	Regular Day
Day 8	08-Sep	1,850	Weekend
Day 9	09-Sep	1,900	Weekend
Day 10	10-Sep	1,050	Regular Day
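
To make the difference concrete, here is a rough sketch of the two calculations in Python, using the numbers from the table above. Two things here are my own assumptions and not part of the original test: that a "weekend" means 2 days, and that the forecast is simply the average weekend-day sales multiplied out over 10 weekends. The variable names are just for illustration.

```python
from statistics import mean

# (day, sales in $, event) copied from the table above
sales = [
    (1, 1000, "Regular Day"),
    (2, 1200, "Regular Day"),
    (3, 1150, "Regular Day"),
    (4, 1800, "Weekend"),
    (5, 2000, "Weekend + Promotion"),
    (6, 1050, "Regular Day"),
    (7, 1100, "Regular Day"),
    (8, 1850, "Weekend"),
    (9, 1900, "Weekend"),
    (10, 1050, "Regular Day"),
]

# All weekend days, promotion included (what the model did)
weekend_all = [amount for _, amount, event in sales if event.startswith("Weekend")]

# Weekend days only, promotion day excluded (what I expected)
weekend_regular = [amount for _, amount, event in sales if event == "Weekend"]

avg_all = mean(weekend_all)
avg_regular = mean(weekend_regular)

# Assumption: a weekend is 2 days, and the forecast is a plain
# average-times-days projection with no trend or seasonality.
DAYS_PER_WEEKEND = 2
N_WEEKENDS = 10

forecast_all = avg_all * DAYS_PER_WEEKEND * N_WEEKENDS
forecast_regular = avg_regular * DAYS_PER_WEEKEND * N_WEEKENDS

print(f"Average with promotion:    {avg_all}")
print(f"Average without promotion: {avg_regular}")
print(f"10-weekend forecast with promotion:    {forecast_all}")
print(f"10-weekend forecast without promotion: {forecast_regular}")
```

Under these assumptions the promotion day raises the weekend-day average from $1,850 to $1,887.50, so the 10-weekend forecast comes out at $37,750 instead of $37,000.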
