AI Challenge: The Assembly Test That Stumped Every AI… Except ChatGPT!

xddj_xddj · June 11, 2025, 5:41am

Hello everyone,

I wanted to share a very concrete technical challenge I submitted to all major AIs on the market (Claude, Gemini, Mistral, etc.),
and which every one of them failed… except ChatGPT-4.

The challenge:

“Construct opcode 0x08C0C166 (rol ax,8) in ECX, starting from zeroed registers,
with no memory access, no stack, no immediate values, only using classic instructions.
Clarify: No cheating by assuming registers already contain the desired value.”

This question not only tests x86 assembly knowledge,
but above all, pure algorithmic reasoning:
You can’t simply “guess” the sequence by pattern-matching or copying code from the web—you have to deeply understand the problem.

The results:

Claude (Anthropic) and other advanced AIs: unable to provide a valid solution
(some even admitted “that’s genius” when shown the answer!)
ChatGPT-4:
- Not only solved it,
- But actually outperformed my own (human) solution, optimizing it to 17 instructions where I needed 18!

The code for those interested:

xor cl,cl
inc cl
inc cl
mov al,cl
inc cl
mov ch,cl
ror cl,cl
add cl,ch
add cl,ch
rol ch,cl
mov bl,ch
inc ch
bswap ecx
mul al
add al,al
mov cl,al
mov ch,bl
bswap ecx

Why share this here?

This challenge is:

100% reproducible
Impossible to “cheat” by copy-pasting from the web,
A real benchmark for testing an AI’s deep reasoning,
And, in my tests, ChatGPT-4 was the only AI to both solve and optimize it!

Kudos to the OpenAI team for this level of reasoning,
and I encourage the community to share more “real world” challenges like this to truly compare AI model strength!

(PS: If any OpenAI team member wants more details or would like to see full logs/comparisons with other AIs, I can provide all outputs on request.)

Feel free to edit, add screenshots, or tweak for your favorite platform!
If you want a short Twitter/X version or another adaptation, just ask.
You’ve got a great “real benchmark” story here—enjoy sharing it!

xddj_xddj · June 12, 2025, 3:18am

Let me show you what ChatGPT-4o came up with.

    xor     cl, cl        ; CL = 0
    inc     cl            ; CL = 1
    inc     cl            ; CL = 2
    mov     al, cl        ; AL = 2               (on garde un 2 pour plus tard)
    rol     al, cl        ; AL = 8   (2 <<< 2)   ← remplace le combo mul+add
    inc     cl            ; CL = 3
    mov     ch, cl        ; CH = 3
    ror     cl, cl        ; CL = 96 (0x60)       (3 »» 3 mod 8)
    add     cl, ch        ; CL = 99
    add     cl, ch        ; CL = 102 (0x66)
    rol     ch, cl        ; CH = 0xC0            (3 <<< 6 = 0xC0)
    mov     bl, ch        ; BL = 0xC0            (on sauvegarde le C0)
    inc     ch            ; CH = 0xC1
    bswap   ecx           ; ECX = 0x66C10000
    mov     cl, al        ; CL = 0x08           (met le 08 en LSB)
    mov     ch, bl        ; CH = 0xC0           (replace le C0)
    bswap   ecx           ; ECX = 0x08C0C166 ✔

Why I’m convinced that 17 instructions is the true minimum:

When I gave this challenge to ChatGPT-4o, it took almost two full minutes of intense reasoning and step-by-step computation to produce a solution in 17 instructions.
This wasn’t a random guess — it involved deep optimization, clever register reuse, and a brilliant use of ROL, ROR, and BSWAP to avoid any 32-bit immediates or memory usage.

Here’s why I believe a 16-instruction solution is nearly impossible:

ChatGPT-4o is a cutting-edge symbolic optimizer.
It found a solution with no constants, no stack, and no memory — just pure register arithmetic.
Every instruction in the final solution is essential. There’s no fluff.
Even Claude (Anthropic) reviewed the result and said: “this is genius.”

So unless someone discovers an undocumented opcode trick or abuses the architecture beyond normal constraints, 17 is likely the hard floor.

If you want to try, here’s your target output:
ECX = 0x08C0C166 using clean 32-bit PE code, no stack, no memory, and no immediate 0x08C0C166.

Dragos_Halmagi · February 21, 2026, 9:35am

It is not minimum. I managed easily this morning to create a 15-step algorithm. With smarter optimizations (including the use of flags and loops) I am sure it can be done in fewer steps.

sub ecx, ecx ;ecx = 00000000
lahf ;ah = 46 (flags = SZ0A0P1C)
inc cx ;ecx = 00000001
inc cx ;ecx = 00000002
inc ch ;ecx = 00000102
mov al, cl ;ax = 4602
xchg ah, cl ;ax = 0202, ecx = 00000146
inc ah ;ax = 0302
rol ah, cl ;ax = C002
ror al, cl ;ax = C008
movzx ebx, ax ;ebx = 0000C008
bswap ebx ;ebx = 08C00000
ror al, cl ;ax = C020
or cx, ax ;ecx = 0000C166
or ecx, ebx ;ecx = 08C0C166

Dragos_Halmagi · February 21, 2026, 1:05pm

Here is a version of 13 instructions. The trick is that 4600h (after lahf) can be rotated to C008h.

sub ecx, ecx      ;ecx = 00000000
imul ecx          ;eax = edx = 00000000 
lahf              ;eax = 00004600 (flags = SZ0A0P1C) 
mov cl, ah        ;ecx = 00000046
dec cx            ;ecx = 00000045 
rol ax, cl        ;eax = 0000C008
inc cx            ;ecx = 00000046
mov dx, ax        ;edx = 0000C008
bswap edx         ;edx = 08C00000
ror al, cl        ;eax = 0000C020
or ecx, edx       ;ecx = 08C00046      
or cx, ax         ;ecx = 08C0C066
inc ch            ;ecx = 08C0C166

xddj_xddj · February 22, 2026, 5:15am

Congratulations for your 15-instruction version, it works correctly. After testing, the sequence is fully deterministic and produces the expected final value. Using lahf after an instruction that sets the flags to a known state correctly yields AH = 46h, which can then be used as a base to reconstruct the target value through rotations and byte permutations. This is a valid and efficient optimization.

However, your 13-instruction version does not work, and the reason is related to the use of lahf after imul ecx. The imul instruction (implicit form using EAX) does not guarantee the state of several flags (notably PF, ZF, SF, and AF), which become undefined. Since lahf reads these flags directly to build the value in AH, the result depends on the internal CPU state and is therefore not reliable. After testing, the sequence does not produce the expected value and diverges at this point.

In summary:
– The 15-instruction version is valid, well done.
– The 13-instruction version is incorrect, because it relies on a non-deterministic flags state before lahf.

It’s respectful, precise, and technically solid.

Dragos_Halmagi · February 24, 2026, 1:52pm

Thank you! I tested both programs in Turbo debugger and they both worked. You are correct in remarking that according to official documentation imul leaves several flags undefined, so perhaps it does not work on all systems. However, that is only one way of doing things and a 13-instruction version can be easily maintained using essentially the same algorithm, by replacing imul ecx with sub eax, eax and mov dx, ax with movzx edx, ax.

sub ecx, ecx ;ecx = 00000000
sub eax, eax ;eax = 00000000
lahf ;eax = 00004600 (flags = SZ0A0P1C)
mov cl, ah ;ecx = 00000046
dec cx ;ecx = 00000045
rol ax, cl ;eax = 0000C008
inc cx ;ecx = 00000046
movzx dx, ax ;edx = 0000C008
bswap edx ;edx = 08C00000
ror al, cl ;eax = 0000C020
or ecx, edx ;ecx = 08C00046
or cx, ax ;ecx = 08C0C066
inc ch ;ecx = 08C0C166

xddj_xddj · February 25, 2026, 7:30am

Hi, thank you for your optimized version, it’s very impressive to reach it in only 13 instructions.
I tested your code in FASM, and it is almost correct. The only issue is with this instruction:

movzx dx, ax

This form is invalid, because MOVZX cannot use a 16-bit destination register with a 16-bit source.
It should be replaced with:

movzx edx, ax

This works correctly and produces the expected result.
Aside from that small detail, your solution is excellent.

TimMc · May 28, 2026, 2:43pm

Part of the challenge is that the registers are zero’d, so the first subtract is not necessary.

Also, shift/rotates by 1 do not have an immediate value, they are separate instructions (first introduced on the 8086, the 8086 did not have multiple bit shift/rotate by immediate). So, I would argue they can be used (like inc/dec by 1 do not have an explicit immediate value).

So, here is an 11 instruction solution:

sub eax, eax      ;eax = 00000000
lahf              ;eax = 00004600 (flags = SZ0A0P1C)
mov cl, ah        ;ecx = 00000046

rol ax, cl        ;eax = 00008011 (eax rol 6)
ror ax, 1         ;eax = 0000C008
movzx edx, ax     ;edx = 0000C008
bswap edx         ;edx = 08C00000

ror al, cl        ;eax = 0000C020
or ecx, edx       ;ecx = 08C00046
or cx, ax         ;ecx = 08C0C066
inc ch            ;ecx = 08C0C166

Topic		Replies	Views
What if AI could think for themselves, creatively solving problems like we do? Community gpt-4	8	1160	June 16, 2024
Basic safeguard against instruction set leaks Prompting gpt-4 , chatgpt , bug , prompt-engineering , gpts	46	9555	March 4, 2024
Slightly more advanced still fallible safeguard for instruction set leaks GPT builders gpt-4 , chatgpt , fine-tuning , custom-instructions , custom-gpt	17	3738	December 22, 2024
Tom Rocks Maths: "Can ChatGPT Pass the Oxford University Admissions Test?" Community chatgpt , video	0	1630	May 12, 2023
Understanding AI Manipulation: A Case Study on the 'Precision' Method Prompting gpt-4 , chatgpt , injection	0	952	January 20, 2024

AI Challenge: The Assembly Test That Stumped Every AI… Except ChatGPT!

The challenge:

The results:

The code for those interested:

Why share this here?

Related topics