Incorrect Cyrillic rendering in Codex Agent on Windows due to PowerShell 5.1 default ANSI encoding

IarBest · September 3, 2025, 8:10pm

Description:
When using Codex Agent on Windows (VS Code / Cursor / Windsurf), files containing Cyrillic comments are displayed incorrectly (garbled characters). The agent appears to invoke powershell.exe by default, which uses ANSI instead of UTF-8.

Expected behavior:

Files with UTF-8 encoding (with or without BOM) should display correctly.
Ideally, the agent should:
- Use PowerShell 7 (pwsh) by default (UTF-8 is the default there), or
- Explicitly pass -Encoding UTF8 when reading/writing files, or
- Provide a setting to override the shell used for Agent actions.

Steps to reproduce:

1. Open a UTF-8 encoded JavaScript/TypeScript file with Cyrillic comments in VS Code on Windows.
2. Run Codex Agent and ask it to show the first 10 lines.
3. Observe that Cyrillic characters are replaced with question marks / invalid symbols.
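
The symptom in the last step can be reproduced without Codex or PowerShell. The following is a minimal Python sketch, not the agent's actual code path. It assumes the legacy ANSI code page is Windows-1252 (the real one depends on the system locale), and the sample comment string is made up for illustration.

```python
# Hypothetical sample line; any UTF-8 text with Cyrillic behaves the same way.
text = "// Привет, мир"
utf8_bytes = text.encode("utf-8")

# Case 1: UTF-8 bytes are decoded with an ANSI code page (assumed cp1252 here).
# Each Cyrillic letter is two bytes, so it turns into two unrelated glyphs.
mojibake = utf8_bytes.decode("cp1252", errors="replace")

# Case 2: the text is encoded into a code page that has no Cyrillic letters.
# Every unsupported character becomes a question mark.
question_marks = text.encode("cp1252", errors="replace").decode("cp1252")

# Control: decoding the same bytes as UTF-8 returns the original text.
roundtrip = utf8_bytes.decode("utf-8")

if __name__ == "__main__":
    print("mojibake:      ", mojibake)
    print("question marks:", question_marks)
    print("round trip:    ", roundtrip)
```

Both failure modes lose information only in the second case; mojibake from the first case can sometimes be reversed, question marks cannot.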

Notes:

This does not happen in the built-in chat (Cursor/Windsurf) because text is read directly from the editor buffer.
Only the Agent (when reading/writing via PowerShell) shows corrupted characters.
Using Node.js or pwsh with -Encoding UTF8 works correctly.

Environment:

Windows 11 (latest build, exact version unknown but fully up to date)
VS Code 1.103.2
Codex extension 0.4.0
Node.js 22.17
PowerShell 7.5.2 installed, but Agent still defaults to legacy Windows PowerShell 5.1
Hardware: standard desktop PC

Mohammadr3za · September 10, 2025, 10:18pm

Same problem. All my files get corrupted because Codex uses the wrong encoding for local edits on Windows.

lubacareer · September 16, 2025, 5:31pm

Same problem here. It doesn't seem to happen right off the bat, only after a while.

Jakub_Nowak · February 4, 2026, 10:53am

I ran into the same issue (Polish characters getting replaced with ?). The root cause is the same: Codex Agent on Windows uses PowerShell 5.1 (ANSI console encoding), so non‑ASCII characters get corrupted before they reach Python or get written back to files.

What fixed it for me was forcing UTF‑8 in the PowerShell profile + Python:

chcp 65001 > $null
$utf8NoBom = [System.Text.UTF8Encoding]::new($false)
[Console]::InputEncoding  = $utf8NoBom
[Console]::OutputEncoding = $utf8NoBom
$OutputEncoding = $utf8NoBom

$PSDefaultParameterValues["Out-File:Encoding"] = "utf8"
$PSDefaultParameterValues["Set-Content:Encoding"] = "utf8"
$PSDefaultParameterValues["Add-Content:Encoding"] = "utf8"
$PSDefaultParameterValues["Get-Content:Encoding"] = "utf8"

$env:PYTHONUTF8 = "1"
$env:PYTHONIOENCODING = "utf-8"

After restarting PowerShell, inline scripts like this Polish-character test:
@'
print("zażółć gęślą jaźń")
'@ | python -X utf8 -
print correctly, and files stop getting mangled.
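
To confirm that the profile settings actually reach Python, a small check like the one below can help. This is only a sketch: the function name `encoding_report` is made up here, and it reports what the running interpreter sees rather than proving anything about how Codex launches its shell.

```python
import locale
import os
import sys


def encoding_report():
    """Collect the encoding-related settings visible to this interpreter."""
    return {
        # True when Python runs in UTF-8 mode (PYTHONUTF8=1 or -X utf8).
        "utf8_mode": bool(sys.flags.utf8_mode),
        # Encodings of the standard streams; None if a stream is missing.
        "stdout": getattr(sys.stdout, "encoding", None),
        "stdin": getattr(sys.stdin, "encoding", None),
        # Default encoding used by open() when none is passed explicitly.
        "preferred": locale.getpreferredencoding(False),
        # Raw environment variables set by the PowerShell profile, if any.
        "PYTHONUTF8": os.environ.get("PYTHONUTF8"),
        "PYTHONIOENCODING": os.environ.get("PYTHONIOENCODING"),
    }


if __name__ == "__main__":
    for key, value in encoding_report().items():
        print(f"{key}: {value}")
```

If `utf8_mode` is false or `preferred` still names a legacy code page, the profile was probably not loaded by the shell that started Python.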

Long‑term fix: Codex Agent should default to pwsh (PowerShell 7) or allow a shell override, or always force UTF‑8 for reads/writes.
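
As an illustration of "always force UTF‑8 for reads/writes", here is a minimal Python sketch. It is an assumption about what such a fix could look like, not how Codex is implemented; the helper names `read_text_utf8` and `write_text_utf8` are hypothetical.

```python
def read_text_utf8(path):
    """Read a file as UTF-8, accepting files with or without a BOM."""
    # "utf-8-sig" strips a leading BOM if present and is plain UTF-8 otherwise.
    # newline="" keeps the original line endings (CRLF stays CRLF).
    with open(path, "r", encoding="utf-8-sig", newline="") as handle:
        return handle.read()


def write_text_utf8(path, text):
    """Write a file as UTF-8 without a BOM, leaving line endings untouched."""
    with open(path, "w", encoding="utf-8", newline="") as handle:
        handle.write(text)
```

Passing the encoding explicitly means the result no longer depends on the console code page or the system locale.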

EricGT · February 4, 2026, 8:12pm

Not sure if this will help. I also had a UTF-8 problem on Windows, but it may be a different one.

github.com/openai/codex

Powershell: issue with '' [Console]::OutputEncoding=[System.Text.Encoding]::UTF8'

opened 01:49PM - 23 Jan 26 UTC

alainfrisch

bug windows-os sandbox CLI

### What version of Codex is running?
0.89.0

### What subscription do you have…?
ChatGPT Business

### Which model were you using?
_No response_

### What platform is your computer?
Microsoft Windows NT 10.0.26200.0 x64

### What terminal emulator and version are you using (if applicable)?
Windows Terminal, codex launched from a Cygwin terminal

### What issue are you seeing?
In a given session, most actions by Codex (`Ran Get-Content ...`, `Ran rg ...`) succeed but output:

```
Impossible de d‚finir la propri‚t‚. La d‚finition de propri‚t‚ n'est prise en charge que sur les types principaux dans ce mode de langage.
Au caractŠre Ligne:1 : 1
+ [Console]::OutputEncoding=[System.Text.Encoding]::UTF8;
+ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
    + CategoryInfo          : InvalidOperation : (:) [], RuntimeException
    + FullyQualifiedErrorId : PropertySetterNotSupportedInConstrainedLanguage
```

Note: after the last Codex upgrade, I ran /permissions, and "installed" the new Windows sandbox. Might be related that, since the code produce this UTF-8 hasn't changed recently (last change: https://github.com/openai/codex/commit/33e1d0844a0f4871a3ee74ae601d143a242f10da)

### What steps can reproduce the bug?
Bug submitted with /feedback -> thread ID 019bea83-cfa5-7a73-a0e4-99683fde7892

### What is the expected behavior?
_No response_

### Additional information
_No response_

Could you try disabling the feature and see if that resolves your problem? Add the following to your config.toml file.

[features]
powershell_utf8 = false

_j · February 4, 2026, 8:59pm

The simple fact is that the AI model produces UTF-8 encoded Unicode text. It does not produce output in whatever 8-bit code page you have selected (although ingesting unsanitized bytes of world-language text is a common training fault on OpenAI's part).

Enabling UTF-8 support end-to-end is the solution. PowerShell is not going to translate Unicode code points into the glyphs of a 256-entry code page for you: the meaning of the high-bit characters in such a code page changes with the language set, and in UTF-8 that same high bit marks the bytes of a multi-byte sequence. Codex obviously can't know from a single option whether you are going to see block characters or Shift_JIS.
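
The point about high-bit bytes can be made concrete with a short Python sketch. The sample words are the French ones from the error message quoted earlier in this thread; the pairing of CP850 (written) and Windows-1252 (read) is an assumption that happens to reproduce that garbling, and the helper names are made up for illustration.

```python
# French words as they should have appeared in the quoted error message.
SAMPLES = ["définir", "propriété", "caractère"]


def misread(text, written_as, read_as):
    """Encode text in one code page and decode the bytes in another."""
    return text.encode(written_as).decode(read_as, errors="replace")


def utf8_high_bit_bytes(text):
    """Return the UTF-8 bytes of text that have the high bit set."""
    return [byte for byte in text.encode("utf-8") if byte >= 0x80]


if __name__ == "__main__":
    # The same byte value maps to different glyphs in different code pages.
    for word in SAMPLES:
        print(word, "->", misread(word, "cp850", "cp1252"))
    # Non-ASCII characters in UTF-8 consist only of high-bit bytes.
    print([hex(byte) for byte in utf8_high_bit_bytes("ż")])
```

The match with the quoted output suggests, but does not prove, that the error text was produced in an OEM code page and displayed in an ANSI one.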
