Hi,
I’m in the Codex developer program. I have a specific architectural suggestion for the superapp that I think is worth 3 minutes of someone’s time.
Your current direction: merge ChatGPT + Codex + Atlas into one app with multiple modes.
What I think it should be: ChatGPT is the only interface. Codex and Atlas are non-blocking background subprocesses that ChatGPT can spawn.
The difference:
- “Merge” means three tools in one window. Users still switch between them.
- “Orchestrate” means one conversation that dispatches work in the background. Users never leave the chat.
Before I explain the architecture, let me show you why this is a competitive opportunity, not just a UX improvement.
-–
Anthropic’s Claude Desktop currently has three modes: Chat, Cowork, and Code. They sit as peer tabs in one app. This is exactly the “merge” approach — and it has two critical problems:
1. They are mutually exclusive. When a user runs a Cowork task, they cannot simultaneously have a Chat conversation. The modes block each other. A 30-minute file organization task means 30 minutes of zero conversation.
2. They impose tool selection on the user. A user has to decide: is this a Chat question, a Cowork task, or a Code job? That decision requires understanding what each mode does, which is a cognitive tax that shouldn’t exist.
Anthropic’s Dispatch feature (sending Cowork tasks from phone to desktop) gets close to the right idea — but it’s bolted on as a feature, not built in as the architecture. The phone is the afterthought, not the primary interface.
If you simply merge ChatGPT + Codex + Atlas into peer tabs the way Anthropic did with Chat + Cowork + Code, you will have replicated their product. And their product has a structural ceiling that users are already hitting.
The opportunity is to skip past that ceiling entirely.
-–
The critical word is non-blocking.
Right now, every execution interface — Codex, Atlas, any agent task, and Anthropic’s equivalents — is blocking. When a task runs, the user waits. The conversation stops. The user’s cognitive flow dies.
This isn’t an efficiency problem. It’s a cognition problem. Human working memory is fragile. Interrupt it for 15 minutes while Codex runs a task, and the user’s train of thought is gone. They have to rebuild context from scratch. Many threads of reasoning never recover — the associations and intuitions that were active in working memory are not reproducible.
The most valuable work happens when reasoning and execution are concurrent, not sequential. A user discusses architecture with ChatGPT, dispatches a scanning task to Codex mid-conversation, continues discussing strategy while Codex works in the background, and when results come back, the conversation context is still warm — they analyze the output immediately with full mental context intact.
No AI product offers this today. Not yours, not Anthropic’s, not Google’s.
-–
The architecture:
1. ChatGPT is the primary interface — cloud-based, accessible from any device, always available. This is the user’s persistent thinking space. It is never blocked by anything.
2. ChatGPT can spawn Codex or Atlas tasks as background subprocesses on the user’s local machine. These appear as task cards in a sidebar — real-time status, expandable into full Codex/Atlas interfaces where the user can directly interact with the executing agent if needed.
3. Every subprocess is non-blocking by default. The main conversation continues while tasks run. Results surface back into the chat stream when ready. The user can click into a subprocess to interact with it directly — that subprocess can have its own back-and-forth with its own agent — but the main ChatGPT thread is never waiting.
4. Each subprocess can run a different model tier. ChatGPT uses the strongest reasoning model for the thinking layer. Background Codex workers can use lighter, faster models for execution. The user can override per-task.
5. The UI is essentially a browser: one persistent main tab (ChatGPT) that can open child tabs (Codex/Atlas instances). The existing Codex and Atlas interfaces don’t need to change — they just become child panels instead of top-level modes.
This requires zero new technology. ChatGPT exists. Codex exists. Atlas exists. MCP exists. The model selector exists. You just change the hierarchy: from three peer tabs to one master process with non-blocking background workers.
-–
The competitive framing:
Anthropic built Chat + Cowork + Code as peer modes. That’s their ceiling. If you merge ChatGPT + Codex + Atlas as peer modes, you match them. If you make ChatGPT the master process with Codex and Atlas as non-blocking workers, you leapfrog them.
The result: one conversation. Everything else is background. Nothing blocks.
This eliminates the concept of tool selection entirely. Users never decide whether to open ChatGPT, Codex, or Atlas. They just talk. The system dispatches. This reduces the barrier from “learn which tool to use” to “know how to speak” — which everyone already does.
First company to ship this doesn’t just have a better product. They redefine the interaction paradigm, the same way iPhone redefined input from “learn to type” to “learn to touch.”
-–
I use ChatGPT and Codex daily across firmware security research, creative writing, and technical analysis. I also use Claude extensively. The mode-switching friction and the blocking execution model are the two biggest limitations across both platforms. Happy to elaborate on any of this if useful.