1. ChatGPT plans. Codex changes code.
Use ChatGPT Instant for requirements, options, user stories, acceptance criteria, UI text, and test ideas. Use Codex when it needs the repository or must edit/run code.
Use the right tool, the right model, the right thread, and the right amount of context. The aim is not to stop AI use — it is to stop paying premium credits for routine work.
Built from public OpenAI and Microsoft guidance checked on 30 June 2026. Your workspace policy always wins where it is stricter.
These are designed to be simple enough for every department, but still useful for developers doing real work.
Use ChatGPT Instant for requirements, options, user stories, acceptance criteria, UI text, and test ideas. Use Codex when it needs the repository or must edit/run code.
Company default: use GPT-5.4 or GPT-5.4 mini for routine work. Escalate to GPT-5.5 only for complex debugging, architecture, cross-cutting refactors, or genuinely blocked tasks.
Fast mode is useful when speed matters more than credits. Otherwise, leave it off. It gives a speed boost but consumes credits at a higher rate.
Long chats carry old context. Keep one thread for one active task, then ask for a handoff summary and start a new thread for the next task.
Point Codex to files, folders, selected code, errors, commands, and acceptance criteria. Avoid “look through the whole repo” unless that is truly necessary.
For complex or unclear work, use Plan/Chat mode first. Approve the plan, then allow implementation. This prevents long, expensive wrong turns.
Ask for diffs, summaries, file paths, and test results — not full file dumps. Output tokens are usually much more expensive than input tokens.
Shorten AGENTS.md, nest instructions near the relevant code, and disable unused MCP servers. Every automatic instruction and tool can add context.
Admins should check credits by user, model, surface, and use case. Coach outliers before buying more credits.
This is the recommended flow for both developers and non-developers building scripts, dashboards, or simple internal apps.
Answer four questions to decide whether to use ChatGPT, Codex chat/plan, or Codex agent mode.
Start outside Codex. Get the requirements, plan, acceptance criteria, and tests right before using coding credits.
This uses public Codex token-based rates for GPT-5.5, GPT-5.4, and GPT-5.4 mini. It is an estimate, not a billing statement.
Fast mode applies to GPT-5.5 and GPT-5.4 only.
Risk: normal.
The same rules apply to everyone, but the default workflow changes slightly by job type.
GPT-5.4, medium reasoning, Fast off. Use selected files and clear acceptance criteria.
Use GPT-5.5 for complex debugging, architecture, multi-file refactors, performance problems, or when GPT-5.4 is stuck.
Parallel cloud runs without a plan, whole-repo prompts, verbose file dumps, unused MCP servers, and “keep going until done”.
Use ChatGPT first to define the app, spreadsheet workflow, automation, or report. Produce a simple build brief.
Ask Codex to build one small part at a time: one page, one script, one function, one bug fix.
Asking Codex to “make an app” from a vague idea. Vague prompts create more attempts, more rework, and more credits.
Review weekly usage by model, user, product surface, and task type. Look for repeated high-credit patterns.
Coach people on prompt hygiene before restricting useful work. Most savings come from better defaults and handoffs.
Set soft limits, alerts, and an escalation route for GPT-5.5, Pro, Extra High, Fast mode, and long-running automations.
These templates reduce rambling, keep context focused, and make the ChatGPT-to-Codex handoff cleaner.
I need a concise implementation plan for: [outcome]. Do not write code yet. First clarify the goal, users, constraints, data inputs, and acceptance criteria. Ask up to 5 questions only if they materially change the answer. Output format: 1. Recommended approach 2. Step-by-step plan 3. Risks / assumptions 4. Acceptance criteria 5. Test cases Keep it under 700 words.
Implement the plan below in this repository. Model policy: use GPT-5.4 unless the task is genuinely blocked or complex enough to justify GPT-5.5. Keep Fast mode off. Start in Plan/Chat mode. Do not edit files until you have identified the likely files, risks, test command, and minimal diff plan. Goal: [goal] Files/areas to inspect first: [paths] Acceptance criteria: [criteria] Constraints: [do not rules] Plan from ChatGPT: [paste concise plan]
Bug: [what is broken] Expected: [expected behaviour] Actual: [actual behaviour] Evidence: [error, stack trace, failing test, screenshot] Relevant files: [paths] Please diagnose first. Return: 1. Likely cause 2. Minimal fix plan 3. Files you need to edit 4. Test command Do not rewrite unrelated code. Keep output to the plan and diff summary unless I ask for full files.
Summarise this thread into a fresh-start handoff. Include only: - Goal - Decisions made - Important constraints - Files changed or inspected - Commands/tests run - Current status - Known issues - Next recommended step Maximum 15 bullets. Do not include the full conversation.
I want to build a simple internal tool for: [business process]. Before writing code, help me make this buildable: 1. Who will use it? 2. What data comes in? 3. What should the user see or do? 4. What output should it produce? 5. What security or approval rules matter? 6. What is the smallest useful first version? Then produce a one-page Codex build brief with acceptance criteria.
Be concise. Do not paste entire files unless necessary. Prefer: - changed file paths - diff summary - commands run - test result - remaining risks If you need to show code, show only the changed block.
Starting new threads can be efficient, but only when the old context is no longer useful. Continuing can be efficient when Codex genuinely needs the same context.
Use this as a simple internal standard. Replace names if your workspace model list changes.
| Use case | Default | Escalate only when… | Avoid |
|---|---|---|---|
| Planning, requirements, acceptance criteria | ChatGPT Instant | Use Thinking once for complex trade-offs or ambiguous architecture. | Do not use Codex just to brainstorm. |
| Small code tweak, test, script, CSS, copy changes | Codex GPT-5.4 mini or GPT-5.4 | Use GPT-5.5 only if the smaller model repeatedly misses the issue. | Fast mode and full repo scans. |
| Normal feature or bug fix | Codex GPT-5.4 | GPT-5.5 if multi-file reasoning, hidden dependencies, or hard debugging are involved. | “Keep going” loops without acceptance criteria. |
| Complex refactor, architecture, security-sensitive work | Plan first, then Codex GPT-5.5 if justified | Fast mode only for urgent blockers where time matters more than credits. | Starting with Fast mode before scope is clear. |
| Code review | Target specific PR risks and changed files | Ask for deeper review only on high-risk areas. | Repeated broad reviews with no changes. |
When you spot one of these, pause and reset the prompt before sending another high-cost request.
Point it to likely files first. Use whole-repo exploration only when discovery is the task.
Turn the idea into a one-page brief before Codex writes code.
Ask for a 15-bullet summary and start fresh.
Ask for changed blocks, diffs, and test results instead.
Fast mode is an exception, not a default.
Parallel work is powerful, but expensive when the plan is vague.
Disable tools you are not using for the task.
Use GPT-5.4 or mini first unless the task meets the escalation criteria.
The goal is to coach better use before buying more credits. Use controls to prevent surprises, not to discourage good AI work.
These links are included so the owner of the SharePoint page can re-check model names, limits, and rates when OpenAI updates the product.
Print note: interactive elements are expanded where possible, but the calculator requires JavaScript.