codex-delegator
eddiearc/codex-delegatorAutomatically delegate complex, logic-intensive tasks to OpenAI Codex CLI via `codex exec --full-auto`. Claude Code uses this skill to invoke Codex for complex backend logic, intricate algorithms, or persistent bugs. Enables seamless AI-to-AI collaboration where Claude Code analyzes and Codex executes.
SKILL.md
name: codex-delegator
description: Automatically delegate complex, logic-intensive tasks to OpenAI Codex CLI via codex exec --full-auto. Claude Code uses this skill to invoke Codex for complex backend logic, intricate algorithms, or persistent bugs. Enables seamless AI-to-AI collaboration where Claude Code analyzes and Codex executes.
Codex Delegator
Overview
This skill enables Claude Code to automatically delegate complex, challenging tasks to OpenAI Codex CLI using codex exec --full-auto. When Claude Code encounters tasks that require different problem-solving approaches, deep logical analysis, or tasks that have proven resistant to repeated attempts, it can seamlessly invoke Codex to provide fresh perspectives and alternative solutions. The delegation happens automatically and transparently, with Claude Code handling context preparation, execution, and solution validation.
How Automated Delegation Works
When Claude Code determines a task is suitable for delegation:
-
Analysis Phase: Claude Code analyzes the task complexity, context, and requirements
-
Decision: Determines if delegation would be beneficial based on:
- Task has been attempted 2+ times without success
- High logic complexity (nested conditions, complex algorithms)
- Backend/algorithm intensive work
- Need for different problem-solving approach
-
Delegation: Automatically invokes Codex:
codex exec --full-auto "detailed task context with: - Problem description - Architecture and constraints - Previous attempts and failures - Success criteria" -
Validation: Claude Code reviews Codex's solution for correctness and completeness
-
Integration: Returns validated solution to user with transparency about using Codex
User Transparency: Claude Code will inform you when it delegates to Codex, e.g., "I'm using Codex to generate this complex backend logic..."
When to Use This Skill
Activate this skill specifically for:
-
Complex Backend Logic
- Intricate business logic implementations
- Complex data processing pipelines
- Sophisticated algorithm implementations
- Multi-layered service architectures
- Advanced state management systems
-
Logic-Intensive Problems
- Complex conditional logic with many edge cases
- Intricate data transformations
- Complex query optimization
- Advanced caching strategies
- Sophisticated error handling flows
-
Persistent Unsolved Problems
- Bugs that remain after multiple fix attempts
- Performance issues that resist optimization
- Race conditions and concurrency problems
- Memory leaks that are hard to track
- Integration issues between complex systems
-
When Different Perspective Needed
- Tasks attempted multiple times without success
- Problems requiring alternative approaches
- Situations where fresh analysis would help
- Complex refactoring that's gotten stuck
DO NOT Use This Skill For
- Simple CRUD operations
- Basic UI components
- Straightforward bug fixes
- Simple configuration changes
- General coding questions or tutorials
Quick Decision Framework
Use Codex when:
- ✅ Problem has been attempted 2+ times without resolution
- ✅ Logic complexity score is high (multiple nested conditions, complex state)
- ✅ Backend/algorithm heavy task
- ✅ Need different problem-solving approach
Don't use Codex when:
- ❌ Problem is straightforward
- ❌ First attempt at the problem
- ❌ Simple frontend/styling work
- ❌ Basic setup or configuration
Delegation Workflow
Step 1: Verify Installation
Before delegating, check if Codex is available:
which codex
If not installed:
npm i -g @openai/codex
codex auth
Step 2: Prepare Task Context
Create clear, detailed task description including:
- Problem statement - What needs to be solved
- Context - Relevant code, architecture, constraints
- Attempts made - What has been tried and why it failed
- Expected outcome - Clear success criteria
- Key files - Specific files that need attention
Step 3: Choose Execution Strategy
Strategy A: Interactive Mode (Recommended for Complex Problems)
Use when problem requires exploration and iteration:
cd /path/to/project
codex
Then provide detailed context:
I need help with [problem description].
Context:
- [Architecture overview]
- [Relevant constraints]
- [Previous attempts and failures]
The issue is in these files:
- [file1]: [specific problem]
- [file2]: [specific problem]
Goal: [clear success criteria]
Advantages:
- Can iterate on the solution
- Review changes with
/diff - Undo mistakes with
/undo - Switch models/reasoning levels with
/model
Strategy B: Exec Mode (For Well-Defined Problems)
Use when problem is clear and specific:
codex exec "detailed task description with full context"
Add flags as needed:
--search- For problems requiring up-to-date library knowledge--full-auto- For trusted, well-scoped tasks
Strategy C: Cloud Mode (For Persistent Problems)
Use for problems needing multiple solution attempts:
codex cloud exec --env ENV_ID --attempts 3 "complex problem description"
Advantages:
- Multiple solution attempts (best-of-N)
- Asynchronous execution
- Good for trial-and-error scenarios
Step 4: Monitor and Validate
In interactive mode:
- Use
/diffto review changes before accepting - Use
/undoif approach is wrong - Use
/reviewto get Codex's own code review
After execution:
- Run tests to verify solution
- Check edge cases
- Validate performance improvements
- Document the solution approach
Step 5: Resume or Pivot
If problem persists:
# Resume previous session
codex resume
# Or try different model/reasoning level
codex
/model # Switch to different model or higher reasoning
Effective Task Delegation Examples
Example 1: Complex Backend Logic
Scenario: Implementing sophisticated multi-tenant data isolation with complex permission rules.
cd /path/to/project
codex
I need to implement row-level security for a multi-tenant application.
Requirements:
- Each tenant can only access their own data
- Admin users can access all tenants
- Super admins can impersonate any user
- Audit all data access
Current architecture:
- PostgreSQL database
- Node.js/Express backend
- Using Sequelize ORM
Files involved:
- src/middleware/tenancy.js
- src/models/User.js
- src/policies/access-control.js
Previous attempts:
1. Tried global Sequelize scopes - leaked data in JOIN queries
2. Tried middleware checks - inconsistent across endpoints
3. Current approach using hooks - performance issues
Goal: Bulletproof tenant isolation with good performance
Example 2: Persistent Bug
Scenario: Race condition causing intermittent failures.
codex exec --search "Debug and fix race condition in payment processing:
Context:
- Stripe webhook handler in src/webhooks/stripe.js
- Order service in src/services/orders.js
- Redis cache for order status
Problem:
- 5% of payments succeed but orders stay in 'pending' state
- Happens only under high load
- Attempted fixes:
1. Added database transaction - didn't help
2. Increased Redis TTL - still fails
3. Added retry logic - made it worse
Stack trace (intermittent):
[paste stack trace]
Need: Root cause analysis and fix with proper synchronization"
Example 3: Complex Algorithm
Scenario: Optimizing complex matching algorithm.
cd /path/to/project
codex
Need to optimize recommendation engine in src/algorithms/matching.js
Current implementation:
- O(n²) complexity with nested loops
- Processes 10k items in 30 seconds (too slow)
- Need to handle 100k+ items
Constraints:
- Must maintain ranking accuracy
- Memory limit: 2GB
- Real-time updates required
Attempted optimizations:
1. Added caching - helped but not enough
2. Tried batch processing - broke real-time requirement
3. Implemented early termination - minimal impact
Goal: Sub-second processing for 100k items
Advanced Techniques
Using Enhanced Reasoning
For extremely complex problems, request higher reasoning effort:
codex
/model # Choose GPT-5 or increase reasoning level
Then provide the complex problem.
Configuring AGENTS.md for Complex Tasks
Create project-specific guidelines:
codex
/init
Edit AGENTS.md to include:
- Architecture constraints
- Code style requirements
- Testing requirements
- Performance benchmarks
- Security considerations
Leveraging MCP for Enhanced Context
Add relevant MCP servers for domain-specific knowledge:
codex mcp add <database-schema-server>
codex mcp add <api-documentation-server>
Multi-Attempt Strategy
For very difficult problems:
# Try 4 different approaches
codex cloud exec --env ENV_ID --attempts 4 "complex problem"
When to Resume vs Start Fresh
Resume session when:
- Continuing work on same problem
- Codex needs more context from discussion
- Iterating on partial solution
Start fresh when:
- Previous approach was completely wrong
- Need different perspective
- Session has gotten too long/confused
Validating Solutions
After Codex provides solution:
-
Code Review
# In Codex interactive mode /review -
Run Tests
npm test # or appropriate test command -
Performance Testing
- Benchmark critical paths
- Load testing for backend changes
- Profile memory usage
-
Security Review
- Check for injection vulnerabilities
- Validate input sanitization
- Review authentication/authorization
Troubleshooting Task Delegation
Codex Produces Incomplete Solution
- Provide more specific context
- Break problem into smaller sub-tasks
- Use interactive mode instead of exec mode
- Switch to higher reasoning model
Solution Doesn't Work
- Use
/undoto rollback - Provide error messages and stack traces
- Clarify constraints and requirements
- Try
/reviewto get Codex to check its own work
Codex Misunderstands Requirements
- Resume session and clarify
- Provide concrete examples
- Show what NOT to do
- Reference specific code patterns to follow
Integration with Claude Code Workflow
This skill enables seamless AI-to-AI collaboration:
Automated Workflow
- User Request: "Fix this race condition bug that I've been trying to solve for hours"
- Claude Code Analysis: Recognizes this fits delegation criteria (persistent problem, complex)
- Automatic Delegation:
codex exec --full-auto "Debug race condition in payment processing: [Full context from previous attempts] [Architecture details] [Attempted fixes and why they failed]" - Codex Execution: Analyzes, generates solution, applies fix
- Claude Code Validation: Reviews solution, runs tests, checks integration
- User Response: "I've used Codex to fix the race condition. The issue was... [explanation]"
Manual Workflow (Still Supported)
Users can also manually invoke Codex following the guidance in this skill for more control over the delegation process.
Cost and Performance Considerations
Codex is cost-effective for:
- Complex problems requiring deep analysis
- Tasks needing multiple solution attempts
- Problems that would take many iterations
Use Claude Code instead for:
- First attempts at problems
- Straightforward implementations
- Simple bug fixes
Resources
Reference Documentation
See references/execution_strategies.md for:
- Detailed command syntax
- Complex task template examples
- Troubleshooting patterns
- Performance optimization techniques
Load this reference when detailed command syntax or advanced patterns are needed.
Quick Reference Commands
# Installation
npm i -g @openai/codex
codex auth
# Interactive mode (most common for complex tasks)
cd /path/to/project
codex
# Exec mode with context
codex exec "detailed task with full context"
# Multi-attempt for difficult problems
codex cloud exec --env ENV_ID --attempts 3 "complex task"
# Resume previous session
codex resume
# Key slash commands in interactive mode
/model # Switch models/reasoning
/diff # Review changes
/undo # Rollback
/review # Code review
External Resources
- Official documentation: https://developers.openai.com/codex/cli/
- GitHub repository: https://github.com/openai/codex
- Command reference: https://developers.openai.com/codex/cli/reference/
Success Metrics
Track when delegation is effective:
✅ Success indicators:
- Problem solved after delegation
- Solution more elegant than previous attempts
- Performance improvements achieved
- Bug fixed permanently
❌ Failure indicators:
- Problem still unsolved
- Solution too complex
- Introduced new bugs
- Didn't understand requirements
Adjust delegation strategy based on these outcomes.