The DOE Framework: How AI Agents Think, Plan & Act

Introduction

Artificial intelligence has come a long way from answering simple questions. Today’s AI agents can browse the web, write and execute code, send emails, manage calendars, and coordinate with other AI agents. All on their own. But how exactly does an AI go from receiving a vague instruction to completing a complex, multi-step task?

That’s where the DOE Framework comes in. Standing for Directive, Orchestration, and Execution, this three-layer model describes how modern agentic AI systems are structured, and why that structure matters for building AI that actually works in the real world.

What Is the DOE Framework?

The DOE Framework is an architectural pattern used in agentic AI systems. Rather than having a single model try to do everything at once, DOE separates the process into three distinct layers, each with a clear responsibility. Think of it like a well-run organisation: someone sets the strategy, someone manages the teams, and someone does the work on the ground.

Layer 1 Directive: The “What & Why?”

The Directive is the mission. It’s the high-level goal or instruction handed to the AI system - either from a human user or from a higher-level AI process. Think of it as the brief you’d give to a team: clear on the outcome, but not spelling out every step.

A good directive captures intent. It doesn’t need to say “go to this URL, click this button, copy this text” - it simply states what needs to be accomplished. The rest of the system figures out the how.

Example directive:

“Research our top 3 competitors and write a summary of their pricing pages.”

The directive layer is where human intent enters the system. Getting this right being clear and specific about the goal is often the most important factor in whether an agentic system succeeds or fails.

Layer 2 Orchestration: The “How & Who?”

Orchestration is the brain of the operation. Once the directive is clear, this layer takes over and asks: What steps are needed? In what order? Which tools or agents should handle each part? It’s the planner, the project manager, and the coordinator all rolled into one.

This is where the AI breaks the goal into sub-tasks, routes each one to the right tool or specialist agent, and monitors how the pieces come together. Popular frameworks like LangGraph, AutoGen, and CrewAI are purpose-built to handle this layer.

Orchestration in action:

1
2
3
4
5
Step 1: Search web for Competitor A pricing
Step 2: Search web for Competitor B pricing
Step 3: Search web for Competitor C pricing
Step 4: Pass all results to writing agent
Step 5: Review output for accuracy

A good orchestration layer doesn’t just sequence tasks. It handles failures gracefully, retries when needed, and adapts the plan when results come back unexpected.

Layer 3 Execution: The “Doing”

Execution is where things actually happen. Individual agents or tools carry out the specific actions they’ve been assigned. Browsing a webpage, querying a database, running a script, generating a report. Each executor is focused and specialised: it does one thing well and returns its result.

Crucially, results flow back up to the orchestration layer. If something goes wrong or a result is unexpected, the orchestrator can adapt, reassign the task, retry with different parameters, or escalate back to the directive level for clarification.

Execution outputs example:

Web agent returns raw pricing data
Writing agent produces a structured summary
Review agent checks for accuracy and completeness
Final output is delivered to the user

The Full Flow

1
2
3
4
5
6
7
8
9
User Goal
   ↓
[DIRECTIVE]      →  defines the mission
   ↓
[ORCHESTRATION]  →  plans, routes, coordinates agents & tools
   ↓
[EXECUTION]      →  agents act, tools run, results are produced
   ↓
Final Output

Each layer feeds into the next, and results bubble back up, creating a feedback loop that lets the system self-correct and adapt in real time.

Real-World Examples

Directive	Orchestration	Execution
“Book me a flight to Sydney next Friday”	Search flights → compare prices → check calendar → select best option	Travel agent, calendar agent, and booking tool fire in sequence
“Fix the bug causing checkout to fail”	Read error logs → identify affected code → generate fix → run tests	Code reader, code editor, and test runner each complete their task
“Send a weekly performance report to the team”	Pull analytics → generate charts → write summary → format email → send	Analytics agent, writing agent, and email tool each handle their part

Why Does This Architecture Matter?

You might be wondering: why not just throw everything at one big model and let it figure it out? There are a few good reasons why the layered DOE approach wins in practice.

Modularity : Each layer can be improved independently. Better orchestration logic doesn’t require rewriting your execution agents, and vice versa.

Scalability : Orchestration can spin up multiple execution agents in parallel, making complex tasks faster without sacrificing accuracy.

Resilience : When execution fails, the orchestrator adapts. The system retries, reroutes, or escalates , rather than crashing silently.

Clarity : Separating intent from planning from action makes AI systems easier to debug, audit, and align with human goals.

Conclusion

The DOE Framework is ultimately about bringing order to a complex problem. Building AI that can handle real-world tasks isn’t just about smarter models — it’s about smarter architecture. By separating the what from the how from the doing, we get systems that are more capable, more reliable, and easier to reason about.

As agentic AI continues to mature, frameworks like DOE will become foundational knowledge, not just for AI researchers, but for anyone building software in an AI-native world. The future of AI isn’t one model doing everything, it’s many agents, each doing one thing well, coordinated by a system that understands the bigger picture.

D · O · E : Directive. Orchestration. Execution. Three layers. One intelligent system.

Introduction#

What Is the DOE Framework?#

Layer 1 Directive: The “What & Why?”#

Layer 2 Orchestration: The “How & Who?”#

Layer 3 Execution: The “Doing”#

The Full Flow#

Real-World Examples#

Why Does This Architecture Matter?#

Conclusion#