Author: hoeem (@hooeem) Published: March 15, 2026 Source: X Article Stats: 6.9M views · 14K likes · 53K bookmarks · 2K reposts
To become a Claude Architect and develop production-grade applications you need to understand Claude Code, Claude Agent SDK, Claude API, and Model Context Protocols, this article will help you learn everything and is based on the following exam:
However, as you can clearly see to get this “certified” you need to be a Claude partner, otherwise, you cannot take this exam.
BUT DOES THAT EVEN MATTER?
If you have the ability to learn what it takes to become a “Claude Certified Architect” then you’re able to build production-grade applications.
You don’t need the certificate to build production-grade applications.
You just need the knowledge.
So I tore apart the entire exam guide and pulled out what actually matters so that you can become a Claude architect.
WHAT YOU ARE WALKING INTO:
The exam tests you on: Claude Code, Claude Agent SDK, Claude API, and Model Context Protocol (MCP).
WHICH ARE ALL SKILLS YOU CAN MONETISE.
The exam would mean you need to learn the following:
- Customer Support Resolution Agent (Agent SDK + MCP + escalation)
- Code Generation with Claude Code (CLAUDE.md + plan mode + slash commands)
- Multi-Agent Research System (coordinator-subagent orchestration)
- Developer Productivity Tools (built-in tools + MCP servers)
- Claude Code for CI/CD (non-interactive pipelines + structured output)
- Structured Data Extraction (JSON schemas + tool_use + validation loops)
DOMAIN 1: AGENTIC ARCHITECTURE & ORCHESTRATION (27%)
The exam tests three anti-patterns you need to reject on sight: parsing natural language to determine loop termination, arbitrary iteration caps as the primary stopping mechanism, and checking for assistant text as a completion indicator. All wrong.
The single biggest mistake: people assume subagents share memory with the coordinator. They do not. Subagents operate with isolated context. Every piece of information must be passed explicitly in the prompt.
The rule that will save you the most marks: when stakes are financial or security-critical, prompt instructions alone are not enough. You must be enforcing tool ordering programmatically with hooks and prerequisite gates.
Where to learn this:
- Agent SDK Overview — for agentic loop mechanics and subagent patterns
- Building Agents with the Claude Agent SDK — for Anthropic’s own best practices on hooks, orchestration, and sessions
- Agent SDK Python repo + examples — for hands-on code: hooks, custom tools, fork_session
What to build to learn:
A multi-tool agent with 3-4 MCP tools, proper stop_reason handling, a PostToolUse hook normalising data formats, and a tool call interception hook blocking policy violations. This single exercise covers most of Domain 1.
Key Concepts:
Task Statement 1.1 — Agentic Loops: The complete lifecycle: Send request → inspect stop_reason → if tool_use: execute tools, append results, send back → if end_turn: done. Three anti-patterns to reject: parsing natural language for termination, arbitrary iteration caps, checking for text content as completion indicator.
Task Statement 1.2 — Multi-Agent Orchestration: Hub-and-spoke architecture. Coordinator at centre, subagents as spokes. ALL communication flows through coordinator. Critical: subagents do NOT inherit coordinator’s history. Every piece of info must be explicitly passed.
Task Statement 1.3 — Subagent Invocation: The Task tool spawns subagents. Include complete findings from prior agents in prompts. Use structured data formats separating content from metadata. Parallel spawning: emit multiple Task calls in a single response. fork_session for divergent approaches from shared baseline.
Task Statement 1.4 — Workflow Enforcement: Prompt-based = probabilistic (works most of the time). Programmatic = deterministic (works every time). Financial/security/compliance → always programmatic enforcement. Low-stakes → prompt-based is fine.
Task Statement 1.5 — Agent SDK Hooks: PostToolUse hooks normalise data after execution. Tool call interception hooks block/redirect before execution. Hooks = deterministic guarantees. Prompts = probabilistic guidance.
Task Statement 1.6 — Task Decomposition: Fixed sequential pipelines for predictable tasks. Dynamic adaptive decomposition for open-ended investigation. Attention dilution: split large reviews into per-file passes + cross-file integration pass.
Task Statement 1.7 — Session State: --resume for valid prior context. fork_session for divergent exploration. Fresh start with summary injection when context is stale.
DOMAIN 2: TOOL DESIGN & MCP INTEGRATION (18%)
Tool descriptions are the primary mechanism Claude uses for tool selection. If yours are vague or overlapping, selection becomes unreliable.
One sample question presents get_customer and lookup_order with near-identical descriptions causing constant misrouting. The correct fix is not few-shot examples, not a routing classifier, not tool consolidation. The fix is better descriptions.
Know the tool_choice options cold: "auto" (model might return text), "any" (must call a tool, picks which), forced selection (must call a specific tool). Know when each applies.
Giving an agent 18 tools degrades selection reliability. Scope each subagent to 4-5 tools relevant to its role.
Where to learn this:
- MCP Integration for Claude Code — for server scoping, environment variable expansion, project vs user config
- MCP specification and community servers — for understanding the protocol
- Claude Agent SDK TypeScript repo — for tool definition patterns
Key Concepts:
Tool Interface Design: Good descriptions include: what the tool does, expected inputs, example queries, edge cases, explicit boundaries vs similar tools.
Structured Error Responses: Four categories: Transient (retryable), Validation (fix input), Business (not retryable, alternative workflow), Permission (escalation needed). Critical: distinguish access failure vs valid empty result.
Tool Distribution: 4-5 tools per agent, scoped to role. tool_choice: "auto" for general, "any" for guaranteed structured output, forced for mandatory first steps.
MCP Server Integration: Project-level .mcp.json (version-controlled). User-level ~/.claude.json (personal). Environment variable expansion: ${GITHUB_TOKEN}. Use community servers before building custom.
Built-in Tools: Grep = file CONTENTS. Glob = file PATHS. Edit = targeted modifications. Read + Write = fallback when Edit fails. Start with Grep to find entry points, then Read to trace flows.
DOMAIN 3: CLAUDE CODE CONFIGURATION & WORKFLOWS (20%)
This separates people who use Claude Code from people who have configured it for a team.
The CLAUDE.md hierarchy is critical. Three levels: user-level (~/.claude/CLAUDE.md), project-level (.claude/CLAUDE.md), directory-level (subdirectory files). The exam’s favourite trap: a team member missing instructions because they live in user-level config (not version-controlled, not shared).
Path-specific rules are the sleeper concept. .claude/rules/ with YAML frontmatter glob patterns like **/*.test.tsx applies conventions across the entire codebase. Directory-level CLAUDE.md cannot do this because it is directory-bound.
Plan mode vs direct execution:
- Plan mode: monolith restructuring, multi-file migration, architectural decisions
- Direct execution: single-file bug fix, one validation check, clear scope
Know context: fork in skill frontmatter (isolates verbose output). Know -p flag (non-interactive CI/CD). Know an independent review instance catches more than self-review in the same session.
Where to learn this:
- Claude Code official docs — for CLAUDE.md hierarchy, rules directory, slash commands, skills frontmatter
- Claude Code CLI Cheatsheet — for commands, skills, hooks, and CI/CD flags
- Creating the Perfect CLAUDE.md — for real team configuration patterns
Key Concepts:
CLAUDE.md Hierarchy: User-level (personal, NOT shared) → Project-level (team-wide, version-controlled) → Directory-level (directory-specific). Exam trap: new team member missing instructions = user-level config problem.
Custom Slash Commands and Skills: .claude/commands/ = project-scoped. Skills with context: fork isolate verbose output. allowed-tools restricts destructive actions.
Path-Specific Rules: .claude/rules/ with glob patterns. **/*.test.tsx catches every test file regardless of directory. More powerful than directory-level CLAUDE.md for cross-cutting concerns.
CI/CD Integration: -p flag for non-interactive mode. --output-format json with --json-schema for structured output. Independent review instance > self-review in same session.
DOMAIN 4: PROMPT ENGINEERING & STRUCTURED OUTPUT (20%)
Two words will save you across this entire domain: be explicit.
“Be conservative” does not improve precision. “Only report high-confidence findings” does not reduce false positives. What works: defining exactly which issues to report versus skip, with concrete code examples for each severity level.
Few-shot examples are the highest-leverage technique tested. 2-4 targeted examples showing ambiguous-case handling with reasoning for why one action was chosen over alternatives.
tool_use with JSON schemas eliminates syntax errors. But NOT semantic errors. Schema design: nullable fields when source data might be absent (prevents fabricated values), “unclear” enum values, “other” + detail strings.
Message Batches API: 50% savings, up to 24-hour processing, no latency SLA, no multi-turn tool calling. Batch for overnight reports. Synchronous for blocking pre-merge checks.
Where to learn this:
- Anthropic Prompt Engineering docs — for few-shot patterns, explicit criteria, and structured output
- Anthropic API Tool Use documentation — for tool_use, tool_choice config, JSON schema enforcement
Key Concepts:
Explicit Criteria: Specific categorical criteria > vague confidence-based instructions. Define exactly which issues to report vs skip with concrete code examples per severity.
Few-Shot Prompting: 2-4 targeted examples for ambiguous cases. Each shows REASONING for chosen action. Teaches generalisation, not just pattern-matching.
Structured Output with tool_use: Eliminates syntax errors. Does NOT prevent semantic errors, field placement errors, or fabrication. Schema design: nullable fields prevent fabrication. “unclear” enums for ambiguous cases.
Validation-Retry Loops: Effective for format/structural errors. Ineffective for genuinely absent information. Send back: original + failed extraction + specific error.
Batch Processing: 50% cost savings, 24h window, no latency SLA, no multi-turn tool calling. Synchronous for blocking workflows. Batch for overnight/weekly.
DOMAIN 5: CONTEXT MANAGEMENT & RELIABILITY (15%)
Smallest weighting. But mistakes here cascade everywhere.
Progressive summarisation kills transactional data. Fix: persistent “case facts” block with extracted amounts, dates, order numbers. Never summarised. Included in every prompt.
“Lost in the middle” effect: models miss findings buried in long inputs. Place key summaries at the beginning.
Three valid escalation triggers: customer requests a human (honour immediately), policy gaps, inability to progress. Two unreliable triggers the exam will tempt you with: sentiment analysis and self-reported confidence scores.
Error propagation done right: structured context (failure type, attempted query, partial results, alternatives). Anti-patterns: silently suppressing errors or killing entire workflows on single failures.
Where to learn this:
- Building Agents with the Claude Agent SDK — covers context management, error propagation, and escalation design
- Agent SDK session docs — for resumption, fork_session, /compact
- Everything Claude Code repo — for battle-tested context management patterns
Key Concepts:
Context Preservation: Extract transactional facts into persistent block. Trim verbose tool results to relevant fields. Place key summaries at beginning (lost-in-the-middle effect).
Escalation: Three valid triggers: customer requests human, policy gaps, inability to progress. Two unreliable: sentiment analysis, self-reported confidence. Honour explicit human requests immediately.
Error Propagation: Structured error context: failure type, what was attempted, partial results, alternatives. Anti-patterns: silent suppression, workflow termination on single failure.
Codebase Exploration: Scratchpad files for key findings. Subagent delegation for specific investigations. /compact when context fills with verbose output.
Information Provenance: Structured claim-source mappings. Preserve attribution through synthesis. Annotate conflicts with both values, don’t arbitrarily select one.
RECOMMENDED LEARNING FROM ANTHROPIC:
- Building with the Claude API
- Introduction to Model Context Protocol
- Claude Code in Action
- Claude 101
NOW GO AND BECOME AN UNCERTIFIED CLAUDE ARCHITECT (or certified if you’re a partner), EITHER WAY, IT’S TIME TO BUILD!