Human-in-the-loop: design for the handoff, not the override
- Sam Wilson
- UX, Human in the Loop
- 06 May, 2026
Most human-in-the-loop systems are built as if the human is a backstop — present to override the mod ...
Evaluating agents when there's no single right answer
- William Jacob
- Evaluation, Agents
- 05 May, 2026
Evaluating a single prompt is hard. Evaluating an agent that runs ten tool calls before answering is ...
Agent memory: episodic, semantic, and what to keep
- Sam Wilson
- Architecture, Memory
- 05 May, 2026
The first agent you build has no memory beyond the current conversation, and that works for about a ...
Memory strategies for long-running agents
- Jane Doe
- Memory, Reliability
- 04 May, 2026
Long-running agents accumulate context. The job of memory design is to decide which slices of that c ...
When the agent fails: recovery patterns that don't loop forever
- Sam Wilson
- Reliability, Agents
- 04 May, 2026
Agent failures don't throw exceptions. They produce plausible-looking output that's wrong, or quietl ...
Multi-agent systems: coordination is the actual hard part
- John Doe
- Architecture, Multi Agent
- 03 May, 2026
Multi-agent architectures are seductive because they map onto how humans organize work: specialists, ...
Planner-executor splits: when to separate them
- Sam Wilson
- Architecture, Agents
- 03 May, 2026
A single model doing both planning and execution feels elegant on day one. By month three, the trace ...
Designing an agent harness that doesn't fight the model
- John Doe
- Harness, Architecture
- 02 May, 2026
The harness around an agent matters more th ...
Tool selection: when the model should pick, and when you should
- William Jacob
- Tools, Agents
- 02 May, 2026
Tool-using agents look powerful in demos because the model is choosing what to do next. They look fr ...
ReAct in production: reasoning that survives sidetracks
- John Doe
- Architecture, ReAct
- 01 May, 2026
ReAct is a clean idea: think, act, observe, repeat. In production, the loop is the part that breaks. ...
Tags
- Agents
- Design
- Memory
- Autonomous
- Claude Code
- Security
- Auto mode
- Permissions
- Agent safety
- Error recovery
- Eval
- Guardrails
- Safety
- Harness
- Orchestration
- Production engineering
- Human in loop
- UX
- Multi agent
- Coordination
- Planning
- Executor
- Postiz
- Social media
- CLI
- Agent tools
- ReAct pattern
- Loops
- Tool use
- Academic research
- Agent
- AI tools
- Financial services
- Reference architecture
- OpenClaw
- Peekaboo
- Computer use
- macOS
- Product Hunt
- AI agents
- Voice AI
- MCP
- Content production
- Trends 2025
- Wiseclaw
- Agent os
- Healthcare AI
- Skill system
- Wisediag