Agents
The Agent Harness: Why Your Model Isn't the Problem
- William Jacob
- Agents, Architecture
- 11 May, 2026
LangChain jumped from outside the top 30 to number 5 on TerminalBench 2.0. They didn't change the mo ...
File-Based Agents Don't Need a Build Step
- William Jacob
- Agents, Financial Services
- 11 May, 2026
The investment banking analyst who spends Friday night formatting a pitch deck isn't doing analysis. ...
Agent guardrails without lobotomizing the agent
- William Jacob
- Safety, Agents
- 07 May, 2026
Adding guardrails to an agent is one of those tasks where the easy version is too restrictive and th ...
Evaluating agents when there's no single right answer
- William Jacob
- Evaluation, Agents
- 05 May, 2026
Evaluating a single prompt is hard. Evaluating an agent that runs ten tool calls before answering is ...
When the agent fails: recovery patterns that don't loop forever
- Sam Wilson
- Reliability, Agents
- 04 May, 2026
Agent failures don't throw exceptions. They produce plausible-looking output that's wrong, or quietl ...
Planner-executor splits: when to separate them
- Sam Wilson
- Architecture, Agents
- 03 May, 2026
A single model doing both planning and execution feels elegant on day one. By month three, the trace ...
Tool selection: when the model should pick, and when you should
- William Jacob
- Tools, Agents
- 02 May, 2026
Tool-using agents look powerful in demos because the model is choosing what to do next. They look fr ...