Agents
The Agent Harness: Why Your Model Isn't the Problem
- William Jacob
- Agents , Architecture
- 11 May, 2026
LangChain jumped from outside the top 30 to number 5 on TerminalBench 2.0. They didn't change the mo ...
File-Based Agents Don't Need a Build Step
- William Jacob
- Agents , Financial Services
- 11 May, 2026
The investment banking analyst who spends Friday night formatting a pitch deck isn't doing analysis. ...
How autonomous is too autonomous
- John Doe
- Architecture , Autonomy
- 06 May, 2026
Autonomy in agents is a slider, not a switch, and the right setting depends on the task more than th ...
Evaluating agents when there's no single right answer
- William Jacob
- Evaluation , Agents
- 05 May, 2026
Evaluating a single prompt is hard. Evaluating an agent that runs ten tool calls before answering is ...
Agent memory: episodic, semantic, and what to keep
- Sam Wilson
- Architecture , Memory
- 05 May, 2026
The first agent you build has no memory beyond the current conversation, and that works for about a ...
Memory strategies for long-running agents
- Jane Doe
- Memory , Reliability
- 04 May, 2026
Long-running agents accumulate context. The job of memory design is to decide which slices of that c ...
When the agent fails: recovery patterns that don't loop forever
- Sam Wilson
- Reliability , Agents
- 04 May, 2026
Agent failures don't throw exceptions. They produce plausible-looking output that's wrong, or quietl ...
Designing an agent harness that doesn't fight the model
- John Doe
- Harness , Architecture
- 02 May, 2026
Lorem ipsum dolor sit amet consectetur adipisicing elit. The harness around an agent matters more th ...
Tool selection: when the model should pick, and when you should
- William Jacob
- Tools , Agents
- 02 May, 2026
Tool-using agents look powerful in demos because the model is choosing what to do next. They look fr ...