Reliability

Tool use patterns that survive context decay

Tool use looks easy in a one-shot example and hard once the conversation grows past a few thousand t ...