Reliability

Self-consistency sampling: cheap reliability when you need the right answer

Self-consistency sampling sounds like the kind of thing a researcher proposes and a production engin ...

Forcing structured output without breaking the model

JSON mode and schema constraints look like a free win until they aren't. The first time the model pr ...

Tool use patterns that survive context decay

Tool use looks easy in a one-shot example and hard once the conversation grows past a few thousand t ...