← LibraryCollection
The response
How the model performs the answer.
Streaming, thinking, stopping, formatting, citing, hedging. The response is a performance, and the pacing of that performance is half the product.
12 patterns
- 11 minThe streaming cadenceWhy your token rate is a design decision, not a backend one.
- 8 minThe thinking indicatorEverything you put on screen between enter and the first token.
- 10 minThe citation footerWhere the sources go, and how you know they back the claim.
- 9 minThe confidence gradientShowing uncertainty in typography, not in hedging prose.
- 8 minThe partial answerWhat the model does when it only half-knows.
- 8 minThe regenerate affordanceAsking for another take, without losing the first one.
- 10 minThe long-form spineWhat to do when the answer is longer than the screen.
- 9 minThe table responseWhen the right shape for the answer is rows and columns.
- 10 minThe ambiguity promptAsking a clarifying question without breaking flow.
- 8 minThe error surfaceWhen the model fails, don't pretend it didn't.
- 7 minThe empty-state answerResponding when there's genuinely nothing to say.
- 9 minThe Socratic checkWhen the model asks before answering.
Other collections
- The prompt surfaceHow the box asks for intent.→
- Agentic flowsHow the model shows its work, in the world.→
- Memory & contextWhat the model carries with it.→
- Trust & evidenceHow the model earns the reader's belief.→
- Multimodal & artifactsWhen chat isn't enough.→
- CollaborationWhen the session has more than one human in it.→
- OrchestrationHow many agents, at what cadence, inside what ceiling.→
- Dev & evalThe surfaces that make AI products debuggable.→