The response

How the model performs the answer.

Streaming, thinking, stopping, formatting, citing, hedging. The response is a performance, and the pacing of that performance is half the product.

12 patterns
  1. The streaming cadence
    Why your token rate is a design decision, not a backend one.
    11 min
  2. The thinking indicator
    Everything you put on screen between enter and the first token.
    8 min
  3. The citation footer
    Where the sources go, and how you know they back the claim.
    10 min
  4. The confidence gradient
    Showing uncertainty in typography, not in hedging prose.
    9 min
  5. The partial answer
    What the model does when it only half-knows.
    8 min
  6. The regenerate affordance
    Asking for another take, without losing the first one.
    8 min
  7. The long-form spine
    What to do when the answer is longer than the screen.
    10 min
  8. The table response
    When the right shape for the answer is rows and columns.
    9 min
  9. The ambiguity prompt
    Asking a clarifying question without breaking flow.
    10 min
  10. The error surface
    When the model fails, don't pretend it didn't.
    8 min
  11. The empty-state answer
    Responding when there's genuinely nothing to say.
    7 min
  12. The Socratic check
    When the model asks before answering.
    9 min