The response

How the model performs the answer.

Streaming, thinking, stopping, formatting, citing, hedging. The response is a performance, and the pacing of that performance is half the product.

12 patterns
