
Multimodal & artifacts

When chat isn't enough.

A voice note, a canvas, a screenshot, a PDF that edits itself. When the model stops speaking and starts making, the interface has to track two surfaces: the conversation, and the thing being built.

7 patterns
  1. The voice composer
    Dictating a prompt without losing the option to edit.
    10 min
  2. Image annotation
    Pointing at a screenshot without leaving the chat.
    9 min
  3. The canvas artifact
    Giving the model a second surface to build on.
    12 min
  4. Artifact playback
    Scrubbing the history of a generated artifact.
    10 min
  5. Screen context
    Letting the model see what you see, with consent.
    11 min
  6. Read-aloud
    Text-to-speech that you can follow with your eyes.
    9 min
  7. Cross-app handoff
    Passing an artifact to the tool it wants to live in.
    10 min