Multimodal & artifacts

When chat isn't enough.

A voice note, a canvas, a screenshot, a PDF that edits itself. When the model stops speaking and starts making, the interface has to track two surfaces: the conversation, and the thing being built.

7 patterns

The voice composer
Dictating a prompt without losing the option to edit.
10 min
Image annotation
Pointing at a screenshot without leaving the chat.
9 min
The canvas artifact
Giving the model a second surface to build on.
12 min
Artifact playback
Scrubbing the history of a generated artifact.
10 min
Screen context
Letting the model see what you see, with consent.
11 min
Read-aloud
Text-to-speech that you can follow with your eyes.
9 min
Cross-app handoff
Passing an artifact to the tool it wants to live in.
10 min

Other collections

The prompt surfaceHow the box asks for intent.
→
The responseHow the model performs the answer.
→
Agentic flowsHow the model shows its work, in the world.
→
Memory & contextWhat the model carries with it.
→
Trust & evidenceHow the model earns the reader's belief.
→
CollaborationWhen the session has more than one human in it.
→
OrchestrationHow many agents, at what cadence, inside what ceiling.
→
Dev & evalThe surfaces that make AI products debuggable.
→