JSON to Video Guide

When to use it

Use a JSON to video workflow when:

  • The video output is derived from structured data
  • You need multiple variations from the same template logic
  • A product, agent, or automation system needs to trigger video creation
  • You want previews before final output
  • You need asynchronous render tasks and delivery callbacks

If the workflow is mostly one human editing one video at a time, a manual editor may be simpler. If the workflow needs repeatability and software integration, JSON is usually the better control surface.

  1. Generate or assemble the video schema
  2. Validate the schema structure and asset references
  3. Create a preview
  4. Review or refine the schema
  5. Start the final render task
  6. Poll task status or receive a webhook
  7. Store or deliver the final asset

Implementation checklist

  • Define the video schema shape your product will generate
  • Decide how assets are uploaded or referenced
  • Validate required fields before rendering
  • Generate previews for user or system review
  • Store task IDs and status transitions
  • Handle failed renders and retries
  • Deliver final assets through polling or webhooks
  • Keep reusable templates separate from runtime data

Minimum schema surface

A production schema usually needs:

  • meta for canvas settings and video metadata
  • assets for images, videos, audio, and fonts
  • tracks for timeline structure
  • clips for text, media, shapes, subtitles, and layouts
  • Timing and animation rules
  • Output and task lifecycle information

For the exact field-level rules, see the JSON Structure and Field Rules reference.

Preview, render, and delivery

  • Preview: generate a fast draft to inspect layout and copy
  • Render: create the final output asset
  • Delivery: expose the result through polling, a task lookup, or a webhook

For API-specific details, continue to API and Usage.