Skip to content

Step 3 - Scenario

The Scenario step generates a scene-by-scene script for the video using the LLM.

Step 3

  1. Click Generate scenario (or equivalent). The app sends analysis (if any), intent (mode, product, duration, scene count), optional system prompt, knowledge, and project style guide to POST /api/scenario.
  2. The LLM returns an array of ScenarioScene objects: for each scene, image_prompt, negative_prompt, video_prompt, duration_seconds, text_overlay, transition.
  3. You can edit any scene’s text in the UI before moving to generation.
InputDescription
AnalysisFrom Step 2 (reference video analysis). Optional.
Intentmode, product_name, product_description, target_audience, language, video_duration, scene_count.
System promptOverride for the scenario LLM (default from project).
KnowledgeExtra context (from project or work).
Style guideProject style guide; when present, appended as “Project Style Guide” in the system prompt.

Each scene has:

  • scene_index, duration_seconds
  • image_prompt — used for image generation
  • negative_prompt — what to exclude from the image
  • video_prompt — motion/camera for image-to-video
  • text_overlay — optional on-screen text
  • transition — e.g. cut, fade, dissolve

The default scenario system prompt (see Scenario Prompt) instructs the LLM to produce photorealistic content unless the style guide asks for something else. Image prompts are written like “photo of…”, “shot on…”; video prompts describe realistic motion.

In Step 4: Generate you generate an image for each scene from image_prompt, then a video from that image using video_prompt.

See Step 4: Generate and System Prompts: Scenario.