Skip to content

Step 1 - Init

The Init step is where you configure the work and, for style transfer, upload the reference video.

Step 1

FieldDescription
SourceReference Video — upload a video to analyze; Start from Scratch — generate from text only.
Modestyle_transfer — replicate reference video style; content_remix — reinterpret content. Only shown when a reference video is selected.
Product nameSubject of the video (e.g. product or topic name).
Product descriptionOptional short description or video idea.
Target audienceWho the video is for.
ResolutionOutput resolution for image/video generation. See presets below.
Language(Optional) Output language (e.g. en, tr).
Video duration(Optional) Desired total duration in seconds.
Scene count(Optional) Number of scenes to generate.
Reference video(Source: Reference Video only) Upload the video to analyze.
Project assets(Optional) Select assets to use in video generation.
Asset usage modeHow to use selected assets: reference or direct.

The resolution selector offers three presets that control the output dimensions for all image and video generation in this work:

PresetWidth × HeightIconDescription
Portrait 9:161080 × 1920📱Default. Vertical format for mobile / social.
Landscape 16:91920 × 1080🖥️Horizontal widescreen format.
Square 1:11080 × 1080Square format.

Choosing a resolution affects:

  • DALL-E 3 — selects the matching size parameter (1024x1792, 1792x1024, or 1024x1024).
  • Runway — adds a ratio field ("768:1280" or "1280:768"). Square is omitted (Runway does not support 1:1).
  • Gemini Veo — adds an aspectRatio field ("9:16", "16:9", or "1:1").
  • Gemini Image — no effect (provider has no resolution parameter).
  • Editor export settingsexportSettings.width and exportSettings.height are initialized from the chosen resolution when opening the editor for the first time.
  • For style_transfer, upload a reference video in this step. It is stored under the work and used in Step 2: Analyze.
  • For content_remix, you can leave this empty; the Analyze step will be skipped unless you add a reference later.

You can select images from your project assets to use in video generation. When assets are selected, choose an asset usage mode:

ModeDescription
ReferenceAI analyzes your assets and generates similar-looking images. Best for style consistency. Assets should have descriptions (analyze them first in Project Settings).
DirectSkip AI image generation entirely. Use your actual asset images and only generate video (animation). Best for product photos, screenshots. Scene count auto-adjusts to match asset count.

This is useful for:

  • Product demo videos using actual product photos (direct mode)
  • Brand-consistent content using logo/style reference (reference mode)
  • Screenshot-based tutorials (direct mode)
  • All form fields are saved on the work (mode, productName, productDescription, selectedAssetIds, assetUsageMode, resolution, etc.).
  • The reference video file is stored via POST /api/projects/:id/works/:id/media/reference.

Go to Step 2: Analyze to run vision AI on the reference video (or skip to Step 3: Scenario if you have no reference).

See Pipeline Overview and Step 2: Analyze.