Live capabilities

These capabilities come from the current model config and backend route. Unreleased APIs are not listed.

4 specialized modes in one model

Lipsync from audio input

Video control with pose/depth/canny

Up to 1080p and 20 seconds

Request format

Clients call same-origin API routes; the server-side BFF (backend-for-frontend) forwards each request to the matching Worker.

Method: POST

Endpoint: /api/v1/videos/generations

Credits: 24

Purpose: LTX-2 19B, an advanced AI video generation model with text-to-video, image-to-video, lipsync, and control modes. Up to 1080p, 5–20 seconds.

Parameters

prompt · string · Optional
Text description (required for text-to-video and image-to-video)

type · string · Optional
Mode: "text-to-video", "image-to-video", "lipsync", or "control"

image · string · Optional
Image URL for image-to-video mode

audio · string · Optional
Audio URL for lipsync mode

audio_duration · number · Optional
Audio duration in seconds for lipsync (5–20)

video · string · Optional
Video URL for control mode

video_duration · number · Optional
Video duration in seconds for control mode (5–20)

mode · string · Optional
Control mode: "pose", "depth", or "canny"

audio_mode · string · Optional
Audio mode for control: "preserve", "generate", or "none"

resolution · string · Optional · Default 720p
Output resolution: "480p", "720p", or "1080p"

aspect_ratio · string · Optional · Default 16:9
Aspect ratio: "16:9" or "9:16" (text-to-video only)

duration · number · Optional · Default 5
Video duration in seconds (5–20)

seed · number · Optional · Default -1
Random seed for reproducibility (-1 for random)
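
The mode-specific rules above can be checked on the client before a request is sent. The following is a minimal Python sketch, not part of the LTX-2 API. The function name `validate_payload` is hypothetical. Treating `image`, `audio`, `video`, and `mode` as required for their respective modes is an assumption drawn from the parameter descriptions, since every parameter is marked Optional. The default for `type` is not documented, so payloads that omit it skip the mode-specific checks.

```python
MODES = {"text-to-video", "image-to-video", "lipsync", "control"}
CONTROL_MODES = {"pose", "depth", "canny"}
AUDIO_MODES = {"preserve", "generate", "none"}
RESOLUTIONS = {"480p", "720p", "1080p"}
ASPECT_RATIOS = {"16:9", "9:16"}
DURATION_FIELDS = ("duration", "audio_duration", "video_duration")


def validate_payload(payload: dict) -> list[str]:
    """Return a list of problems found in a request body (empty if none)."""
    errors = []
    kind = payload.get("type")

    # Enumerated values listed in the parameter descriptions.
    for field, allowed in (
        ("type", MODES),
        ("mode", CONTROL_MODES),
        ("audio_mode", AUDIO_MODES),
        ("resolution", RESOLUTIONS),
        ("aspect_ratio", ASPECT_RATIOS),
    ):
        if field in payload and payload[field] not in allowed:
            errors.append(f"{field} must be one of {sorted(allowed)}")

    # All three duration fields are documented with a 5-20 second range.
    for field in DURATION_FIELDS:
        if field in payload and not 5 <= payload[field] <= 20:
            errors.append(f"{field} must be between 5 and 20 seconds")

    # Assumption: each mode needs the input its description refers to.
    required = {
        "text-to-video": ["prompt"],
        "image-to-video": ["prompt", "image"],
        "lipsync": ["audio"],
        "control": ["video", "mode"],
    }
    for field in required.get(kind, []):
        if not payload.get(field):
            errors.append(f"{field} is required when type is {kind!r}")

    return errors
```

An empty list means the payload passed these local checks only; the server may still apply rules that this page does not describe.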

Example request

{
  "endpoint": "/api/v1/videos/generations",
  "headers": {
    "Authorization": "Bearer <API_KEY>",
    "Content-Type": "application/json"
  },
  "body": {
    "model": "ltx-2",
    "prompt": "A cat playing piano in a jazz bar, cinematic lighting",
    "resolution": "720p",
    "aspect_ratio": "16:9",
    "duration": 5
  }
}
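
As a rough illustration, the example above could be prepared from Python with the standard library alone. This is a sketch under assumptions: `BASE_URL` is a placeholder host (this page gives only the path), and `build_generation_request` is a hypothetical helper. The response format is not documented here, so the sketch stops at constructing the request.

```python
import json
import urllib.request

BASE_URL = "https://api.example.com"  # placeholder; the real host is not given here


def build_generation_request(api_key: str, body: dict) -> urllib.request.Request:
    """Build (but do not send) a POST to the video generation endpoint."""
    return urllib.request.Request(
        url=BASE_URL + "/api/v1/videos/generations",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


request = build_generation_request(
    "<API_KEY>",
    {
        "model": "ltx-2",
        "prompt": "A cat playing piano in a jazz bar, cinematic lighting",
        "resolution": "720p",
        "aspect_ratio": "16:9",
        "duration": 5,
    },
)
# To send it: urllib.request.urlopen(request)
```

Keeping construction separate from sending makes the request easy to inspect or log before any credits are spent.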

Task status route

After creating an image, video, audio, or tool task, poll the task endpoint for its result.

GET /api/v1/tasks/{task_id}: Check generation task status and result.
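
A polling loop for this route might look like the sketch below. The response schema is not documented on this page, so the `status` field and its `"succeeded"` and `"failed"` values are assumptions, as are the function name `poll_task` and the default interval. The HTTP call is passed in as `fetch_task` so that the loop makes no claims about transport or authentication.

```python
import time
from typing import Callable

# Assumption: terminal values of a hypothetical "status" field.
TERMINAL_STATUSES = {"succeeded", "failed"}


def poll_task(
    fetch_task: Callable[[str], dict],
    task_id: str,
    interval: float = 2.0,
    max_attempts: int = 60,
) -> dict:
    """Call fetch_task(task_id) until the task reaches a terminal status.

    fetch_task is expected to perform GET /api/v1/tasks/{task_id}
    and return the decoded JSON body.
    """
    for attempt in range(max_attempts):
        task = fetch_task(task_id)
        if task.get("status") in TERMINAL_STATUSES:
            return task
        # Wait between polls, but not after the final attempt.
        if attempt < max_attempts - 1:
            time.sleep(interval)
    raise TimeoutError(f"task {task_id} did not finish after {max_attempts} polls")
```

Bounding the number of attempts keeps a stuck task from blocking the caller indefinitely.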

Pricing

20–512 credits per video (~$0.20–$5.12)

Base cost: $0.24

Credits: 24

Billing: Successful requests
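
The figures above imply a rate of $0.01 per credit (24 credits for $0.24, and 20–512 credits for roughly $0.20–$5.12). Below is a small sketch of that arithmetic. The helper name is hypothetical, and the rate is inferred from this page rather than stated by it.

```python
from decimal import Decimal

USD_PER_CREDIT = Decimal("0.01")  # inferred: 24 credits = $0.24


def credits_to_usd(credits: int) -> Decimal:
    """Convert a credit amount to US dollars at the inferred rate."""
    return credits * USD_PER_CREDIT


print(credits_to_usd(24))   # 0.24
print(credits_to_usd(512))  # 5.12
```

`Decimal` is used instead of `float` so that currency amounts stay exact.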

Use cases

These workflows are supported by the current model and backend node.

01 Generate high-quality videos from text prompts

02 Animate images into video with image-to-video mode

03 Create lipsync videos by combining audio with AI video

04 Apply pose/depth/edge control to existing video

FAQ

How much does LTX-2 cost?

Credits depend on mode, resolution, and duration. Text/image-to-video: 20–76 credits. Lipsync: 32–256 credits. Control: 64–512 credits.
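
Because the exact price depends on resolution and duration, a client can at least budget against the upper bound for a mode. The sketch below encodes only the ranges quoted in this answer. `CREDIT_RANGES` and `fits_budget` are hypothetical names. The per-resolution and per-duration formula is not published here, so none is assumed.

```python
CREDIT_RANGES = {
    "text-to-video": (20, 76),
    "image-to-video": (20, 76),
    "lipsync": (32, 256),
    "control": (64, 512),
}


def fits_budget(mode: str, available_credits: int) -> bool:
    """True if the balance covers the most expensive request in this mode."""
    _lowest, highest = CREDIT_RANGES[mode]
    return available_credits >= highest
```

For example, `fits_budget("lipsync", 100)` is false because a lipsync request can cost up to 256 credits.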

What modes does LTX-2 support?

Four modes: text-to-video, image-to-video, lipsync (audio+video), and control (pose/depth/canny). Specify with the "type" parameter.

What resolutions are supported?

480p, 720p, and 1080p. Higher resolutions cost more credits.
