Search
Kling 3.0 Standard: Top-tier text-to-video with cinematic visuals, fluid motion, and native audio generation, with multi-shot support.
Required text prompt for video generation.
Optional start frame image. When provided, Kling runs in image-to-video mode.
Optional end frame image. Requires Start Image.
Describe content or qualities you want the model to avoid.
The Kling 3.0 tier to use for generation.
Aspect ratio of the generated video frames.
Duration of the generated video in seconds (3 to 15).
Controls how closely output follows your prompt (0.0 to 1.0).
Generate native audio for the video.
Generated output video URL.