Seedance 2.0 by ByteDance

Generate Videos with Seedance 2.0

Seedance 2.0 is ByteDance's latest AI video model for text-to-video and image-to-video generation with native audio. Each generation produces cinematic motion and synchronized sound in the aspect ratio you choose.

Text to Video

Live-Action Anime Adaptation · Breathing Technique Decisive Battle (15 seconds · Super Burning Special Effects Version)

【Core Focus】: Water Breathing (Blue Water Dragon) VS Thunder Breathing (Golden Lightning), live-action extreme speed duel.
【Style】: Hollywood live-action anime adaptation film quality, dark samurai style, 4K ultra-clear, extreme fast cuts, explosive particle light effects, no gore.
【Duration】: 15 seconds
【Scene】: Misty forest under the moonlight, muddy ground, falling leaves.

[00:00-00:05] Shot 1: Water Melody Prelude · Starting Stance (Sense of charging)
Visuals: A young samurai wearing a green and black checkered haori (jacket), lowering his center of gravity under the moonlight, gripping his sword with both hands.
Action: He takes a deep breath, and the surrounding air instantly solidifies. As he draws his sword, a giant blue water dragon, condensed from high-pressure water flow, appears out of thin air, rotating rapidly around his body and blade, emitting the roar of flowing water.
Special Effects Details: The water flow has a realistic sense of splashing, illuminating the dark forest.

[00:05-00:10] Shot 2: Thunder Flash · Charge (Sense of extreme speed)
Visuals: The opponent, a blonde swordsman wearing a yellow triangular patterned haori, is crouched extremely low, adopting the posture of Iaijutsu (sword drawing technique).
Action: The ground suddenly explodes, and he instantly transforms into a dazzling golden lightning afterimage, refracting and charging through the forest in a "Z" shape at a speed undetectable by the naked eye.
Special Effects Details: Golden electric arcs and scorched fallen leaves remain in the places he passes.

[00:10-00:15] Shot 3: Water and Thunder Collision · Final Sound (Ultimate move clash)
Visuals: Extreme speed collision. The young samurai swings the giant blue water dragon down to meet the attack, and the blonde swordsman, transformed into lightning, crashes into him head-on.
Action: The two swords violently collide in the center of the frame.
Special Effects Spectacle: The blue water dragon and the golden lightning instantly explode, forming a massive water-thunder energy storm that spreads outwards. The surrounding large trees are snapped in half by the energy wave, and mud and light obscure the camera. The scene ends in an extremely dazzling blue, yellow, and white light.
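The example prompt divides a 15-second clip into three timed shots labeled [00:00-00:05], [00:05-00:10], and [00:10-00:15]. If you write multi-shot prompts often, labels like these can be generated rather than typed by hand. The following is only a sketch: the function name `shot_timestamps` is hypothetical, it assumes equal-length shots measured in whole seconds, and it is not part of Seedance 2.0 or Nano Banana.

```python
def shot_timestamps(total_seconds: int, shot_count: int) -> list[str]:
    """Return one "[mm:ss-mm:ss]" label per shot for equal-length shots.

    Hypothetical helper for drafting prompts; assumes the clip length
    divides evenly by the number of shots.
    """
    if total_seconds <= 0 or shot_count <= 0:
        raise ValueError("total_seconds and shot_count must be positive")
    if total_seconds % shot_count != 0:
        raise ValueError("clip length must divide evenly into the shots")

    shot_length = total_seconds // shot_count
    labels = []
    for index in range(shot_count):
        start = index * shot_length
        end = start + shot_length
        # Format both ends of the shot as zero-padded minutes:seconds.
        labels.append(
            f"[{start // 60:02d}:{start % 60:02d}-{end // 60:02d}:{end % 60:02d}]"
        )
    return labels


if __name__ == "__main__":
    for label in shot_timestamps(15, 3):
        print(label)
```

Each label can then be placed in front of a shot description, as in the example prompt.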

What Is Seedance 2.0?

Seedance 2.0 is ByteDance's latest AI video generation model, released in February 2026. It builds on Seedance 1.5 with a unified multimodal audio-video joint generation architecture — meaning the model generates video and synchronized audio together in a single pass, rather than adding audio as a separate step. Key improvements include exceptional motion stability, stronger instruction-following, and support for up to 1080p resolution.

Seedance 2.0 accepts multiple input types simultaneously: text descriptions, reference images, audio clips, and reference video clips. The model uses a dual-branch diffusion transformer architecture capable of generating multi-shot sequential videos with consistent subjects, coherent camera language, and native stereo audio — including natural dialogue, synchronized sound effects, and ambient soundscapes.

On Nano Banana, you can use Seedance 2.0 for text-to-video, image-to-video, and video extension. Output is available at 480p, 720p, or 1080p, with six aspect ratios: 16:9 (landscape), 9:16 (portrait), 1:1 (square), 4:3, 3:4, and 21:9 (ultrawide). Clip length is 4 to 15 seconds.
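For readers who script their generation settings, the options listed above can be captured as a small validation step. This is a minimal sketch: the class name `GenerationSettings` and its field names are hypothetical and do not correspond to any published Nano Banana or Seedance API. Only the allowed values come from the text.

```python
from dataclasses import dataclass

# Allowed values as listed on this page.
RESOLUTIONS = ("480p", "720p", "1080p")
ASPECT_RATIOS = ("16:9", "9:16", "1:1", "4:3", "3:4", "21:9")
MIN_SECONDS = 4
MAX_SECONDS = 15


@dataclass(frozen=True)
class GenerationSettings:
    """Hypothetical container for the choices made before generating."""

    resolution: str
    aspect_ratio: str
    duration_seconds: int

    def __post_init__(self) -> None:
        # Reject any value outside the documented options.
        if self.resolution not in RESOLUTIONS:
            raise ValueError(f"resolution must be one of {RESOLUTIONS}")
        if self.aspect_ratio not in ASPECT_RATIOS:
            raise ValueError(f"aspect_ratio must be one of {ASPECT_RATIOS}")
        if not MIN_SECONDS <= self.duration_seconds <= MAX_SECONDS:
            raise ValueError(
                f"duration_seconds must be between {MIN_SECONDS} and {MAX_SECONDS}"
            )


if __name__ == "__main__":
    # A vertical 1080p clip of 8 seconds passes validation.
    print(GenerationSettings("1080p", "9:16", 8))
```

Validating at construction time means an unsupported combination, such as a 20-second clip, fails before any credits are spent.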

How It Works

Write Your Prompt

Describe the scene, subjects, camera movement, and mood in natural language. For image-to-video, upload your reference image and describe how it should animate.

Choose Resolution & Format

Select resolution (480p, 720p, 1080p), aspect ratio (16:9, 9:16, 1:1, 4:3, 3:4, 21:9), and clip length (4–15 seconds). The credit cost is shown before you generate.

Generate & Download

Seedance 2.0 generates your video with synchronized audio, typically within 1–3 minutes. Preview it and download the MP4 file when you're happy with the result.

What Can You Create?

Seedance 2.0 is well-suited for a range of short-form video needs. Here are the most common use cases:

Social Media Clips

Create short animated content for Instagram Reels, TikTok, and YouTube Shorts in 9:16. A single compelling clip can be generated in minutes without a film crew.

Product & Brand Videos

Animate product shots, generate lifestyle footage, or create atmospheric clips for landing pages and ads. Ideal for visualizing products that don't yet exist physically.

Visual Storytelling

Bring concepts, scripts, or storyboards to life for pitches, presentations, or creative projects. Generate scene-by-scene clips to assemble into a narrative.

Concept & Prototype

Rapidly test visual directions for campaigns or productions before committing budget to a shoot. Generate multiple style variations from the same brief.

Key Capabilities

Exceptional Motion Stability

Subjects maintain their identity and appearance across the full clip. Seedance 2.0 uses a dual-branch diffusion transformer to keep faces, products, and environments coherent from frame to frame — even in multi-shot sequences.

Director-Level Camera Control

Specify camera movements — pan, zoom, dolly, orbit, handheld — directly in your prompt. Seedance 2.0 understands professional cinematography language and applies it to the generated footage.

Multimodal Input: Image, Audio & Video

Combine multiple inputs in one generation: up to 9 reference images, 3 audio clips, and 3 video clips alongside your text prompt. The model references visual composition, motion rhythm, and sound characteristics from all inputs simultaneously.
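The input limits above (9 images, 3 audio clips, 3 video clips) are easy to check before uploading. The sketch below is illustrative only: `check_reference_inputs` is a hypothetical helper, and the inputs are assumed to be plain lists of file names. It does not call any real upload API.

```python
from typing import Sequence

# Limits stated in the text above.
MAX_REFERENCE_IMAGES = 9
MAX_AUDIO_CLIPS = 3
MAX_VIDEO_CLIPS = 3


def check_reference_inputs(
    images: Sequence[str],
    audio_clips: Sequence[str],
    video_clips: Sequence[str],
) -> list[str]:
    """Return a list of problems; an empty list means all counts are in range."""
    problems = []
    checks = (
        ("reference images", images, MAX_REFERENCE_IMAGES),
        ("audio clips", audio_clips, MAX_AUDIO_CLIPS),
        ("video clips", video_clips, MAX_VIDEO_CLIPS),
    )
    for label, items, limit in checks:
        if len(items) > limit:
            problems.append(f"too many {label}: {len(items)} given, limit is {limit}")
    return problems


if __name__ == "__main__":
    # Ten images exceeds the limit of nine, so one problem is reported.
    print(check_reference_inputs([f"ref_{i}.png" for i in range(10)], [], []))
```

Returning every problem at once, instead of raising on the first, lets a user fix all the counts in one pass.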

Native Audio-Video Joint Generation

Seedance 2.0 generates two-channel stereo audio alongside the video in a single pass, including natural dialogue, synchronized sound effects, and immersive ambient soundscapes. Output is available at 480p, 720p, or 1080p.

Tips for Best Results

1. Describe motion explicitly: 'a woman walking slowly through a sunlit park' outperforms 'a woman in a park'. The model needs motion cues to animate meaningfully.
2. Use cinematography terms: 'slow push-in', 'pan right', 'overhead drone shot', 'handheld close-up'. These guide camera behavior more precisely than general scene descriptions.
3. To trigger native audio generation, mention sound in your prompt: 'birds chirping in the background', 'the character speaks clearly saying hello', 'rain hitting the pavement'. Explicit audio cues produce much better sound output.
4. For image-to-video, choose source images with clear subjects and uncluttered backgrounds. The model animates what it can identify, so busy or ambiguous images produce inconsistent motion.
5. Specify lighting and time of day: 'golden hour', 'overcast midday', 'neon-lit night scene'. Lighting context helps the model render shadows and atmosphere more accurately across frames.
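One way to apply these tips consistently is to assemble each prompt from separate motion, camera, lighting, and audio fragments. The helper below is a minimal sketch: `build_prompt` and its parameter names are hypothetical. Joining the fragments as short sentences is an assumption about prompt style, not a format Seedance 2.0 requires.

```python
from typing import Optional


def _as_sentence(fragment: str) -> str:
    """Trim a fragment, capitalize its first letter, and end it with a period."""
    cleaned = fragment.strip().rstrip(".")
    return cleaned[:1].upper() + cleaned[1:] + "."


def build_prompt(
    subject_and_motion: str,
    camera: Optional[str] = None,
    lighting: Optional[str] = None,
    audio: Optional[str] = None,
) -> str:
    """Combine the cues from the tips above into one prompt string."""
    if not subject_and_motion.strip():
        raise ValueError("describe the subject and its motion")
    fragments = [subject_and_motion, camera, lighting, audio]
    # Skip cues that were not supplied or contain only whitespace.
    sentences = [_as_sentence(f) for f in fragments if f and f.strip()]
    return " ".join(sentences)


if __name__ == "__main__":
    print(
        build_prompt(
            "a woman walking slowly through a sunlit park",
            camera="slow push-in",
            lighting="golden hour",
            audio="birds chirping in the background",
        )
    )
    # prints: A woman walking slowly through a sunlit park. Slow push-in.
    #         Golden hour. Birds chirping in the background.  (on one line)
```

Keeping the cues separate makes it easy to vary one of them, such as the camera move, while holding the rest of the prompt fixed.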

Frequently Asked Questions

What is Seedance 2.0?

Seedance 2.0 is ByteDance's latest AI video generation model, released in February 2026. It uses a unified multimodal audio-video joint generation architecture — generating video and synchronized stereo audio together in a single pass. Key improvements over Seedance 1.5 include exceptional motion stability, stronger prompt following, and support for up to 1080p resolution.

How long are the generated videos?

Seedance 2.0 supports clips from 4 to 15 seconds. Videos include synchronized native audio generated alongside the video in the same pass. Output is available at 480p, 720p, or 1080p resolution.

What is the difference between text-to-video and image-to-video?

Text-to-video generates a completely new video clip from your text description — the model invents the visual content. Image-to-video starts from a still image you provide and animates it according to your instructions. You can also combine multiple inputs: up to 9 reference images, 3 audio clips, and 3 video reference clips alongside your text prompt.

How many credits does video generation cost?

The credit cost of a Seedance 2.0 generation depends on resolution and duration. The rate is 8 credits/second at 480p, 16 credits/second at 720p, and 48 credits/second at 1080p. For example, a 5-second clip at 720p costs 5 × 16 = 80 credits. The exact cost is shown in the tool interface before you generate.
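The per-second rates above make it simple to estimate a budget ahead of time. The sketch below assumes the cost is the rate multiplied by the duration in whole seconds, with no rounding, minimum charge, or discounts. The function name `estimate_credits` is hypothetical, and the cost shown in the tool interface remains the authoritative figure.

```python
# Credits per second of output, as listed in the answer above.
CREDITS_PER_SECOND = {"480p": 8, "720p": 16, "1080p": 48}

MIN_SECONDS = 4
MAX_SECONDS = 15


def estimate_credits(resolution: str, duration_seconds: int) -> int:
    """Estimate credits as rate x duration (assumed linear pricing)."""
    if resolution not in CREDITS_PER_SECOND:
        raise ValueError(f"unknown resolution: {resolution!r}")
    if not MIN_SECONDS <= duration_seconds <= MAX_SECONDS:
        raise ValueError(
            f"duration must be between {MIN_SECONDS} and {MAX_SECONDS} seconds"
        )
    return CREDITS_PER_SECOND[resolution] * duration_seconds


if __name__ == "__main__":
    print(estimate_credits("720p", 5))    # prints 80, matching the example above
    print(estimate_credits("1080p", 15))  # prints 720
```

Under these assumptions, the cheapest possible clip (4 seconds at 480p) costs 32 credits and the most expensive (15 seconds at 1080p) costs 720.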

Can I use Seedance 2.0 videos commercially?

Yes. Videos generated through our platform can be used for commercial purposes including advertising, social media, product demos, and client work. Always review your client contracts and platform-specific rules for AI-generated content when publishing.