What is Seedance 1.0 Lite?
Seedance 1.0 Lite is the small‑parameter variant of the Seedance 1.0 video generation models released by ByteDance Seed and offered via BytePlus ModelArk. The official documentation positions Lite as a high‑speed, cost‑effective option that still delivers strong video quality and flexible control over camera movement, scene composition, and multi‑subject action parsing. In other words, Lite is meant for rapid iteration and high‑volume generation rather than maximum cinematic fidelity.
The model supports both text‑to‑video and image‑to‑video workflows. This means you can generate clips entirely from a text description, or you can animate a still image by providing a prompt that describes the motion you want. For product teams, social media creators, and fast‑paced concepting, Lite’s speed‑first design can be more valuable than the marginal quality gains of heavier models.
Official capabilities highlighted by ModelArk
The ModelArk documentation lists a broad capability set for Seedance 1.0 Lite despite its smaller parameter size. The key themes are semantic understanding, smooth motion, and practical camera control. The following points summarize the official emphasis areas:
- Deep semantic understanding for detailed character, action, and scene descriptions.
- Strong multi‑subject action parsing and support for complex prompts.
- Smooth first‑to‑last frame transitions for guided image‑to‑video generation.
- Professional camera movement options such as orbit, pan, follow, zoom, and handheld.
- Rich, natural style coverage (animation, watercolor, ink wash, voxel, and more).
- Multiple supported resolutions with 24fps output and MP4 video format.
The documentation also calls out text embedding response accuracy and sensitivity to degree adverbs, which suggests that Lite pays attention to intensity modifiers like “slightly,” “dramatic,” or “subtle,” and can render embedded text in the video scene.
Official specification snapshot
The official ModelArk page provides a compact set of baseline specs. These are the values most users care about when deciding if Lite meets their workflow requirements.
| Parameter | Official value |
|---|---|
| Model name | Seedance 1.0 Lite |
| Model ID (ModelArk) | seedance-1-0-lite-t2v-250428 / seedance-1-0-lite-i2v-250428 |
| Input | Text or image |
| Output | MP4 video |
| Resolution | 480p, 720p, 1080p |
| Frame rate | 24 fps |
| Duration | 2–12 seconds |
| Aspect ratios | 16:9, 9:16, 3:4, 4:3, 1:1 |
Input modes and control options
Seedance 1.0 Lite supports both text‑to‑video and image‑to‑video generation. For I2V, the official documentation describes first‑frame and last‑frame control, which lets you specify the start and end images and ask the model to generate a smooth transition between them. This is useful for motion interpolation, before/after transitions, and short narrative clips where you already have key frames.
The same documentation includes references to multi‑reference image input (1–4 images) for guidance. In practice, this can help you preserve character identity, object style, or background mood. However, the overview section also states that “reference image scenarios are not supported.” Because the page contains both statements, it is best to confirm the currently enabled feature set in the provider console for your account and model version.
Camera movement and cinematic language
The ModelArk examples emphasize professional camera movement. Lite can handle orbit shots, pans, tracking shots, zooms, handheld motion, and aerial movements. This makes it more useful than simple static or sliding transitions: you can describe camera direction in the prompt and expect smooth, cinematic motion.
If you want the model to focus attention, combine camera movement with explicit framing guidance, such as “close‑up,” “wide shot,” or “tracking shot.” This improves predictability, especially for multi‑subject scenes or layered environments.
Style coverage and rendering range
The official documentation lists a broad style palette. Text‑to‑video can directly generate different styles, and image‑to‑video can follow the style of the provided image. Examples include Chinese animation, ink wash, watercolor, Japanese anime, American comics, paper cutting, voxel, felt, and line art. This makes Lite suitable for both realistic and illustrative workflows.
When you need consistent art direction, it is often better to anchor the model with a reference image rather than only a text prompt. This is especially helpful for brand design or character‑driven content.
Prompting for multi‑step actions
The official prompt guide suggests listing actions in chronological order when you need multi‑step motion. A simple structure like “subject + action 1 + action 2” helps the model follow the sequence. This also works for multi‑subject scenarios: specify the first subject and its action, then the second subject and its action, in the order they should appear.
ModelArk’s prompt guide also shows that resolution and duration can be specified in prompt parameters, which gives you lightweight control when you are not setting those values in API parameters. For teams running quick experiments, this can reduce configuration overhead and keep prompts self‑contained.
A practical prompt template for Lite
Lite is most reliable when prompts are structured and concise. A practical template is to describe the subject, the action, the environment, and the camera, in that order. For example, “a cyclist in a rain‑soaked city street, riding past neon storefronts, low‑angle tracking shot, soft reflections on wet asphalt.” This layout reduces ambiguity and helps the model prioritize motion and composition.
If you want a specific visual style, add the style tag at the end rather than the beginning. That gives the model a clear semantic core first, then a stylistic framing. In practice, prompts that list style first can degrade motion quality, especially when the scene includes multiple subjects or fine‑grained action.
For multi‑subject scenes, assign each subject a single action and avoid chaining too many verbs. Lite is optimized for speed, so a “focused action plan” usually yields smoother motion than an encyclopedic description of the scene.
Image‑to‑video workflow patterns
When you start from a still image, use the prompt to describe motion rather than restating the entire visual scene. The image already anchors subject, lighting, and style. The prompt’s job is to specify movement, camera direction, and the desired pacing of the clip. This often leads to more coherent motion than repeating the full visual description.
If you are using first‑frame and last‑frame control, make sure the two key frames are visually consistent. Large style changes between the first and last frame can introduce flicker or discontinuity in the intermediate frames. A consistent style in both frames usually results in smoother interpolation.
For product shots, keep the background simple and describe the motion explicitly: “slow 20‑degree rotation” or “gentle forward dolly.” This approach helps Lite focus on the product’s movement rather than inventing additional scene elements.
Quality guardrails for a lightweight model
Lite’s speed comes with tradeoffs. It may struggle with very dense prompts, excessive object counts, or highly technical physical interactions. When prompts become too complex, the model can drift from the requested motion or lose fine details. The easiest fix is to split complex scenes into multiple short clips, each with a clear subject and action.
Another common issue is identity drift across frames, especially for stylized or illustrative characters. If character consistency is critical, lean on image‑to‑video with a stable reference image rather than text‑only prompts. This keeps key visual attributes fixed while still allowing motion in the scene.
For high‑volume generation, maintain consistent prompt structure and aspect ratio settings across a batch. This makes it easier to compare outputs and select candidates for refinement in Pro‑tier models.
Official billing model (ModelArk)
ModelArk bills video generation using a token‑equivalent system. The pricing page states that video token consumption is approximately proportional to width × height × frame rate × duration. It also notes that only successful generations are billed, and that the actual consumption is returned in the API response usage field. This means your effective cost is tied to resolution and duration more than prompt length.
The pricing table lists Seedance 1.0 Lite at 1.8 USD per million tokens for online inference, with a lower price for offline inference. These numbers are provider reference values; your actual billing can differ depending on platform and packaging.
Model versions and rate limits
ModelArk lists two model IDs for Seedance 1.0 Lite: one for text‑to‑video and one for image‑to‑video. The model list also reports default RPM limits and concurrency limits per model version. These limits are important for batch pipelines or production queues.
| Model version | Workflow | Default RPM | Concurrency |
|---|---|---|---|
| seedance-1-0-lite-t2v-250428 | Text-to-Video | 300 | 5 |
| seedance-1-0-lite-i2v-250428 | Image-to-Video (first/last frame, reference images) | 300 | 5 |
How Seedance 1.0 Lite compares to Pro and 1.5 Pro
The ModelArk model list positions Seedance 1.0 Lite as the lightweight option, while Seedance 1.0 Pro targets higher‑quality generation and Seedance 1.5 Pro adds audio‑visual synchronization. If your primary constraint is throughput, Lite is usually the right starting point. If you need higher production value or audio‑synced video, the Pro tiers are better aligned.
| Model | Audio | Durations | Resolution | Rate limit (RPM) |
|---|---|---|---|---|
| Seedance 1.5 Pro | Audio‑visual sync | 4–12s | 480p, 720p, 1080p | 600 |
| Seedance 1.0 Pro | Silent | 2–12s | 480p, 720p, 1080p | 600 |
| Seedance 1.0 Lite | Silent | 2–12s | 480p, 720p, 1080p | 300 |
If you want the highest throughput for rapid testing, Lite is the easiest entry point. If you need audio‑synchronized video or cinematic quality, the official ModelArk listing indicates that Seedance 1.5 Pro is the intended upgrade path.
Practical use cases
Seedance 1.0 Lite is best for fast content loops: social media snippets, ad concept tests, layout previews, and motion storyboards. Teams can use Lite as a first pass to generate many variations, then re‑render the most promising concepts with a higher‑fidelity model if needed.
It is also effective for prototype‑first workflows. When you need to validate timing and motion before investing in production assets, Lite offers a rapid way to visualize the idea. This staged approach keeps creative iteration fast while preserving a clear upgrade path.
Another strong use case is design exploration. You can iterate on camera language and pacing before committing to a full video pipeline. For example, teams often test multiple camera directions with the same prompt to decide whether a scene feels more dynamic with a tracking shot, a slow pan, or a static frame. Lite is fast enough to make this a practical creative loop.
Lite is also useful for localization experiments. If you need the same scene in multiple languages or product variants, a lightweight model lets you generate quick drafts for each market before selecting the final version. This is especially helpful when you are testing which visual direction resonates most strongly with different audiences.
FAQ
Does Seedance 1.0 Lite support both text and image inputs?
Yes. The official ModelArk documentation lists both text and image as supported inputs.
What resolutions and durations are supported?
The official specs list 480p, 720p, and 1080p outputs with durations from 2 to 12 seconds at 24fps.
Is multi‑reference image guidance available?
The ModelArk documentation includes reference image support in the model version list, but the overview section also states that reference image scenarios are not supported. Because of this inconsistency, verify availability in your provider console for your specific model version.
When should I choose Lite over Pro?
Choose Lite when speed, volume, and cost efficiency are more important than maximum visual fidelity. If you need audio‑synchronized video or top‑tier cinematic quality, use the Pro tiers.