Wan 2.5 Text-To-Video Spark Generation
Generate videos from text prompts using the wan2.5-t2v-spark model.
Documentation Index
Fetch the complete documentation index at: https://mulerun.com/docs/llms.txt
Use this file to discover all available pages before exploring further.
Overview
Generate videos from text prompts using the wan2.5-t2v-spark model with optional audio generation.Key Features
- Text-to-video generation with auto sound or custom audio
- Multiple resolution options (720P/1080P)
- 5s or 10s duration
- 24fps output
- Fine-tuned for enhanced visual quality
Resolution Options
720P
- 1280×720 (16:9)
- 720×1280 (9:16)
- 960×960 (1:1)
- 1088×832 (4:3)
- 832×1088 (3:4)
1080P
- 1920×1080 (16:9)
- 1080×1920 (9:16)
- 1440×1440 (1:1)
- 1632×1248 (4:3)
- 1248×1632 (3:4)
Audio Features
Auto-generated Audio
- Enabled by default
- Automatically generates synchronized audio based on video content
Custom Audio
- Supported formats: WAV, MP3
- Duration: 3-30 seconds
- Max file size: 15MB
- Behavior: If audio is shorter than video, remaining portion is silent; if longer, it’s truncated
Example Requests
Basic Text-to-Video
With Custom Audio
Silent Video
Prompt Tips
For best results when describing motion:- Specify camera movement (pan left, zoom in, dolly shot)
- Describe subject motion (walks forward, turns around)
- Include environment details (windy, foggy, sunlit)
Authorizations
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
Body
Text description for the desired video content (max 2000 characters).
2000Negative prompt describing unwanted content (max 500 characters).
500Enable automatic audio generation. Set to false to force a silent output.
Custom audio file URL (wav/mp3, 3-30s, ≤15MB). Overrides the audio flag.
Output resolution ("width*height"). Supported tiers:
- 720P: 1280*720 (16:9), 720*1280 (9:16), 960*960 (1:1), 1088*832 (4:3), 832*1088 (3:4)
- 1080P: 1920*1080 (16:9), 1080*1920 (9:16), 1440*1440 (1:1), 1632*1248 (4:3), 1248*1632 (3:4)
1280*720, 720*1280, 960*960, 1088*832, 832*1088, 1920*1080, 1080*1920, 1440*1440, 1632*1248, 1248*1632 Video duration in seconds (24 fps). Supported values 5 or 10.
5, 10 Enable intelligent prompt rewriting (slightly longer latency, better detail).
Random seed [0, 2147483647].
0 <= x <= 2147483647Response
Accepted - Task created successfully