VIDEO GENERATIONUNCENSOREDby Alibaba (Tongyi Lab)

WAN 2.7 Spicy

Wan 2.7 by Alibaba Tongyi Lab — latest Wan generation, up to 15s, 1080p, native audio. Unrestricted image-to-video for trusted users on ZenCreator.

Credits never expireCommercial usage
15s
Max duration
1080p
Resolution
🎵
Native audio
🔓
Unrestricted

Why pick WAN 2.7 Spicy

🆕 Latest Wan generation
Wan 2.7 is the newest model in Alibaba's video lineup — better temporal consistency, motion naturalness, and prompt adherence than Wan 2.6. The Wan family's most capable video model currently on ZenCreator.
🎵 Audio + video in one pass
Native audio generation alongside the video — ambient sound, music, and synchronized audio from a single prompt. No separate audio step required for most content.
⏱ Up to 15 seconds
5, 10, or 15-second clips — enough for short-form storytelling, product demos, and social content without cutting scenes mid-action. The full 15s range at 1080p in a single generation.
📺 1080p native
Full 1080p output — publish-ready for Reels, TikTok, and YouTube Shorts without upscaling. Drop to 720p when you want faster generation and lower credit cost.
🔓 Unrestricted for trusted users
No safety filters on the animation. Wan 2.7 runs with minimal content restrictions for trusted users — the same unrestricted access as the full Wan family on ZenCreator.
📌 Start frame anchoring
Upload any photo as the start frame and Wan 2.7 animates outward from it — preserving your character's face, outfit, and scene composition through the full clip.

What is WAN 2.7 Spicy?

Wan 2.7 is Alibaba Tongyi Lab's newest image-to-video model — the latest generation in a family that includes Wan 2.2, 2.5, and 2.6. Each version improved on temporal consistency and motion quality; Wan 2.7 raises the bar again with better prompt adherence, smoother inter-frame coherence, and more natural motion physics across a wider range of subject types.

Practically: upload a source image as the start frame, write a motion + audio prompt, and Wan 2.7 generates a 5–15 second clip at up to 1080p with native audio baked in. No end frame control (for start+end, use Wan 2.2). No LoRA style adaptation (for special styles, use Wan 2.2 + LoRAs).

The model is available for trusted users on ZenCreator with minimal content restrictions — the same unrestricted access as the full Wan family. Cost scales by resolution and duration: 720p is cheaper per second than 1080p.

See WAN 2.7 Spicy in action

Source photo
Source photo
Source photo

Wan 2.7 vs other audio video models

ModelResolutionDurationProviderContent
Wan 2.7720p–1080pUp to 15sAlibabaUnrestricted
Wan 2.6 + Audio1080pUp to 15sAlibabaUnrestricted
Seedance Pro 1.51080p5–10sByteDanceUnrestricted
Kling 2.6 + Audio1080p5–10sKuaishouSafe only

How to get started

1
Upload your photo
Source photo example
2
Write a motion + audio prompt
slow camera push in, lips part, eyes look up, dramatic neon light glow, sensual close-up
3
1080p clip with audio

Bottom line

Wan 2.7 is the most capable Wan video model on ZenCreator — the newest generation with native audio, 1080p output, and 15-second clips. Use it as the default choice when you need Alibaba's video quality plus audio in one generation. For start+end frame control, use Wan 2.2. For special LoRA styles, use Wan 2.2 + LoRAs.

Available in

Image-to-Video
Upload a source image, write a motion + audio prompt, pick Wan 2.7, generate at 720p or 1080p.
Try Image-to-Video
Text-to-Video
Generate video directly from a text prompt — no source image needed. Pick WAN 2.7 Spicy from the model list.
Open Text-to-Video
Wan 2.7 is a newer generation with improved temporal consistency, motion naturalness, and prompt adherence. Both support audio and 15-second duration at 1080p. Wan 2.7 is the more capable model; Wan 2.6 + Audio remains available as an alternative if you prefer it.
No — start frame only. You upload a photo as the opening frame and Wan 2.7 animates outward from it. For start AND end frame control (useful for loops and transitions), use Wan 2.2 or Kling 2.1.
Audio is generated automatically from your prompt — describe the sonic context (ambient rain, upbeat background track, crowd murmur) and the model matches it. Voice auto-selects in English. For a specific voice or non-English speech, generate silently and add audio via the Lipsync tool.
720p generates faster and costs fewer credits per second; 1080p produces sharper, publish-ready output. Use 720p for drafts and direction checks, switch to 1080p for the final clip you're posting.
Yes — Wan 2.7 runs with minimal content filters for trusted users on ZenCreator. No safety filters limit the animation output.
Both are premium unrestricted models with audio at 1080p. Wan 2.7 goes up to 15s and is the newer Alibaba model. Seedance Pro 1.5 (ByteDance) caps at 10s but adds explicit camera control (dolly, pan, orbit). Choose Wan 2.7 for longer clips; Seedance Pro 1.5 for precise camera movement.

Sources

  1. Alibaba Tongyi Lab Wan model family: wan.video
  2. ZenCreator Image-to-Video tool: zencreator.pro
  3. ZenCreator AI Models internal review database, June 2026

Try WAN 2.7 Spicy

Available on ZenCreator — sign in, open the relevant generator, pick WAN 2.7 Spicy from the model list.

WAN 2.7 Spicy is developed by Alibaba (Tongyi Lab). Official page. ZenCreator provides access to WAN 2.7 Spicy through its platform.