Wan AI is a visual generation model developed by Alibaba's Tongyi Lab that turns text, images, and other control signals into video. Its Wan 2.1 series of models has been fully open-sourced, so users can download and run them freely.
Wan 2.1 handles a range of tasks, including Text-to-Video, Image-to-Video, video editing, Text-to-Image, and Video-to-Audio. It also runs on consumer-grade GPUs: the T2V-1.3B model needs only 8.19 GB of VRAM and generates a 5-second 480P video on an RTX 4090 in about 4 minutes.
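To put those hardware figures in perspective, here is a minimal arithmetic sketch in Python. It uses only the numbers quoted above (8.19 GB, 5 seconds, roughly 4 minutes); the function names are hypothetical helpers written for illustration and are not part of any Wan 2.1 tooling.

```python
# Figures quoted in the text for the T2V-1.3B model on an RTX 4090.
REQUIRED_VRAM_GB = 8.19
CLIP_SECONDS = 5
GENERATION_MINUTES = 4  # approximate


def fits_in_vram(available_gb: float, required_gb: float = REQUIRED_VRAM_GB) -> bool:
    """Return True if the GPU offers at least the quoted amount of VRAM."""
    return available_gb >= required_gb


def compute_seconds_per_video_second(
    clip_seconds: float = CLIP_SECONDS,
    generation_minutes: float = GENERATION_MINUTES,
) -> float:
    """Wall-clock seconds of generation per second of finished video."""
    return generation_minutes * 60 / clip_seconds


if __name__ == "__main__":
    # An RTX 4090 ships with 24 GB of VRAM, well above the quoted requirement.
    print(fits_in_vram(24.0))
    print(compute_seconds_per_video_second())
```

By these figures, each second of 480P video costs roughly 48 seconds of compute on an RTX 4090, so generation is far from real time even though the memory requirement is modest.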
Wan 2.1 can also produce realistic complex motion, physically plausible dynamics, and cinematic-quality visuals. It can render text inside generated video in both Chinese and English, which broadens its usefulness for users worldwide. The open-source Wan2.1-I2V-14B model is reported to outperform most competing models, setting a new benchmark in video generation.