Use Wan 2.1 via API for Effortless Video Creation


Key Highlights

Wan2.1 excels in real-world accuracy, including:

  • Hand details: Renders fingers and textures.
  • Drawing animation: Turns static drawings into clips.
  • Physics: Simulates effects like bending trees.
  • Hair movement: Realistically animates strands.
  • Object interactions: Handles complex spatial interactions.
  • Crowd rendering: Maintains clarity in large scenes.

Via Novita AI, a 5-second 720P video costs $0.40, with easy-to-use APIs. Wan2.1 combines affordability, accessibility, and precision in video generation.

Exploring the Full Power of Wan 2.1

What is Wan 2.1?

Alibaba Cloud has open-sourced its Wan2.1 series of AI models for video generation. This lowers the barrier to entry for academics, researchers, and commercial users, who can now use state-of-the-art video models without substantial upfront costs. By making these models openly available, Alibaba Cloud promotes innovation and collaboration in AI and video generation.

Versatile Capabilities of Wan2.1

Wan2.1 is a highly versatile tool, excelling in various tasks related to video and image generation:

  • Text-to-Video (T2V): Converts text prompts into videos.
  • Image-to-Video (I2V): Generates videos from a single image input.
  • Video Editing: Enables AI-powered editing for improved video quality and effects.
  • Text-to-Image (T2I): Transforms text descriptions into still images.

A standout feature of Wan2.1 is its ability to render both Chinese and English text inside generated videos, which its developers describe as a first for video generation models. This text rendering makes it practical across a wider range of use cases.

Real-World Accuracy of Wan2.1

The Wan2.1 series stands out for its exceptional accuracy in simulating real-world physics and details. Below are some of the key capabilities that highlight its performance:

a. Hand Details

  • The model excels at rendering hands, showcasing accurate details like individual fingers, skin textures, and even accessories like rings.

b. Drawing Animation

  • Wan2.1 can transform static drawings into short video clips, making it a powerful tool for animators and digital artists.

c. Physics Simulation

  • The model accurately depicts real-world physics. For example, it can render a giraffe hanging upside down from a tree, with the tree branch bending realistically under the weight.

d. Hair Movement

  • It accurately renders hair movement in videos, showing individual strands as they move naturally when people turn their heads.

e. Object Interactions

  • Wan2.1 can simulate complex object interactions, ensuring that multiple objects interact realistically within the same space.

f. Crowd Rendering

  • The model excels at rendering large crowds with each individual remaining distinct, resulting in coherent and detailed scenes.

VBench of Wan 2.1

VBench is a benchmark suite for video generation models. It is designed to expose the strengths and weaknesses of individual models, enabling detailed and objective evaluation. These insights inform future work on video generation and help improve model performance. To stay consistent with human perception, VBench incorporates human preference annotations, which reinforces its relevance and reliability as a benchmark. The chart below highlights the performance of Wan2.1:

[Chart: VBench results for Wan2.1. Source: Alizila]

Hardware Requirements of Wan2.1

Wan2.1 offers multiple models tailored for different use cases, each with specific hardware demands. Below is a detailed summary of the hardware requirements for the four Wan2.1 models:

Model Name | Function | Resolution Support | Model Size | Hardware Demand | Recommended GPU
T2V-14B | Text-to-Video (T2V) | 480P / 720P | 14B | ⭐⭐⭐⭐ | A100 / RTX 3090 / RTX 4090
I2V-14B-720P | Image-to-Video (I2V) | 720P | 14B | ⭐⭐⭐⭐ | A100 / RTX 3090 / RTX 4090
I2V-14B-480P | Image-to-Video (I2V) | 480P | 14B | ⭐⭐⭐ | RTX 3090 / RTX 4070 Ti
T2V-1.3B | Text-to-Video (T2V) | Low Resolution | 1.3B | ⭐⭐ | RTX 3060 / RTX 4060 or higher
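
The same information can be kept as a small lookup structure, for instance to pick a model for a given task and resolution in a script. This is only a sketch: the `WanModel` class and the `models_for` helper are hypothetical names, and the values are transcribed from the table above rather than read from any official API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class WanModel:
    name: str
    function: str            # "T2V" or "I2V"
    resolutions: tuple       # resolution labels as listed in the table
    size: str
    recommended_gpus: tuple


# Values transcribed from the hardware table above.
MODELS = (
    WanModel("T2V-14B", "T2V", ("480P", "720P"), "14B",
             ("A100", "RTX 3090", "RTX 4090")),
    WanModel("I2V-14B-720P", "I2V", ("720P",), "14B",
             ("A100", "RTX 3090", "RTX 4090")),
    WanModel("I2V-14B-480P", "I2V", ("480P",), "14B",
             ("RTX 3090", "RTX 4070 Ti")),
    WanModel("T2V-1.3B", "T2V", ("Low Resolution",), "1.3B",
             ("RTX 3060", "RTX 4060")),
)


def models_for(function, resolution):
    """Return the names of models matching a function and a resolution label."""
    return [m.name for m in MODELS
            if m.function == function and resolution in m.resolutions]


print(models_for("T2V", "720P"))
print(models_for("I2V", "480P"))
```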

Run Wan2.1 with an API

Novita AI is an AI cloud platform that gives developers an easy way to deploy AI models through a simple API, along with an affordable and reliable GPU cloud for building and scaling.

Novita's pricing is competitive. For example, a 5-second 720P Wan 2.1 video costs $0.40 on Novita AI, while a similar video on Replicate costs $1.
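
To put the two quoted prices side by side, the arithmetic can be worked through in a few lines. This sketch uses only the figures stated above ($0.40 and $1 per 5-second 720P video). The helper names and the batch size of 100 clips are illustrative assumptions.

```python
# Prices quoted above for one 5-second 720P video, in US dollars.
NOVITA_PRICE = 0.40
REPLICATE_PRICE = 1.00
CLIP_SECONDS = 5


def per_second(price, seconds=CLIP_SECONDS):
    """Price per second of generated video."""
    return price / seconds


def batch_cost(price, clips):
    """Total price for a number of clips at a flat per-clip price."""
    return price * clips


# Relative saving of the cheaper price against the more expensive one.
savings_pct = (REPLICATE_PRICE - NOVITA_PRICE) / REPLICATE_PRICE * 100

print(f"Novita AI: ${per_second(NOVITA_PRICE):.2f} per second")
print(f"Replicate: ${per_second(REPLICATE_PRICE):.2f} per second")
print(f"Saving: {savings_pct:.0f}%")
print(f"100 clips: ${batch_cost(NOVITA_PRICE, 100):.2f} "
      f"vs ${batch_cost(REPLICATE_PRICE, 100):.2f}")
```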

Step 1: Log In and Access the Model Library

Log in to your account and click on the Model Library button.


Step 2: Choose Your Model

Browse through the available options and select the model that suits your needs.

Step 3: Start Your Free Trial

Begin your free trial to explore the capabilities of the selected model.


Step 4: Get Your API Key

To authenticate with the API, you need an API key. Open the “Settings” page and copy your API key from there.
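
Once you have copied the key, one common pattern is to keep it out of source code and read it from an environment variable. The sketch below is a suggestion, not something Novita AI prescribes: the variable name `NOVITA_API_KEY` and the `build_headers` helper are hypothetical, and the bearer-token format of the `Authorization` header is an assumption to confirm against the API documentation.

```python
import os


def build_headers(env_var="NOVITA_API_KEY"):
    """Build request headers from an API key stored in an environment variable.

    Assumption: the key is sent as a bearer token. Check the API docs.
    """
    api_key = os.environ.get(env_var)
    if not api_key:
        raise RuntimeError(
            f"Set the {env_var} environment variable to your API key"
        )
    return {
        "Content-Type": "application/json",
        "Authorization": f"Bearer {api_key}",
    }
```

With the variable set in your shell, `build_headers()` returns a dictionary that can be passed as the `headers` argument of an HTTP request.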


Step 5: Install the API

Install an HTTP client using the package manager for your programming language (the Python example below uses the requests library).

After installation, import the necessary libraries into your development environment and pass your API key in the request headers to start interacting with Novita AI. The following example calls the Wan 2.1 text-to-video endpoint from Python.

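import requests

# Wan 2.1 text-to-video endpoint.
url = "https://api.novita.ai/v3/async/wan-t2v"

# All "<string>" and 123 values are placeholders; replace them with real values.
payload = {
    # Webhook settings for task notifications.
    "extra": {
        "webhook": {
            "url": "<string>",
            "test_mode": {
                "enabled": True,
                "return_task_status": "<string>",
            },
        }
    },
    "model_name": "<string>",  # model identifier
    "width": 123,              # output width in pixels
    "height": 123,             # output height in pixels
    "seed": 123,               # random seed
    "prompt": "<string>",      # text description of the video
    "frames": 123,             # number of frames to generate
}
headers = {
    "Content-Type": "application/json",  # the payload is sent as JSON
    "Authorization": "<authorization>",  # built from the API key from Step 4
}

response = requests.post(url, json=payload, headers=headers, timeout=60)
response.raise_for_status()  # raise an exception on HTTP error responses

print(response.text)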
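
The `/async/` segment in the endpoint path suggests that the request returns a task reference rather than the finished video, so a client would typically poll until the task completes. The sketch below shows that pattern under stated assumptions. The task-result URL, the `task.status` and `videos[].video_url` fields, and the `TASK_STATUS_SUCCEED` / `TASK_STATUS_FAILED` values are not given in this article and should be checked against the Novita AI API reference. `fetch_task` and `poll_task` are hypothetical helper names. The sketch uses only the standard library so it runs without extra installs; requests would work just as well.

```python
import json
import time
import urllib.parse
import urllib.request

# Assumed endpoint and status values; verify against the API reference.
TASK_RESULT_URL = "https://api.novita.ai/v3/async/task-result"
DONE = "TASK_STATUS_SUCCEED"
FAILED = "TASK_STATUS_FAILED"


def fetch_task(task_id, headers):
    """Fetch the current state of an async task as a dict."""
    query = urllib.parse.urlencode({"task_id": task_id})
    req = urllib.request.Request(f"{TASK_RESULT_URL}?{query}", headers=headers)
    with urllib.request.urlopen(req, timeout=30) as resp:
        return json.load(resp)


def poll_task(task_id, fetch, interval=5.0, max_attempts=120, sleep=time.sleep):
    """Call fetch(task_id) until the task succeeds, fails, or attempts run out."""
    for _ in range(max_attempts):
        result = fetch(task_id)
        status = result.get("task", {}).get("status")
        if status == DONE:
            # Collect the URLs of all generated videos.
            return [v.get("video_url") for v in result.get("videos", [])]
        if status == FAILED:
            raise RuntimeError(f"Task {task_id} failed: {result.get('task')}")
        sleep(interval)
    raise TimeoutError(f"Task {task_id} did not finish after {max_attempts} polls")
```

A call might look like `poll_task(task_id, lambda t: fetch_task(t, headers))`, where `task_id` is read from the response to the request above. Passing the fetch function as an argument keeps the polling loop testable without network access.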
  
  

Frequently Asked Questions

What hardware is required to run Wan2.1?

  • High-performance GPUs like A100, RTX 3090, or RTX 4090 are recommended for 14B models.
  • Entry-level models (e.g., T2V-1.3B) can run on RTX 3060+ for low-resolution tasks.

What makes Novita AI cost-effective for Wan2.1?

  • Competitive pricing: $0.40 per 5-second 720P video, compared to $1 for a similar video on Replicate.
  • Easy-to-use API with detailed documentation for developers.

How can I use Wan2.1 via API?

Steps to use Wan2.1 through Novita AI:

  1. Log in and access the model library.
  2. Select the desired model.
  3. Start a free trial to explore its capabilities.
  4. Obtain your API key in the Settings section.
  5. Install and integrate the API into your development environment.

Novita AI is an all-in-one cloud platform that empowers your AI ambitions. Integrated APIs, serverless deployment, and GPU instances: the cost-effective tools you need. Eliminate infrastructure, start free, and make your AI vision a reality.
