Repository History

3 repositories tagged with Video Generation

Topic: Video Generation

HunyuanVideo-Avatar: High-Fidelity Audio-Driven Human Animation

HunyuanVideo-Avatar is a cutting-edge project by Tencent-Hunyuan for high-fidelity, audio-driven human animation. Utilizing a multimodal diffusion transformer, it generates dynamic, emotion-controllable, and multi-character dialogue videos. This innovative system addresses critical challenges in character consistency, emotion alignment, and multi-character animation, making it suitable for diverse applications like e-commerce and social media.

Analyzed Dec 30, 2025

View Details

InfiniteTalk: Unlimited-Length AI Video Generation from Audio or Images

InfiniteTalk is an innovative AI model for generating unlimited-length talking videos. It excels at creating realistic video content from audio, supporting both image-to-video and video-to-video generation. This framework ensures accurate lip synchronization and consistent identity preservation, aligning head movements, body posture, and facial expressions with the input audio.

Analyzed Nov 13, 2025

View Details

Step-Video-T2V: State-of-the-Art Text-to-Video Generation Model

Step-Video-T2V is a state-of-the-art text-to-video pre-trained model capable of generating videos up to 204 frames with 30 billion parameters. It achieves high efficiency through a deep compression Video-VAE and enhances visual quality using Direct Preference Optimization (DPO). The model's performance is validated on its novel benchmark, Step-Video-T2V-Eval, demonstrating superior text-to-video quality.

Analyzed Oct 29, 2025

View Details

Previous Page 1 Next