Repository History

Explore all analyzed open source repositories

Topic: Video Generation
HunyuanVideo-Avatar: High-Fidelity Audio-Driven Human Animation

HunyuanVideo-Avatar: High-Fidelity Audio-Driven Human Animation

HunyuanVideo-Avatar is a cutting-edge project by Tencent-Hunyuan for high-fidelity, audio-driven human animation. Utilizing a multimodal diffusion transformer, it generates dynamic, emotion-controllable, and multi-character dialogue videos. This innovative system addresses critical challenges in character consistency, emotion alignment, and multi-character animation, making it suitable for diverse applications like e-commerce and social media.

Dec 30, 2025
View Details
InfiniteTalk: Unlimited-Length AI Video Generation from Audio or Images

InfiniteTalk: Unlimited-Length AI Video Generation from Audio or Images

InfiniteTalk is an innovative AI model for generating unlimited-length talking videos. It excels at creating realistic video content from audio, supporting both image-to-video and video-to-video generation. This framework ensures accurate lip synchronization and consistent identity preservation, aligning head movements, body posture, and facial expressions with the input audio.

Nov 13, 2025
View Details
Step-Video-T2V: State-of-the-Art Text-to-Video Generation Model

Step-Video-T2V: State-of-the-Art Text-to-Video Generation Model

Step-Video-T2V is a state-of-the-art text-to-video pre-trained model capable of generating videos up to 204 frames with 30 billion parameters. It achieves high efficiency through a deep compression Video-VAE and enhances visual quality using Direct Preference Optimization (DPO). The model's performance is validated on its novel benchmark, Step-Video-T2V-Eval, demonstrating superior text-to-video quality.

Oct 29, 2025
View Details
Page 1