Skip to content
View HaoZhang990127's full-sized avatar
  • Tsinghua University
  • Germany

Block or report HaoZhang990127

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Official implementation for "RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers"

Python 313 33 Updated Mar 3, 2025

No fortress, purely open ground. OpenManus is Coming.

Python 15,108 2,270 Updated Mar 8, 2025

HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo

Python 785 44 Updated Mar 8, 2025

SkyReels-A1: Expressive Portrait Animation in Video Diffusion Transformers

Python 364 40 Updated Mar 5, 2025

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 8,460 595 Updated Mar 7, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 7,558 759 Updated Mar 7, 2025

FantasyID: Face Knowledge Enhanced ID-Preserving Video Generation

40 1 Updated Feb 21, 2025

[WWW 2025] Official PyTorch Code for "CTR-Driven Advertising Image Generation with Multimodal Large Language Models"

Python 16 Updated Feb 25, 2025

[arXiv'25] AnyCharV: Bootstrap Controllable Character Video Generation with Fine-to-Coarse Guidance

Python 27 Updated Feb 19, 2025

Light-A-Video: Training-free Video Relighting via Progressive Light Fusion

Python 369 25 Updated Feb 28, 2025

Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment

CSS 471 32 Updated Mar 5, 2025

SkyReels V1: The first and most advanced open-source human-centric video foundation model

Python 1,721 153 Updated Feb 24, 2025

Pippo: High-Resolution Multi-View Humans from a Single Image

Python 459 37 Updated Feb 25, 2025

Benchmarking physical understanding in generative video models

Python 121 12 Updated Feb 28, 2025

Investigating CoT Reasoning in Autoregressive Image Generation

Python 525 19 Updated Mar 7, 2025

High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.

Python 6,750 525 Updated Feb 24, 2025

🔥ICLR 2025 (Spotlight) One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt

Python 209 25 Updated Feb 26, 2025

STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution

Python 1,048 60 Updated Jan 22, 2025

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 3,859 216 Updated Mar 8, 2025

Code for [CVPR 2024] VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence

Python 383 15 Updated Dec 6, 2024

OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340

Jupyter Notebook 3,679 315 Updated Feb 20, 2025

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Jupyter Notebook 7,634 489 Updated Mar 7, 2025

[CVPR 2025] VideoWorld is a simple generative model that learns purely from unlabeled videos—much like how babies learn by observing their environment

Python 396 22 Updated Mar 7, 2025

Video Generation Foundation Models: https://saiyan-world.github.io/goku/

Python 2,642 273 Updated Feb 19, 2025

[CVPR 2025] MatAnyone: Stable Video Matting with Consistent Memory Propagation

Python 797 39 Updated Feb 24, 2025

Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"

Python 1,988 162 Updated Feb 10, 2025

Taming Stable Diffusion for Lip Sync!

Python 2,845 423 Updated Jan 19, 2025

[CVPR 2025] X-Dyna: Expressive Dynamic Human Image Animation

Python 208 19 Updated Jan 30, 2025
Next
Showing results