Hao Zhang HaoZhang990127

16 followers · 6 following

Tsinghua University
Germany

Achievements

Lists (27)

3d

15 repositories

agent

avatar

65 repositories

avatar-motion

control

depth

diffusion

distributed dl

image generation

69 repositories

isaac gym

learning isaac gym

layout

llm

18 repositories

ocr

outpainting

physics

rl

reinforcement learning

segment

sr

video+3d

video edit

video generation

132 repositories

Video Stabilization

video understand

vlm

voice

vqa

world model

Starred repositories

thu-ml / RIFLEx

Official implementation for "RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers"

Python 313 33 Updated Mar 3, 2025

mannaandpoem / OpenManus

No fortress, purely open ground. OpenManus is Coming.

Python 15,108 2,270 Updated Mar 8, 2025

Tencent / HunyuanVideo-I2V

HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo

Python 785 44 Updated Mar 8, 2025

SkyworkAI / SkyReels-A1

SkyReels-A1: Expressive Portrait Animation in Video Diffusion Transformers

Python 364 40 Updated Mar 5, 2025

QwenLM / Qwen2.5-VL

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 8,460 595 Updated Mar 7, 2025

Wan-Video / Wan2.1

Wan: Open and Advanced Large-Scale Video Generative Models

Python 7,558 759 Updated Mar 7, 2025

Fantasy-AMAP / fantasy-id

FantasyID: Face Knowledge Enhanced ID-Preserving Video Generation

40 1 Updated Feb 21, 2025

Chenguoz / CAIG

[WWW 2025] Official PyTorch Code for "CTR-Driven Advertising Image Generation with Multimodal Large Language Models"

Python 16 Updated Feb 25, 2025

AnyCharV / AnyCharV

[arXiv'25] AnyCharV: Bootstrap Controllable Character Video Generation with Fine-to-Coarse Guidance

Python 27 Updated Feb 19, 2025

bcmi / Light-A-Video

Light-A-Video: Training-free Video Relighting via Progressive Light Fusion

Python 369 25 Updated Feb 28, 2025

Phantom-video / Phantom

Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment

CSS 471 32 Updated Mar 5, 2025

stepfun-ai / Step-Video-T2V

Python 2,595 221 Updated Feb 27, 2025

SkyworkAI / SkyReels-V1

SkyReels V1: The first and most advanced open-source human-centric video foundation model

Python 1,721 153 Updated Feb 24, 2025

facebookresearch / pippo

Pippo: High-Resolution Multi-View Humans from a Single Image

Python 459 37 Updated Feb 25, 2025

google-deepmind / physics-IQ-benchmark

Benchmarking physical understanding in generative video models

Python 121 12 Updated Feb 28, 2025

ZiyuGuo99 / Image-Generation-CoT

Investigating CoT Reasoning in Autoregressive Image Generation

Python 525 19 Updated Mar 7, 2025

Tencent / Hunyuan3D-2

High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.

Python 6,750 525 Updated Feb 24, 2025

byliutao / 1Prompt1Story

🔥ICLR 2025 (Spotlight) One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt

Python 209 25 Updated Feb 26, 2025

NJU-PCALab / STAR

STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution

Python 1,048 60 Updated Jan 22, 2025

modelscope / data-juicer

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 3,859 216 Updated Mar 8, 2025

showlab / VideoSwap

Code for [CVPR 2024] VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence

Python 383 15 Updated Dec 6, 2024

VectorSpaceLab / OmniGen

OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340

Jupyter Notebook 3,679 315 Updated Feb 20, 2025

NVIDIA / Cosmos

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Jupyter Notebook 7,634 489 Updated Mar 7, 2025

bytedance / VideoWorld

[CVPR 2025] VideoWorld is a simple generative model that learns purely from unlabeled videos—much like how babies learn by observing their environment

Python 396 22 Updated Mar 7, 2025

Saiyan-World / goku

Video Generation Foundation Models: https://saiyan-world.github.io/goku/

Python 2,642 273 Updated Feb 19, 2025

pq-yang / MatAnyone

[CVPR 2025] MatAnyone: Stable Video Matting with Consistent Memory Propagation

Python 797 39 Updated Feb 24, 2025

jixiaozhong / Sonic

Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"

Python 1,988 162 Updated Feb 10, 2025

bytedance / LatentSync

Taming Stable Diffusion for Lip Sync!

Python 2,845 423 Updated Jan 19, 2025

deepseek-ai / DeepSeek-R1

85,490 11,030 Updated Feb 24, 2025

bytedance / X-Dyna

[CVPR 2025] X-Dyna: Expressive Dynamic Human Image Animation

Python 208 19 Updated Jan 30, 2025

Starred topics

text-to-3d

3d-face-reconstruction