pipixin321

Follow

🎯

Focusing

Huaxin Zhang pipixin321

🎯

Focusing

Follow

Focus in Video Understanding.

19 followers · 10 following

HUST(Huazhong University of Science and Technology)
Wuhan
19:57 (UTC +08:00)

Achievements

Achievements

pipixin321/README.md

Hi there 👋, I'm Huaxin Zhang

I am a Master of HUST (Huazhong University of Science and Technology), supervised by Prof. Changxin Gao and Prof. Nong Sang.

🔭 Reseach-wise, I mainly focus on:

Multi-modal Large Language Models
Video Understanding, more specifically, Weakly-supervised Temporal Action Localization (WSTAL) & Weakly-suervised Video Anomaly Detection (WSVAD).

😄 I am open to:

A internship/job/PhD offer with computer vision/multimodal LLM research and engineering.

📫 Contact me by:

Email: zhanghuaxin@hust.edu.cn

💬 News:

2024-07-01: We release our code and model of "Holmes-VAD: Towards Unbiased and Explainable Video Anomaly Detection via Multi-modal LLM".[project page]
2024-06-10: We release our code and model of "Arcana: Improving Multi-modal Large Language Model through Boosting Vision Capabilities".[project page]
2024-01-29: I start my internship in Baidu VIS, to do some research on Multi-modal Large Language Model (MLLM).
2023-12-09: One paper about point supervised temporal action localization is accepted on AAAI 2024.

Pinned Loading

HolmesVAU HolmesVAU Public

✨✨✨Official implementation of "Holmes-VAU: Towards Long-term Video Anomaly Understanding at Any Granularity"

Python 6
HolmesVAD HolmesVAD Public

Official implementation of "Holmes-VAD: Towards Unbiased and Explainable Video Anomaly Detection via Multi-modal LLM"

Python 88 3
Awesome-Video-MLLMs Awesome-Video-MLLMs Public

🔥 🔥 🔥 Awesome MLLMs/Benchmarks for Short/Long/Streaming Video Understanding 📹

2
Arcana Arcana Public

Forked from syp2ysy/Arcana

Implementation of "Arcana: Improving Multi-modal Large Language Model through Boosting Vision Capabilitie"

Python
HR-Pro HR-Pro Public

[AAAI24] Official implementation of "Point-supervised Temporal Action Localization via Hierarchical Reliability Propagation"

Python 27 1
GlanceVAD GlanceVAD Public

Official implementation of "GlanceVAD: Exploring Glance Supervision for Label-efficient Video Anomaly Detection"

Python 21