Publications
* Denotes Equal Contribution.
First-author and co-first-author papers are highlighted.
|
|
Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers
Zhengyao Lv*,
Tianlin Pan*,
Chenyang Si,
Zhaoxi Chen,
Wangmeng Zuo,
Ziwei Liu,
Kwan-Yee K. Wong,
arXiv, 2025
project page
/
arXiv
/
code
|
|
Dual-Expert Consistency Model for Efficient and High-Quality Video Generation
Zhengyao Lv*,
Chenyang Si*,
Tianlin Pan,
Zhaoxi Chen,
Kwan-Yee K. Wong,
Yu Qiao,
Ziwei Liu
arXiv, 2025
project page
/
arXiv
/
code
|
|