Publications

* indicates equal contribution.

Audio Editing

  • MMEdit: A Unified Framework for Multi-Type Audio Editing via Audio Language Model
    Ye Tao, Xuenan Xu, Wen Wu, Shuai Wang, Mengyue Wu, Chao Zhang
    ICME 2026 (accepted), 2026
    A unified audio editing framework covering addition, replacement, removal, reordering, and attribute modification with strong instruction following and content preservation.

Audio Generation

  • UniFlow-Audio: Unified Flow Matching for Audio Generation from Omni-Modalities
    Xuenan Xu, Jiahao Mei, Zihao Zheng, Ye Tao, Zeyu Xie, Yaoyun Zhang, Haohe Liu, Yuning Wu, Ming Yan, Wen Wu, Chao Zhang, Mengyue Wu
    arXiv preprint, 2025
    A unified flow-matching framework for omni-modal audio generation across time-aligned and non-time-aligned tasks with strong parameter efficiency.