About me

Short Bio

I am Tao Ye, an undergraduate student in Artificial Intelligence at Shanghai Jiao Tong University (SJTU), where I am a member of the AI Talents Pilot Class and the Zhiyuan Honors Program.

I will join the Nanjing University Speech Group as an M.S. student (2026-2029), advised by Prof. Shuai Wang.

My research focuses on general audio generation and audio-visual generation, especially controllable generation and editing in multimodal settings.


Basic Information


Education

  • Shanghai Jiao Tong University (SJTU), Shanghai, China
    B.Eng. in Artificial Intelligence, 2022-2026
    AI Talents Pilot Class, Zhiyuan Honors Program, X-LANCE Lab

  • Nanjing University (NJU), Nanjing, China
    M.S. Student (Incoming), Speech Group, 2026-2029
    Advisor: Prof. Shuai Wang


Research Interests

  • General audio generation
  • Dialogue systems
  • Spoken language models

Experience

  • Research Intern, Shanghai AI Laboratory (Speech Group), Jun 2025 - Dec 2025
    Supervised by Prof. Chao Zhang.

  • Research Intern, Video Rebirth, Dec 2025 - Present
    Working on unified audio-visual generation and video-to-audio (VTA) tasks.


Selected Publications

  • MMEdit: A Unified Framework for Multi-Type Audio Editing via Audio Language Model
    Ye Tao, Xuenan Xu, Wen Wu, Shuai Wang, Mengyue Wu, Chao Zhang.
    arXiv preprint, 2025. arXiv | Project

  • UniFlow-Audio: Unified Flow Matching for Audio Generation from Omni-Modalities
    Xuenan Xu, Jiahao Mei, Zihao Zheng, Ye Tao, Zeyu Xie, Yaoyun Zhang, Haohe Liu, Yuning Wu, Ming Yan, Wen Wu, Chao Zhang, Mengyue Wu.
    arXiv preprint, 2025. arXiv | Project


Honors

  • Zhiyuan Honors Program, Shanghai Jiao Tong University
  • AI Talents Pilot Class, Shanghai Jiao Tong University