I am Tao Ye, an undergraduate student in Artificial Intelligence at Shanghai Jiao Tong University (SJTU), enrolled in the AI Talents Pilot Class and the Zhiyuan Honors Program.

I am an incoming M.S. student at the Nanjing University Speech Group (2026-2029), advised by Prof. Shuai Wang.


Research Interests

  • General audio generation
  • Dialogue systems
  • Spoken language models

Education

  • Shanghai Jiao Tong University (SJTU), B.Eng. in Artificial Intelligence, 2022-2026
    AI Talents Pilot Class, Zhiyuan Honors Program, X-LANCE Lab

  • Nanjing University (NJU), Incoming M.S. Student, 2026-2029
    Speech Group, advised by Prof. Shuai Wang


Experience

  • Research Intern, Shanghai AI Laboratory (Speech Group), Jun 2025 - Dec 2026
    Supervised by Prof. Chao Zhang.

  • Research Intern, Video Rebirth, Dec 2026 - Present
    Working on unified audio-visual generation and VTA tasks.


Selected Publications

You can find the full list on Publications.

  • MMEdit: A Unified Framework for Multi-Type Audio Editing via Audio Language Model (ICME 2026 accepted)
    Unified editing framework for multiple audio editing operations with strong instruction following and content preservation. arXiv | Project

  • UniFlow-Audio: Unified Flow Matching for Audio Generation from Omni-Modalities (arXiv 2025)
    Unified flow-matching framework for omni-modal audio generation across aligned and non-aligned tasks. arXiv | Project


Contact


Honors

  • Zhiyuan Honors Program, Shanghai Jiao Tong University
  • AI Talents Pilot Class, Shanghai Jiao Tong University