Yuxuan Wu - Research

My research investigates how structured abstractions arise and organize themselves within learning systems. Human-like intelligence is supported by multiple languages of thought (LoT) like below, from different perceptual modalities (y-axis) and different levels of abstraction (z-axis).

Missing (Natural language is not shown here, as it is a System-2 conglomeration of all.)

My research is circled around the questions of:

How structured abstract representations can emerge from lower-level signals under principled inductive biases, a.k.a. emergent language on the higher z-axis;
How heterogeneous representations can be aligned at the function/model level for coordinated understanding, reasoning and generation, a.k.a. function alignment** across different model

These questions are closely related to the broader topics of self-supervised learning, hierarchical modeling, designed interpretability and neural-symbolic integration. I am also interested in audio/music AI and multimodality as testbeds and application.

Publications

Bridging Perceptual and Analytic Dynamics via Function Alignment
Yuxuan Wu, Gus Xia
ICLR 2026 Re-Align Workshop
Paper · Code (TBD)
Emergence of Symbolic Language from Perception through Physical Symmetry
Xuanjie Liu*, Yuxuan Wu*, Ziyu Wang, Gus Xia
Under Review
Paper (TBD) · Code (TBD)
Unsupervised Disentanglement of Content and Style via Variance-Invariance Constraints
Yuxuan Wu, Ziyu Wang, Bhiksha Raj, Gus Xia
ICLR 2025
Paper · Code · Demo
Automatic Melody Reduction via Shortest Path Finding
Ziyu Wang, Yuxuan Wu, Gus Xia, Roger B. Dannenberg
ISMIR 2025
Paper · Demo
A Closer Look at Reinforcement Learning-based Automatic Speech Recognition
Fan Yang, Muqiao Yang, Xiang Li, Yuxuan Wu, Zhiyuan Zhao, Bhiksha Raj, Rita Singh
Computer Speech and Language, 2024
Paper
Motif-Centric Representation Learning for Symbolic Music
Yuxuan Wu, Roger B. Dannenberg, Gus Xia
arXiv 2023
Paper · Code
TransPlayer: Timbre Style Transfer with Flexible Timbre Control
Yuxuan Wu, Yifan He, Xinlu Liu, Yi Wang, Roger B. Dannenberg
ICASSP 2023 (Oral)
Paper · Code · Demo
SingStyle111: A Multilingual Singing Dataset with Style Transfer
Shuqi Dai, Siqi Chen, Yuxuan Wu, Ruxin Diao, Roy Huang, Roger B. Dannenberg
ISMIR 2023
Paper · Demo
DeID-VC: Speaker De-identification via Zero-shot Pseudo Voice Conversion
Ruibin Yuan, Yuxuan Wu, Jacob Li, Jaxter Kim
INTERSPEECH 2022
Paper · Code
A Method for Music Texture Generation Based on Markov Chains
Xia Liang, Yuan Wan, Yuxuan Wu, Bilei Zhu, Zejun Ma
China Patent, 2022

Invited Talks

Emergent Language of Thought in AI: The Birth of Symbols & The Rise of Structure
Institute for Math & AI, Wuhan University — December 2025
Music Through the Lens of Machine and Intelligence
Chengdu University — July 2025
Introduction to Music AI: Programmer’s Perspective and Machine Composition
Nanjing University of the Arts — May 2025
Emergent Content-Style Disentanglement via Variance-Invariance Constraints
Sound & Music Computing Lab, National University of Singapore — April 2025
A Glimpse into Self-Supervised Music Concept Discovery
Nanjing University of the Arts — June 2024
A to I: Music Cocreation with AI
Music AI Group, Mila – Quebec AI Institute — February 2023

Teaching

I have served as a teaching assistant in undergraduate and graduate courses spanning artificial intelligence, machine learning, interpretability, and music AI. I value helping students move beyond implementation toward structural understanding of the landscape of AI and its connections to humanity and life.

MBZUAI (Teaching Assistant and Course Co-Design)

AI1010 — Introduction to AI (First undergraduate cohort)
ML8506 — Interpretable AI (First offering)
ML711 — Intermediate Music AI (First offering)
ML801 — Foundations and Advanced Topics in Machine Learning

Carnegie Mellon University (Teaching Assistant)

11-755 / 18-797 — Machine Learning for Signal Processing
15-322 / 15-622 — Introduction to Computer Music