Publications
The asterisk * next to the author’s name indicates co-first authorship.
A Large-Scale Evaluation of Speech Foundation Models
Shu-wen Yang, Heng-Jui Chang, Zili Huang, Andy T. Liu, Cheng-I Lai, Haibin Wu, Jiatong Shi, Xuankai Chang, Hsiang-Sheng Tsai, Wen-Chin Huang, Tzu-hsun Feng, Po-Han Chi, Yist Y. Lin, Yung-Sung Chuang, Tzu-Hsien Huang, Wei-Cheng Tseng, Kushal Lakhotia, Shang-Wen Li, Abdelrahman Mohamed, Shinji Watanabe, Hung-yi Lee
in IEEE/ACM Transactions on Audio Speech and Language Processing, 2024
arxiv (preferred) / ieee / codeSUPERB @ SLT 2022: Challenge on Generalization and Efficiency of Self-Supervised Speech Representation Learning
Tzu-hsun Feng, Annie Dong, Ching-Feng Yeh, Shu-wen Yang, Tzu-Quan Lin, Jiatong Shi, Kai-Wei Chang, Zili Huang, Haibin Wu, Xuankai Chang, Shinji Watanabe, Abdelrahman Mohamed, Shang-Wen Li, Hung-yi Lee
in SLT, 2022
arxiv / code / websiteA Comparative Study of Self-Supervised Speech Representation Based Voice Conversion
Wen-Chin Huang, Shu-wen Yang, Tomoki Hayashi, Tomoki Toda
in IEEE Journal of Selected Topics in Signal Processing, 2022
arxiv / codeSelf-supervised Representation Learning for Speech Processing
Hung-yi Lee, Abdelrahman Mohamed, Shinji Watanabe, Tara Sainath, Karen Livescu, Shang-Wen Li, Shu-wen Yang, Katrin Kirchhoff
in NAACL, 2022
tutorial proposal / videoInvestigating Self-Supervised Learning for Speech Enhancement and Separation
Zili Huang, Shinji Watanabe, Shu-wen Yang, Paola Garcia, Sanjeev Khudanpur
in ICASSP, 2022
arxiv / codeDistilHuBERT: Speech Representation Learning by Layer-wise Distillation of Hidden-unit BERT
Heng-Jui Chang, Shu-wen Yang, Hung-yi Lee
in ICASSP, 2022
arxiv / code / huggingfaceS3PRL-VC: Open-Source Voice Conversion Framework with Self-Supervised Speech Representations
Wen-Chin Huang, Shu-Wen Yang, Tomoki Hayashi, Hung-Yi Lee, Shinji Watanabe, Tomoki Toda
in ICASSP, 2022
arxiv / code / demoSUPERB-SG: Enhanced Speech processing Universal PERformance Benchmark for Semantic and Generative Capabilities
Hsiang-Sheng Tsai*, Heng-Jui Chang*, Wen-Chin Huang*, Zili Huang*, Kushal Lakhotia*, Shu-wen Yang, Shuyan Dong, Andy T. Liu, Cheng-I Jeff Lai, Jiatong Shi, Xuankai Chang, Phil Hall, Hsuan-Jui Chen, Shang-Wen Li, Shinji Watanabe, Abdelrahman Mohamed, Hung-yi Lee
in ACL, 2022
arxiv / video / website / codeDUAL: Discrete Spoken Unit Adaptive Learning for Textless Spoken Question Answering
Guan-Ting Lin, Yung-Sung Chuang, Ho-Lam Chung, Shu-wen Yang, Hsuan-Jui Chen, Shuyan Dong, Shang-Wen Li, Abdelrahman Mohamed, Hung-yi Lee, Lin-shan Lee
in Interspeech, 2022
arxivAn Exploration of Self-Supervised Pretrained Representations for End-to-End Speech Recognition
Xuankai Chang, Takashi Maekaku, Pengcheng Guo, Jing Shi, Yen-Ju Lu, Aswin Shanmugam Subramanian, Tianzi Wang, Shu-wen Yang, Yu Tsao, Hung-yi Lee, Shinji Watanabe
in ASRU, 2021
arxiv / codeSUPERB: Speech processing Universal PERformance Benchmark
Shu-wen Yang, Po-Han Chi, Yung-Sung Chuang, Cheng-I Jeff Lai, Kushal Lakhotia, Yist Y. Lin, Andy T. Liu, Jiatong Shi, Xuankai Chang, Guan-Ting Lin, Tzu-Hsien Huang, Wei-Cheng Tseng, Ko-tik Lee, Da-Rong Liu, Zili Huang, Shuyan Dong, Shang-Wen Li, Shinji Watanabe, Abdelrahman Mohamed, Hung-yi Lee
in Interspeech, 2021
arxiv / video / website / codeS3PRL: The Self-Supervised Speech Pre-training and Representation Learning Toolkit
Andy T Liu*, Shu-wen Yang*
on GitHub repository, 2020
code / website / videoUnderstanding Self-Attention of Self-Supervised Audio Transformers
Shu-wen Yang, Andy T Liu, Hung-yi Lee
in Interspeech, 2020
in ICML Workshop on Self-supervision in Audio and Speech, 2020
arxiv / videoMockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer Encoders
Andy T Liu, Shu-wen Yang, Po-Han Chi, Po-chun Hsu, Hung-yi Lee
in ICASSP, 2020
arxiv / code / video