Publication

You can also find my updated publications on my Google Scholar profile.

2025

  • Yifan Yang, Shujie Liu, Jinyu Li, Yuxuan Hu, Haibin Wu, Hui Wang, Jianwei Yu, Lingwei Meng, Haiyang Sun, Yanqing Liu, Yan Lu, Kai Yu, Xie Chen. “Pseudo-Autoregressive Neural Codec Language Models for Efficient Zero-Shot Text-to-Speech Synthesis”. In Proc. ACM MM, 2025.
  • Guanrou Yang, Chen Yang, Qian Chen, Ziyang Ma, Wenxi Chen, Wen Wang, Tianrui Wang, Yifan Yang, Zhikang Niu, Wenrui Liu, Fan Yu, Zhihao Du, Zhifu Gao, ShiLiang Zhang, Xie Chen. “Emovoice: Llm-based emotional text-to-speech model with freestyle text prompting”. In Proc. ACM MM, 2025.
  • Hanglei Zhang, Yiwei Guo, Zhihan Li, Xiang Hao, Xie Chen, Kai Yu. “Unlocking Temporal Flexibility: Neural Speech Codec with Variable Frame Rate”. In Proc. INTERSPEECH, 2025.
  • Yiwei Guo, Zhihan Li, Chenpeng Du, Hankun Wang, Xie Chen, Kai Yu. “LSCodec: Low-Bitrate and Speaker-Decoupled Discrete Speech Codec”. In Proc. INTERSPEECH, 2025.
  • Qixi Zheng, Yushen Chen, Zhikang Niu, Ziyang Ma, Xiaofei Wang, Kai Yu, Xie Chen. “Accelerating Flow-Matching-Based Text-to-Speech via Empirically Pruned Step Sampling”. In Proc. INTERSPEECH, 2025.
  • Mingyu Cui, Yifan Yang, Jiajun Deng, Jiawen Kang, Shujie Hu, Tianzi Wang, Zhaoqing Li, Shiliang Zhang, Xie Chen, Xunying Liu. “Exploring SSL Discrete Speech Features for Zipformer-based Contextual ASR”. In Proc. INTERSPEECH, 2025.
  • Jeongsoo Choi, Zhikang Niu, Ji-Hoon Kim, Chunhui Wang, Joon Son Chung, Xie Chen. “Accelerating Diffusion-based Text-to-Speech Model Training with Dual Modality Alignment”. In Proc. INTERSPEECH, 2025.
  • Jianheng Zhuo, Yifan Yang, Yiwen Shao, Yong Xu, Dong Yu, Kai Yu, Xie Chen. “VietASR: Achieving Industry-level Vietnamese ASR with 50-hour labeled data and Large-Scale Speech Pretraining”. In Proc. INTERSPEECH, 2025.
  • Yushen CHEN, Zhikang Niu, Ziyang Ma, Keqi Deng, Chunhui Wang, JianZhao, Kai Yu, Xie Chen. “F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching”. In Proc. ACL, 2025.
  • Yifan Yang, Zheshu Song, Jianheng Zhuo, Mingyu Cui, Jinpeng Li, Bo Yang, Yexing Du, Ziyang Ma, Xunying Liu, Ziyuan Wang, Ke Li, Shuai Fan, Kai Yu, Wei-Qiang Zhang, Guoguo Chen, Xie Chen. “GigaSpeech 2: An Evolving, Large-Scale and Multi-domain ASR Corpus for Low-Resource Languages with Automated Crawling, Transcription and Refinement”. In Proc. ACL, 2025.
  • Yexing Du, Youcheng Pan, Ziyang Ma, Bo Yang, Yifan Yang, Keqi Deng, Xie Chen, Yang Xiang, Ming Liu, Bing Qin. “Making LLMs Better Many-to-Many Speech-to-Text Translators with Curriculum Learning”. In Proc. ACL, 2025.
  • Keqi Deng, Wenxi Chen, Xie Chen, Phil Woodland. “SimulS2S-LLM: Unlocking Simultaneous Inference of Speech LLMs for Speech-to-Speech Translation”. In Proc. ACL, 2025.
  • Ziyang Ma, Xiquan Li, Yakun Song, Wenxi Chen, Chenpeng Du, Jian Wu, Yuanzhe Chen, Zhuo Chen, Yuping Wang, Yuxuan Wang, Xie Chen. “Towards Reliable Large Audio Language Model”. In Findings of ACL, 2025.
  • Wenxi Chen, Ziyang Ma, Ruiqi Yan, Yuzhe Liang, Xiquan Li, Ruiyang Xu, Zhikang Niu, Yanqiao Zhu, Yifan Yang, Zhanxun Liu, Kai Yu, Yuxuan Hu, Jinyu Li, Yan Lu, Shujie LIU, Xie Chen. “SLAM-Omni: Timbre-Controllable Voice Interaction System with Single-Stage Training”. In Findings of ACL, 2025.
  • Tianrui Wang, Meng Ge, Zhikang Niu, Cheng Gong, Chunyu Qiang, Haoyu Wang, Zikang Huang, Ziyang Ma, Xiaobao Wang, Xie Chen, Longbiao Wang, Jianwu Dang. “A Progressive Generation Framework with Speech Pre-trained Model for Expressive Voice Conversion”. In Proc. ICME, 2025.
  • Yifan Yang, Jianheng Zhuo, Zengrui Jin, Ziyang Ma, Xiaoyu Yang, Zengwei Yao, Liyong Guo, Wei Kang, Fangjun Kuang, Long Lin, Daniel Povey, Xie Chen. “k2SSL: A Faster and Better Framework for Self-Supervised Speech Representation Learning”. In Proc. ICME, 2025.
  • Guanrou Yang, Fan Yu, Ziyang Ma, Zhihao Du, Zhifu Gao, Shiliang Zhang, Xie Chen. “Enhancing Low-Resource ASR through Versatile TTS: Bridging the Data Gap”. In Proc. ICASSP, 2025.
  • Xiquan Li, Wenxi Chen, Ziyang Ma, Xuenan Xu, Yuzhe Liang, Zhisheng Zheng, Qiuqiang Kong, Xie Chen. “DRCap: Decoding CLAP Latents with Retrieval-Augmented Generation for Zero-shot Audio Captioning”. In Proc. ICASSP, 2025.
  • Chenpeng Du, Yiwei Guo, Hankun Wang, Yifan Yang, Zhikang Niu, Shuai Wang, Hui Zhang, Xie Chen, Kai Yu. “VALL-T: Decoder-only generative transducer for robust and decoding-controllable text-to-speech”. In Proc. ICASSP, 2025.
  • Wenxi Chen, Ziyang Ma, Xiquan Li, Xuenan Xu, Yuzhe Liang, Zhisheng Zheng, Kai Yu, Xie Chen. “SLAM-AAC: Enhancing Audio Captioning with Paraphrasing Augmentation and CLAP-Refine through LLMs”. In Proc. ICASSP, 2025.
  • Ziyang Ma, Yakun Song, Chenpeng Du, Jian Cong, Zhuo Chen, Yuping Wang, Yuxuan Wang, Xie Chen. “Language Model Can Listen While Speaking”. In Proc. AAAI, 2025.
  • Yakun Song, Zhuo Chen, Xiaofei Wang, Ziyang Ma, Xie Chen. “ELLA-V: Stable Neural Codec Language Modeling with Alignment-Guided Sequence Reordering”. In Proc. AAAI, 2025.
  • Ziyang Ma, Guanrou Yang, Yifan Yang, Zhifu Gao, Jiaming Wang, Zhihao Du, Fan Yu, Qian Chen, Siqi Zheng, ShiLiang Zhang, Xie Chen. “Speech Recognition Meets Large Language Model: Benchmarking, Models, and Exploration”. In Proc. AAAI, 2025.
  • Tao Liu, Ziyang Ma, Qi Chen, Feilong Chen, Shuai Fan, Xie Chen, Kai Yu. “VQTalker: Towards Multilingual Talking Avatars through Facial Motion Tokenization”. In Proc. AAAI, 2025.

2024

  • Zheng Liang, Ziyang Ma, Chenpeng Du, Kai Yu, Xie Chen. “E3 TTS: End-to-End Text-Based Speech Editing TTS System and Its Applications”. IEEE/ACM TASLP, 2024
  • Guanrou Yang, Ziyang Ma, Zhifu Gao, Shiliang Zhang, Xie Chen. “CTC-Assisted LLM-Based Contextual ASR”. In Proc. SLT, 2024
  • Zhikang Niu, Sanyuan Chen, Long Zhou, Ziyang Ma, Xie Chen, Shujie Liu. “NDVQ: Robust neural audio codec with normal distribution-based vector quantization”. In Proc. SLT, 2024
  • Hankun Wang, Chenpeng Du, Yiwei Guo, Shuai Wang, Xie Chen, Kai Yu. “Attention-Constrained Inference for Robust Decoder-Only Text-to-Speech”. In Proc. SLT, 2024
  • Tao Liu, Feilong Chen, Shuai Fan, Chenpeng Du, Qi Chen, Xie Chen, Kai Yu. “AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding”. In Proc. ACM MM, 2024
  • Bohan Li, Feiyu Shen, Yiwei Guo, Shuai Wang, Xie Chen, Kai Yu. “On the Effectiveness of Acoustic BPE in Decoder-Only TTS”. In Proc. INTERSPEECH, 2024
  • Yakun Song, Zhuo Chen, Xiaofei Wang, Ziyang Ma, Guanrou Yang, Xie Chen. “TacoLM: GaTed Attention Equipped Codec Language Model are Efficient Zero-Shot Text to Speech Synthesizers”. In Proc. INTERSPEECH, 2024
  • Anbai Jiang, Bing Han, Zhiqiang Lv, Yufeng Deng, Wei-Qiang Zhang, Xie Chen, Yanmin Qian, Jia Liu, Pingyi Fan. “AnoPatch: Towards Better Consistency in Machine Anomalous Sound Detection”. In Proc. INTERSPEECH, 2024
  • Xuankai Chang, Jiatong Shi, Jinchuan Tian, Yuning Wu, Yuxun Tang, Yihan Wu, Shinji Watanabe, Yossi Adi, Xie Chen, Qin Jin. “The Interspeech 2024 Challenge on Speech Processing Using Discrete Units”. In Proc. INTERSPEECH, 2024
  • Ziyang Ma, Mingjie Chen, Hezhao Zhang, Zhisheng Zheng, Wenxi Chen, Xiquan Li, Jiaxin Ye, Xie Chen, Thomas Hain. “EmoBox: Multilingual Multi-corpus Speech Emotion Recognition Toolkit and Benchmark”. In Proc. INTERSPEECH, 2024
  • Guanrou Yang, Ziyang Ma, Fan Yu, Zhifu Gao, Shiliang Zhang, Xie Chen. “MaLa-ASR: Multimedia-Assisted LLM-Based ASR”. In Proc. INTERSPEECH, 2024
  • Mingjie Chen, Hezhao Zhang, Yuanchao Li, Jiachen Luo, Wen Wu, Ziyang Ma, Peter Bell, Catherine Lai, Joshua Reiss, Lin Wang, Philip Woodland, Xie Chen, Huy Phan, Thomas Hain. “1st Place Solution to Odyssey Emotion Recognition Challenge Task1: Tackling Class Imbalance Problem”. Odyssey 2024
  • Wen Huang, Anbai Jiang, Bing Han, Xinhu Zheng, Yihong Qiu, Wenxi Chen, Yuzhe Liang, Pingyi Fan, Wei-Qiang Zhang, Cheng Lu, Xie Chen, Jia Liu, Yanmin Qian. “Semi-supervised Acoustic Scene Classification with Test-Time Adaptation”. In ICME Workshop, 2024
  • Yuzhe Liang, Wenxi Chen, Yihong Qiu, Xinhu Zheng, Boyuan Chen, Jia Liu, Wei-Qiang Zhang, Cheng Lu, Xie Chen. “Improving Acoustic Scene Classification via Self-Supervised and Semi-Supervised Learning with Efficient Audio Transformer”. In ICME Workshop, 2024
  • Zheshu Song, Jianheng Zhuo, Yifan Yang, Ziyang Ma, Shixiong Zhang, Xie Chen. “LoRA-Whisper: Parameter-Efficient and Extensible Multilingual ASR”. In Proc. INTERSPEECH, 2024
  • Peng Wang, Yifan Yang, Zheng Liang, Tian Tan, Shiliang Zhang, Xie Chen. “Incorporating Class-based Language Model for Named Entity Recognition in Factorized Neural Transducer”. In Proc. INTERSPEECH, 2024
  • Junzhe Liu, Jianwei Yu, Xie Chen. “Improved Factorized Neural Transducer Model For text-only Domain Adaptation”. In Proc. INTERSPEECH, 2024
  • Ziyang Ma, Zhisheng Zheng, Jiaxin Ye, Jinchao Li, Zhifu Gao, Shiliang Zhang, Xie Chen. “emotion2vec: Self-supervised pre-training for speech emotion representation”. In Findings of ACL, 2024
  • Jinpeng Li, Xie Chen, Weiqiang Zhang. “Exploring Generation of Pronunciation Lexicon for Low-Resource Language Automatic Speech Recognition Based on Generic Phone Recognizer”. Journal of Shanghai Jiaotong University (Science), 2024
  • Junjie Li, Yiwei Guo, Xie Chen, Kai Yu. “SEF-VC: Speaker Embedding Free Zero-Shot Voice Conversion with Cross Attention”. In Proc. ICASSP, 2024
  • Sen Liu, Yiwei Guo, Xie Chen, Kai Yu. “StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotations”. In Proc. ICASSP, 2024
  • Feiyu Shen, Yiwei Guo, Chenpeng Du, Xie Chen, Kai Yu. “Acoustic BPE for speech generation with discrete tokens”. In Proc. ICASSP, 2024
  • Ziyang Ma, Wen Wu, Zhisheng Zheng, Yiwei Guo, Qian Chen, Shiliang Zhang, Xie Chen. “Leveraging speech PTM, text LLM, and emotional TTS for speech emotion recognition”. In Proc. ICASSP, 2024
  • Yifan Yang, Feiyu Shen, Chenpeng Du, Ziyang Ma, Kai Yu, Daniel Povey, Xie Chen. “Towards universal speech discrete tokens: A case study for ASR and TTS”. In Proc. ICASSP, 2024
  • Yiwei Guo, Chenpeng Du, Ziyang Ma, Xie Chen, Kai Yu. “Voiceflow: Efficient text-to-speech with rectified flow matching”. In Proc. ICASSP, 2024
  • Chenpeng Du, Yiwei Guo, Feiyu Shen, Zhijun Liu, Zheng Liang, Xie Chen, Shuai Wang, Hui Zhang, Kai Yu. “UniCATS: A unified context-aware text-to-speech framework with contextual VQ-diffusion and vocoding”. In Proc. AAAI, 2024
  • Zhisheng Zheng, Puyuan Peng, Ziyang Ma, Xie Chen, Eunsol Choi, David Harwath. “BAT: Learning to Reason about Spatial Sounds with Large Language Models”. In Proc. ICML, 2024
  • Xun Gong, Yu Wu, Jinyu Li, Shujie Liu, Rui Zhao, Xie Chen, Yanmin Qian. “Advanced long-content speech recognition with factorized neural transducer”. IEEE/ACM TASLP, 2024
  • Wenxi Chen, Yuzhe Liang, Ziyang Ma, Zhisheng Zheng, Xie Chen. “EAT: Self-supervised pre-training with efficient audio transformer”. In Proc. IJCAI, 2024

2023

  • Guanrou Yang, Ziyang Ma, Zhisheng Zheng, Yakun Song, Zhikang Niu, Xie Chen. “Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning”. In Proc. ASRU, 2023.
  • Yujin Wang, Changli Tang, Ziyang Ma, Zhisheng Zheng, Xie Chen, Wei-Qiang Zhang. “Exploring Effective Distillation of Self-Supervised Speech Models for Automatic Speech Recognition”. In Proc. ASRU, 2023.
  • Chenpeng Du, Yiwei Guo, Xie Chen, Kai Yu. “Speaker Adaptive Text-to-Speech with Timbre-Normalized Vector-Quantized Feature”. In IEEE/ACM TASLP, 2023.
  • Chenpeng Du, Qi Chen, Tianyu He, Xu Tan, Xie Chen, Kai Yu, Sheng Zhao, Jiang Bian. “DAE-Talker: High Fidelity Speech-Driven Talking Face Generation with Diffusion Autoencoder”. In Proc. ACM MM, 2023.
  • Zheng Liang, Zheshu Song, Ziyang Ma, Chenpeng Du, Kai Yu, Xie Chen. “Improving Code-Switching and Name Entity Recognition in ASR with Speech Editing based Data Augmentation”. In Proc. INTERSPEECH, 2023.
  • Ziyang Ma, Zhisheng Zheng, Changli Tang, Yujin Wang, Xie Chen. “MT4SSL: Boosting Self-Supervised Speech Representation Learning by Integrating Multiple Targets”. In Proc. INTERSPEECH, 2023.
  • Yifan Yang, Xiaoyu Yang, Liyong Guo, Zengwei Yao, Wei Kang, Fangjun Kuang, Long Lin, Xie Chen, Daniel Povey. “Blank-regularized CTC for Frame Skipping in Neural Transducer”. In Proc. INTERSPEECH, 2023.
  • Ziyang Ma, Zhisheng Zheng, Guanrou Yang, Yu Wang, Chao Zhang, Xie Chen. “Pushing the Limits of Unsupervised Unit Discovery for SSL Speech Representation”. In Proc. INTERSPEECH, 2023.
  • Zhisheng Zheng, Ziyang Ma, Yu Wang, Xie Chen. “Unsupervised Active Learning: Optimizing Labeling Cost-Effectiveness for Automatic Speech Recognition”. In Proc. INTERSPEECH, 2023.
  • Mingyu Cui, Jiawen Kang, Jiajun Deng, Xi Yin, Yutao Xie, Xie Chen, Xunying Liu. “Towards Effective and Compact Contextual Representation for Conformer Transducer Speech Recognition Systems”. In Proc. INTERSPEECH, 2023.
  • Sen Liu, Yiwei Guo, Chengpeng Du, Xie Chen, Kai Yu. “DSE-TTS: Dual Speaker Embedding for Cross-Lingual Text-to-Speech”. In Proc. INTERSPEECH, 2023.
  • Tianrui Wang, Xie Chen, Zhuo Chen, Shu Yu, Weibin Zhu. “An Adapter Based Multi-Label Pre-Training for Speech Separation and Enhancement”. Proc. ICASSP, 2023.
  • Xun Gong, Wei Wang, Hang Shao, Xie Chen, Yanmin Qian. “Factorized AED: Factorized Attention-Based Encoder-Decoder for Text-Only Domain Adaptive ASR”. Proc. ICASSP, 2023.
  • Yiwei Guo, Chenpeng Du, Xie Chen, Kai Yu. “Emodiff: Intensity Controllable Emotional Text-to-Speech with Soft-Label Guidance”. Proc. ICASSP, 2023.
  • Xun Gong, Yu Wu, Jinyu Li, Shujie Liu, Rui Zhao, Xie Chen, Yanmin Qian. “LongFNT: Long-Form Speech Recognition with Factorized Neural Transducer”. Proc. ICASSP, 2023.
  • Xie Chen, Ziyang Ma, Changli Tang, Yujin Wang, Zhisheng Zheng. “Front-End Adapter: Adapting Front-End Input of Speech Based Self-Supervised Learning for Speech Recognition”. Proc. ICASSP, 2023.
  • Qi Chen, Ziyang Ma, Tao Liu, Xu Tan, Qu Lu, Kai Yu, Xie Chen. “Improving Few-Shot Learning for Talking Face System with TTS Data Augmentation”. Proc. ICASSP, 2023.

2022

  • Z Meng, Y Gaur, N Kanda, J Li, X Chen, Y Wu, Y Gong. “Internal language model adaptation with text-only data for end-to-end speech recognition”. In Proc. INTERSPEECH, 2022
  • Chenpeng Du, Yiwei Guo, Xie Chen, Kai Yu. “VQTTS: High-Fidelity Text-to-Speech Synthesis with Self-Supervised VQ Acoustic Feature”. In Proc. INTERSPEECH, 2022
  • Xie Chen, Zhong Meng, S Parthasarathy, Jinyu Li. “Factorized neural transducer for efficient language model adaptation”. In Proc. ICASSP, 2022

2021 and Before

  • D Narayanan, A Phanishayee, K Shi, X Chen, M Zaharia. “Memory-efficient pipeline-parallel DNN training”. In Proc. ICML, 2021
  • Y Deng, R Zhao, Z Meng, X Chen, B Liu, J Li, Y Gong, L He. “Improving RNN-T for Domain Scaling Using Semi-Supervised Training with Neural TTS”. In Proc. INTERSPEECH, 2021
  • X. Chen, Y. Wu, Z. Wang, S. Liu, J. Li. “Developing Real-time Streaming Transformer Transducer for Speech Recognition on Large-scale Dataset”. In Proc. ICASSP, 2021
  • Z. Meng, N. Kanda, Y. Gaur, S. Parthasarathy, E. Sun, L. Lu, X. Chen, J. Li, Y. Gong. “Internal Language Model Training for Domain-Adaptive End-to-End Speech Recognition”. In Proc. IEEE ICASSP, 2021
  • Z. Meng, S. Parthasarathy, E. Sun, Y. Gaur, N. Kanda, L. Lu, X. Chen, R. Zhao, J. Li, Y. Gong. “Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition”. In Proc. IEEE SLT, 2020
  • X. Chen, S. Parthasarathy, W. Gale, S. Chang, M. Zeng. “LSTM-LM with Long-Term History for First-Pass Decoding in Conversational Speech Recognition”. arXiv preprint arXiv:2010.11349, 2020
  • J. Xu, X. Chen, S. Hu, J. Yu, X. Liu, H. Meng. “Low-bit Quantization of Recurrent Neural Network Language Models Using Alternating Direction Methods of Multipliers”. In Proc. ICASSP, 2020
  • X. Chen, X. Liu, Y. Wang, A. Ragni, M. Gales. “Exploiting Future Word Contexts in Neural Network Language Model”. IEEE/ACM TASLP, 2019
  • S. Parthasarathy, W. Gale, X. Chen, G. Polovets, S. Chang. “Long-span language modeling for speech recognition”. arXiv preprint arXiv:1911.04571, 2019
  • X. Chen, J. Zhang, T. Anastasakos, F. Alleva. “Investigation of Sampling Techniques for Maximum Entropy Language Modeling Training”. In Proc. ICASSP, 2019
  • M. Lam, X. Chen, S. Hu, J. Yu, X. Liu, H. Meng. “Gaussian Process LSTM Recurrent Neural Network Language Models for Speech Recognition”. In Proc. ICASSP, 2019
  • J. Yu, M. Lam, X. Chen, S. Hu, S. Liu, X. Wu, X. Liu, H. Meng. “Recurrent Neural Network Language Models Training using Natural Gradient”. In Proc. ICASSP, 2019
  • O. Chen, A. Ragni, M.J.F. Gales, X. Chen. “Active Memory Networks for Language Modeling”. In Proc. INTERSPEECH, 2018
  • M. Zhang, X. Chen, R. Cummins, Q. Andersen, T. Briscoe. “The Effect of Adding Authorship Knowledge in Automated Text Scoring”. In Workshop of BEA, NAACL, 2018
  • X. Liu, S. Liu, J. Sha, J. Yu, Z Xu, X. Chen, H. Meng. “Limited-memory BFGS Optimization of Recurrent Neural Network Language Models For Speech Recognition”. In Proc. ICASSP, 2018
  • Y. Wang, X. Chen, M.J.F. Gales, A. Ragni, J. Wong. “Phonetic and Graphemic Systems for Multi-Genre Broadcast Transcription”. In Proc. ICASSP, 2018
  • H. Xu, K. Li, Y. Wang, J. Wang, S. Kang, X. Chen, D. Povey, S. Khudanpur. “Neural Network Language Modeling with Letter-based Features and Importance Sampling”. In Proc. ICASSP, 2018
  • X. Chen, X. Liu, A. Ragni, Y. Wang, M.J.F. Gales. “Future Word Context in Neural Network Language Model”. In Proc. ASRU, 2017
  • X. Chen, A. Ragni, X. Liu, M.J.F. Gales. “Investigating Bidirectional Recurrent Neural Network Language Models for Speech Recognition”. In Proc. INTERSPEECH, 2017
  • X. Chen, A. Ragni, J. Vasilakes, X. Liu, K. Knill, M.J.F. Gales. “Recurrent Neural Network Language Models for Keyword Search”. In Proc. ICASSP, 2017
  • X. Chen, X. Liu, Y. Wang, M.J.F. Gales, P.C. Woodland. “Efficient Training and Evaluation of Recurrent Neural Network Language Models for Speech Recognition”. IEEE/ACM TASLP, 2016
  • X. Liu, X. Chen, Y. Wang, M.J.F. Gales, P.C. Woodland. “Two Efficient Lattice Rescoring Methods Using Recurrent Neural Network Language Models”. IEEE/ACM TASLP, 2016
  • A. Ragni, E. Dakin, X. Chen, M.J.F. Gales, K.M. Knill. “Multi-Language Neural Network Language Models”. In Proc. INTERSPEECH, 2016
  • X. Chen, X. Liu, Y. Qian, M.J.F. Gales, P.C. Woodland. “CUED-RNNLM: An Open-Source Toolkit for Efficient Training and Evaluation of Recurrent Neural Network Language Models”. In Proc. ICASSP, 2016
  • X. Chen, X. Liu, M.J.F. Gales, P.C. Woodland. “Investigation of back-off based interpolation between Recurrent Neural Network and N-Gram Language Models”. In Proc. ASRU, 2015
  • X. Chen, T. Tan, X. Liu, P. Lancheantin, M. Wan, M.J.F. Gales, P.C. Woodland. “Recurrent Neural Network Language Model Adaptation for Multi-Genre Broadcast Speech Recognition”. In Proc. INTERSPEECH, 2015
  • X. Chen, X. Liu, M.J.F. Gales, P.C. Woodland. “Improving the Training and Evaluation Efficiency of Recurrent Neural Network Language Models”. In Proc. ICASSP, 2015.
  • X. Chen, X. Liu, M.J.F. Gales, P.C. Woodland. “Recurrent Neural Network Language Model Training with Noise Contrastive Estimation for Speech Recognition”. In Proc. ICASSP, 2015.
  • X. Liu, X. Chen, M.J.F. Gales, P.C. Woodland. “Paraphrastic Recurrent Neural Network Language Models”. In Proc. ICASSP, 2015.
  • T. Drugman, Y. Stylianou, L. Chen, X. Chen, M.J.F. Gales. “Robust Excitation-based Feature for Automatic Speech Recognition”. In Proc. ICASSP, 2015.
  • X. Chen, M.J.F. Gales, K. Knill et al. “An Initial Investigation of Long-Term Adaptation for Meeting Transcription”. In Proc. INTERSPEECH, 2014.
  • X. Chen, Y. Wang, X. Liu, M.J.F. Gales, P.C. Woodland. “Efficient GPU-based Training of Recurrent Neural Network Language Models Using Spliced Sentence Bunch”. In Proc. INTERSPEECH, 2014.
  • X. Liu, Y. Wang, X. Chen, M.J.F. Gales, P.C. Woodland. “Efficient Lattice Rescoring Using Recurrent Neural Network Language Models”. In Proc. ICASSP, 2014.
  • T. Yoshioka, X. Chen, M.J.F. Gales. “Impact of Single-Microphone Dereverberation on DNN-based Meeting Transcription Systems”. In Proc. ICASSP, 2014.
  • J. Liu, X. Chen, Y. Shan, Y. Shi. “Construction of a Compact Dynamic Decoder Network for Large Vocabulary Continuous Speech Recognition”. Tsinghua Journal of Chinese Studies, 2012.
  • Y. Shan, X. Chen, Y. Shi, J. Liu. “Fast Language Model Look-ahead Algorithm Using Extended N-gram Model”. ACTA AUTOMATICA SINICA, 2012.
  • X. Chen, A. Eversol, D. Yu, F. Seide. “Pipelined Back-Propagation for Context-Dependent Deep Neural Networks”. In Proc. INTERSPEECH, 2012.
  • X. Chen, Y. Shan, X. Zhang, J. Liu. “An Efficient Layer-wised Beam Pruning Algorithm for Large Vocabulary Continuous Speech Recognition System”. In Proc. ICALIP, 2012.
  • F. Seide, G. Li, X. Chen, D. Yu. “Feature Engineering in Context-Dependent Deep Neural Networks for Conversational Speech Transcription”. In Proc. ASRU, 2011.