Publications

An up-to-date list of my publications is available on my Google Scholar profile.

An Adapter Based Multi-Label Pre-Training for Speech Separation and Enhancement
T Wang, X Chen, Z Chen, S Yu, W Zhu
Proc. ICASSP, 2023

Factorized AED: Factorized Attention-Based Encoder-Decoder for Text-Only Domain Adaptive ASR
X Gong, W Wang, H Shao, X Chen, Y Qian
Proc. ICASSP, 2023

EmoDiff: Intensity Controllable Emotional Text-to-Speech with Soft-Label Guidance
Y Guo, C Du, X Chen, K Yu
Proc. ICASSP, 2023

LongFNT: Long-Form Speech Recognition with Factorized Neural Transducer
X Gong, Y Wu, J Li, S Liu, R Zhao, X Chen, Y Qian
Proc. ICASSP, 2023

Front-End Adapter: Adapting Front-End Input of Speech Based Self-Supervised Learning for Speech Recognition
X Chen, Z Ma, C Tang, Y Wang, Z Zheng
Proc. ICASSP, 2023

Improving Few-Shot Learning for Talking Face System with TTS Data Augmentation
Q Chen, Z Ma, T Liu, X Tan, Q Lu, K Yu, X Chen
Proc. ICASSP, 2023

Internal Language Model Adaptation with Text-Only Data for End-to-End Speech Recognition
Z Meng, Y Gaur, N Kanda, J Li, X Chen, Y Wu, Y Gong
Proc. INTERSPEECH, 2022

VQTTS: High-Fidelity Text-to-Speech Synthesis with Self-Supervised VQ Acoustic Feature
C Du, Y Guo, X Chen, K Yu
Proc. INTERSPEECH, 2022

Factorized Neural Transducer for Efficient Language Model Adaptation
X Chen, Z Meng, S Parthasarathy, J Li
Proc. ICASSP, 2022

2021 and Before

Memory-Efficient Pipeline-Parallel DNN Training
D. Narayanan, A. Phanishayee, K. Shi, X. Chen, M. Zaharia
Proc. ICML, 2021

Developing Real-time Streaming Transformer Transducer for Speech Recognition on Large-scale Dataset
X. Chen, Y. Wu, Z. Wang, S. Liu, J. Li
Proc. ICASSP, 2021

Internal Language Model Training for Domain-Adaptive End-to-End Speech Recognition
Z. Meng, N. Kanda, Y. Gaur, S. Parthasarathy, E. Sun, L. Lu, X. Chen, J. Li, Y. Gong
Proc. IEEE ICASSP, 2021

Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition
Z. Meng, S. Parthasarathy, E. Sun, Y. Gaur, N. Kanda, L. Lu, X. Chen, R. Zhao, J. Li, Y. Gong
Proc. IEEE SLT, 2020

LSTM-LM with Long-Term History for First-Pass Decoding in Conversational Speech Recognition
X. Chen, S. Parthasarathy, W. Gale, S. Chang, M. Zeng
arXiv preprint arXiv:2010.11349, 2020

Low-bit Quantization of Recurrent Neural Network Language Models Using Alternating Direction Methods of Multipliers
J. Xu, X. Chen, S. Hu, J. Yu, X. Liu, H. Meng
Proceedings of ICASSP, 2020

Exploiting Future Word Contexts in Neural Network Language Model
X. Chen, X. Liu, Y. Wang, A. Ragni, M. Gales
IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP), 2019

Long-Span Language Modeling for Speech Recognition
S. Parthasarathy, W. Gale, X. Chen, G. Polovets, S. Chang
arXiv preprint arXiv:1911.04571, 2019

Investigation of Sampling Techniques for Maximum Entropy Language Modeling Training
X. Chen, J. Zhang, T. Anastasakos, F. Alleva
Proceedings of ICASSP, 2019

Gaussian Process LSTM Recurrent Neural Network Language Models for Speech Recognition
M. Lam, X. Chen, S. Hu, J. Yu, X. Liu, H. Meng
Proceedings of ICASSP, 2019

Recurrent Neural Network Language Models Training using Natural Gradient
J. Yu, M. Lam, X. Chen, S. Hu, S. Liu, X. Wu, X. Liu, H. Meng
Proceedings of ICASSP, 2019

Active Memory Networks for Language Modeling
O. Chen, A. Ragni, M.J.F. Gales and X. Chen
Proceedings of INTERSPEECH, 2018

The Effect of Adding Authorship Knowledge in Automated Text Scoring
M. Zhang, X. Chen, R. Cummins, Q. Andersen and T. Briscoe
BEA Workshop at NAACL, 2018

Limited-memory BFGS Optimization of Recurrent Neural Network Language Models For Speech Recognition
X. Liu, S. Liu, J. Sha, J. Yu, Z. Xu, X. Chen, H. Meng
Proceedings of ICASSP, 2018

Phonetic and Graphemic Systems for Multi-Genre Broadcast Transcription
Y. Wang, X. Chen, M.J.F. Gales, A. Ragni, J. Wong
Proceedings of ICASSP, 2018

Neural Network Language Modeling with Letter-based Features and Importance Sampling
H. Xu, K. Li, Y. Wang, J. Wang, S. Kang, X. Chen, D. Povey, S. Khudanpur
Proceedings of ICASSP, 2018

Future Word Context in Neural Network Language Model
X. Chen, X. Liu, A. Ragni, Y. Wang, M.J.F. Gales
Proceedings of ASRU, 2017

Investigating Bidirectional Recurrent Neural Network Language Models for Speech Recognition
X. Chen, A. Ragni, X. Liu, M.J.F. Gales
Proceedings of INTERSPEECH, 2017

Recurrent Neural Network Language Models for Keyword Search
X. Chen, A. Ragni, J. Vasilakes, X. Liu, K. Knill, M.J.F. Gales
Proceedings of ICASSP, 2017

Efficient Training and Evaluation of Recurrent Neural Network Language Models for Speech Recognition
X. Chen, X. Liu, Y. Wang, M.J.F. Gales and P.C. Woodland
IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP), 2016

Two Efficient Lattice Rescoring Methods Using Recurrent Neural Network Language Models
X. Liu, X. Chen, Y. Wang, M.J.F. Gales and P.C. Woodland
IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP), 2016

Multi-Language Neural Network Language Models
A. Ragni, E. Dakin, X. Chen, M.J.F. Gales and K.M. Knill
Proceedings of INTERSPEECH, 2016

CUED-RNNLM – An Open-Source Toolkit for Efficient Training and Evaluation of Recurrent Neural Network Language Models
X. Chen, X. Liu, Y. Qian, M.J.F. Gales and P.C. Woodland
Proceedings of ICASSP, 2016

Investigation of back-off based interpolation between Recurrent Neural Network and N-Gram Language Models
X. Chen, X. Liu, M.J.F. Gales and P.C. Woodland
Proceedings of ASRU, 2015

Recurrent Neural Network Language Model Adaptation for Multi-Genre Broadcast Speech Recognition
X. Chen, T. Tan, X. Liu, P. Lanchantin, M. Wan, M.J.F. Gales and P.C. Woodland
Proceedings of INTERSPEECH, 2015

Improving the Training and Evaluation Efficiency of Recurrent Neural Network Language Models
X. Chen, X. Liu, M.J.F. Gales, P.C. Woodland
Proceedings of ICASSP, 2015

Recurrent Neural Network Language Model Training with Noise Contrastive Estimation for Speech Recognition
X. Chen, X. Liu, M.J.F. Gales, P.C. Woodland
Proceedings of ICASSP, 2015

Paraphrastic Recurrent Neural Network Language Models
X. Liu, X. Chen, M.J.F. Gales, P.C. Woodland
Proceedings of ICASSP, 2015

Robust Excitation-based Feature for Automatic Speech Recognition
T. Drugman, Y. Stylianou, L. Chen, X. Chen, M.J.F. Gales
Proceedings of ICASSP, 2015

An Initial Investigation of Long-Term Adaptation for Meeting Transcription
X. Chen, M.J.F. Gales, K. Knill et al.
Proceedings of INTERSPEECH, 2014

Efficient GPU-based Training of Recurrent Neural Network Language Models Using Spliced Sentence Bunch
X. Chen, Y. Wang, X. Liu, M.J.F. Gales and P.C. Woodland
Proceedings of INTERSPEECH, 2014

Efficient Lattice Rescoring Using Recurrent Neural Network Language Models
X. Liu, Y. Wang, X. Chen, M.J.F. Gales and P.C. Woodland
Proceedings of ICASSP, 2014

Impact of Single-Microphone Dereverberation on DNN-based Meeting Transcription Systems
T. Yoshioka, X. Chen, and M.J.F. Gales
Proceedings of ICASSP, 2014

Construction of a Compact Dynamic Decoder Network for Large Vocabulary Continuous Speech Recognition
J. Liu, X. Chen, Y. Shan and Y. Shi
Tsinghua Journal of Chinese Studies, 2012

Fast Language Model Look-ahead Algorithm Using Extended N-gram Model
Y. Shan, X. Chen, Y. Shi and J. Liu
Acta Automatica Sinica, 2012

Pipelined Back-Propagation for Context-Dependent Deep Neural Networks
X. Chen, A. Eversole, D. Yu and F. Seide
Proceedings of INTERSPEECH, 2012

An Efficient Layer-wised Beam Pruning Algorithm for Large Vocabulary Continuous Speech Recognition System
X. Chen, Y. Shan, X. Zhang, J. Liu
Proceedings of ICALIP, 2012

Feature Engineering in Context-Dependent Deep Neural Networks for Conversational Speech Transcription
F. Seide, G. Li, X. Chen and D. Yu
Proceedings of ASRU, 2011