Mark John Francis Gales

Thesis, Book Chapters and Magazines

1. S. Furui, L. Deng, M. Gales, H. Ney, and K. Tokuda, Fundamental technologies in modern speech recognition [from the guest editors], IEEE Signal Processing Magazine, vol. 29, pp , November
2. M. J. F. Gales, S. Watanabe, and E. Fosler-Lussier, Structured discriminative models for speech recognition: An overview, IEEE Signal Processing Magazine, vol. 29, pp , November
3. S. Matsoukas, T. Ng, L. Nguyen, F. Diehl, M. J. F. Gales, X. Liu, P. C. Woodland, J.-L. Gauvain, L. Lamel, A. Messaoudi, et al., Handbook of Natural Language Processing and Machine Translation: DARPA Global Autonomous Language Exploitation, ch. Optimizing Speech-To-Text System Combination for Machine Translation. Springer,
4. M. Tomalin, W. Byrne, A. de Gispert, M. Gales, X. Liu, P. Woodland, et al., Handbook of Natural Language Processing and Machine Translation: DARPA Global Autonomous Language Exploitation, ch. Improving Speech Transcription for Machine Translation. Springer,
5. M. J. F. Gales, Robust Speech Recognition of Uncertain or Missing Data, ch. Model-Based Approaches to Handling Uncertainty. Springer,
6. M. J. F. Gales, Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, ch. Augmented Statistical Models: Using Dynamic Kernels for Acoustic Models.
7. M. J. F. Gales and S. J. Young, The application of hidden Markov models in speech recognition, Foundations and Trends in Signal Processing, vol. 1, no. 3, pp ,
8. S. J. Young, G. Evermann, M. J. F. Gales, D. Kershaw, X. Liu, G. Moore, J. Odell, D. Ollason, D. Povey, V. Valtchev, and P. Woodland, The HTK Book (for HTK Version 3.4). Cambridge University Engineering Department, December
9. M. J. F. Gales, Model-Based Techniques for Noise Robust Speech Recognition. PhD thesis, Cambridge University Engineering Department, September

Journal Papers and Letters

1. Y. Wang, M. J. F. Gales, K. M. Knill, K. Kyriakopoulos, A. Malinin, R. C. van Dalen, and M. Rashid, Towards automatic assessment of spontaneous spoken English, Speech Communication, vol. 104, pp ,
2. G. Degottex, P. Lanchantin, and M. J. F. Gales, A log domain pulse model for parametric speech synthesis, IEEE/ACM Trans. Audio, Speech & Language Processing, vol. 26, no. 1, pp ,
3. C. Wu, M. J. F. Gales, A. Ragni, P. Karanasou, and K. C. Sim, Improving interpretability and regularization in deep learning, IEEE/ACM Trans. Audio, Speech & Language Processing, vol. 26, no. 2, pp ,
4. P. Karanasou, C. Wu, M. J. F. Gales, and P. C. Woodland, I-vectors and structured neural networks for rapid adaptation of acoustic models, IEEE/ACM Trans. Audio, Speech & Language Processing, vol. 25, no. 4, pp ,
5. X. Liu, X. Chen, Y. Wang, M. J. F. Gales, and P. C. Woodland, Two efficient lattice rescoring methods using recurrent neural network language models, IEEE/ACM Trans. Audio, Speech & Language Processing, vol. 24, no. 8, pp ,
6. X. Chen, X. Liu, Y. Wang, M. J. F. Gales, and P. C. Woodland, Efficient training and evaluation of recurrent neural network language models for automatic speech recognition, IEEE/ACM Trans. Audio, Speech & Language Processing, vol. 24, no. 11, pp ,
7. T. Yoshioka and M. J. F. Gales, Environmentally robust ASR front-end for deep neural network acoustic models, Computer Speech & Language, vol. 31, no. 1, pp ,
8. L. Chen, N. Braunschweiler, and M. J. F. Gales, Speaker and expression factorization for audiobook data: Expressiveness and transplantation, IEEE/ACM Trans. Audio, Speech & Language Processing, vol. 23, no. 4, pp ,
9. X. Liu, M. J. F. Gales, and P. C. Woodland, Paraphrastic language models, Computer Speech & Language, vol. 28, no. 6, pp ,
10. V. Wan, J. Latorre, K. Yanagisawa, N. Braunschweiler, L. Chen, M. J. F. Gales, and M. Akamine, Building HMM-TTS voices on diverse data, J. Sel. Topics Signal Processing, vol. 8, no. 2, pp ,
11. L. Chen, M. J. F. Gales, N. Braunschweiler, M. Akamine, and K. Knill, Integrated expression prediction and speech synthesis from text, J. Sel. Topics Signal Processing, vol. 8, no. 2, pp ,
12. X. Liu, M. J. F. Gales, and P. C. Woodland, Use of contexts in language model interpolation and adaptation, Computer Speech & Language, vol. 27, no. 1, pp ,
13. R. C. van Dalen and M. J. F. Gales, Importance sampling to compute likelihoods of noise-corrupted speech, Computer Speech & Language, vol. 27, no. 1, pp ,
14. X. Liu, M. J. F. Gales, and P. C. Woodland, Language model cross adaptation for LVCSR system combination, Computer Speech & Language, vol. 27, no. 4, pp ,
15. R. Maia, M. Akamine, and M. J. F. Gales, Complex cepstrum for statistical parametric speech synthesis, Speech Communication, vol. 55, no. 5, pp ,
16. S. Zhang and M. J. F. Gales, Structured SVMs for automatic speech recognition, IEEE Trans. Audio, Speech & Language Processing, vol. 21, no. 3, pp ,
17. F. Diehl, M. J. F. Gales, M. Tomalin, and P. C. Woodland, Morphological decomposition in Arabic ASR systems, Computer Speech & Language, vol. 26, no. 4, pp ,
18. H. Zen, M. J. F. Gales, Y. Nankaku, and K. Tokuda, Product of experts for statistical parametric speech synthesis, IEEE Trans. Audio, Speech & Language Processing, vol. 20, no. 3, pp ,
19. H. Zen, N. Braunschweiler, S. Buchholz, M. J. F. Gales, K. Knill, S. Krstulovic, and J. Latorre, Statistical parametric speech synthesis based on speaker and language factorization, IEEE Trans. Audio, Speech & Language Processing, vol. 20, no. 6, pp ,
20. Y. Wang and M. J. F. Gales, Speaker and noise factorization for robust speech recognition, IEEE Trans. Audio, Speech & Language Processing, vol. 20, no. 7, pp ,

21. J. Park, F. Diehl, M. J. F. Gales, M. Tomalin, and P. C. Woodland, The efficient incorporation of MLP features into automatic speech recognition systems, Computer Speech & Language, vol. 25, no. 3, pp ,
22. Z. Roupakia and M. J. F. Gales, Kernel eigenvoices (revisited) for large-vocabulary speech recognition, IEEE Signal Process. Lett., vol. 18, no. 12, pp ,
23. D. K. Kim and M. J. F. Gales, Noisy constrained maximum-likelihood linear regression for noise-robust speech recognition, IEEE Trans. Audio, Speech & Language Processing, vol. 19, no. 2, pp ,
24. R. C. van Dalen and M. J. F. Gales, Extended VTS for noise-robust speech recognition, IEEE Trans. Audio, Speech & Language Processing, vol. 19, no. 4, pp ,
25. H. Xu, M. J. F. Gales, and K. K. Chin, Joint uncertainty decoding with predictive methods for noise robust speech recognition, IEEE Trans. Audio, Speech & Language Processing, vol. 19, no. 6, pp ,
26. M. J. F. Gales and F. Flego, Discriminative classifiers with adaptive kernels for noise robust speech recognition, Computer Speech & Language, vol. 24, no. 4, pp ,
27. K. Yu, M. J. F. Gales, L. Wang, and P. C. Woodland, Unsupervised training and directed manual transcription for LVCSR, Speech Communication, vol. 52, no. 7-8, pp ,
28. S. Zhang, A. Ragni, and M. J. F. Gales, Structured log linear models for noise robust speech recognition, IEEE Signal Process. Lett., vol. 17, no. 11, pp ,
29. C. Breslin and M. J. F. Gales, Directed decision trees for generating complementary systems, Speech Communication, vol. 51, no. 3, pp ,
30. K. Yu, M. J. F. Gales, and P. C. Woodland, Unsupervised adaptation with discriminative mapping transforms, IEEE Trans. Audio, Speech & Language Processing, vol. 17, no. 4, pp ,
31. C. Longworth and M. J. F. Gales, Combining derivative and parametric kernels for speaker verification, IEEE Trans. Audio, Speech & Language Processing, vol. 17, no. 4, pp ,
32. H. Liao and M. J. F. Gales, Issues with uncertainty decoding for noise robust automatic speech recognition, Speech Communication, vol. 50, no. 4, pp ,
33. K. C. Sim and M. J. F. Gales, Discriminative semi-parametric trajectory model for speech recognition, Computer Speech & Language, vol. 21, no. 4, pp ,
34. X. Liu and M. J. F. Gales, Automatic model complexity control using marginalized discriminative growth functions, IEEE Trans. Audio, Speech & Language Processing, vol. 15, no. 4, pp ,
35. K. Yu and M. J. F. Gales, Bayesian adaptive inference and adaptive training, IEEE Trans. Audio, Speech & Language Processing, vol. 15, no. 6, pp ,
36. M. I. Layton and M. J. F. Gales, Acoustic modelling using continuous rational kernels, VLSI Signal Processing, vol. 48, no. 1-2, pp ,
37. M. J. F. Gales and S. S. Airey, Product of Gaussians for speech recognition, Computer Speech & Language, vol. 20, no. 1, pp ,
38. M. J. F. Gales and M. I. Layton, Training augmented models using SVMs, IEICE Transactions, vol. 89-D, no. 3, pp ,
39. K. C. Sim and M. J. F. Gales, Minimum phone error training of precision matrix models, IEEE Trans. Audio, Speech & Language Processing, vol. 14, no. 3, pp ,
40. M. J. F. Gales, D. Y. Kim, P. C. Woodland, H. Y. Chan, D. Mrva, R. Sinha, and S. E. Tranter, Progress in the CU-HTK broadcast news transcription system, IEEE Trans. Audio, Speech & Language Processing, vol. 14, no. 5, pp ,
41. K. Yu and M. J. F. Gales, Discriminative cluster adaptive training, IEEE Trans. Audio, Speech & Language Processing, vol. 14, no. 5, pp ,
42. T. Hain, P. C. Woodland, G. Evermann, M. J. F. Gales, X. Liu, G. L. Moore, D. Povey, and L. Wang, Automatic transcription of conversational telephone speech, IEEE Trans. Speech and Audio Processing, vol. 13, no. 6, pp ,
43. A. I. Rosti and M. J. F. Gales, Factor analysed hidden Markov models for speech recognition, Computer Speech & Language, vol. 18, no. 2, pp ,

44. M. J. F. Gales, Transformation streams and the HMM error model, Computer Speech & Language, vol. 16, no. 2, pp ,
45. S. S. Chen, E. Eide, M. J. F. Gales, R. A. Gopinath, D. Kanvesky, and P. A. Olsen, Automatic transcription of broadcast news, Speech Communication, vol. 37, no. 1-2, pp ,
46. M. J. F. Gales, Maximum likelihood multiple subspace projections for hidden Markov models, IEEE Trans. Speech and Audio Processing, vol. 10, no. 2, pp ,
47. M. J. F. Gales, Cluster adaptive training of hidden Markov models, IEEE Trans. Speech and Audio Processing, vol. 8, no. 4, pp ,
48. M. J. F. Gales, K. Knill, and S. J. Young, State-based Gaussian selection in large vocabulary continuous speech recognition using HMMs, IEEE Trans. Speech and Audio Processing, vol. 7, no. 2, pp ,
49. M. J. F. Gales, Semi-tied covariance matrices for hidden Markov models, IEEE Trans. Speech and Audio Processing, vol. 7, no. 3, pp ,
50. M. J. F. Gales, Maximum likelihood linear transformations for HMM-based speech recognition, Computer Speech & Language, vol. 12, no. 2, pp ,
51. M. J. F. Gales, Predictive model-based compensation schemes for robust speech recognition, Speech Communication, vol. 25, no. 1-3, pp ,
52. M. J. F. Gales and P. C. Woodland, Mean and variance adaptation within the MLLR framework, Computer Speech & Language, vol. 10, no. 4, pp ,
53. M. J. F. Gales and S. J. Young, Robust continuous speech recognition using parallel model combination, IEEE Trans. Speech and Audio Processing, vol. 4, no. 5, pp ,
54. M. J. F. Gales and S. J. Young, Robust speech recognition in additive and convolutional noise using parallel model combination, Computer Speech & Language, vol. 9, no. 4, pp ,
55. M. J. F. Gales and S. J. Young, Cepstral parameter compensation for HMM recognition in noise, Speech Communication, vol. 12, no. 3, pp ,

Conference and Workshop Papers

1. Y. Wang, X. Chen, M. J. F. Gales, A. Ragni, and J. H. M. Wong, Phonetic and graphemic systems for multi-genre broadcast transcription, in 2018 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2018, Calgary, AB, Canada, April 15-20, 2018, pp ,
2. Y. Wang, C. Zhang, M. J. F. Gales, and P. C. Woodland, Speaker adaptation and adaptive training for jointly optimised tandem systems, in Interspeech 2018, 19th Annual Conference of the International Speech Communication Association, Hyderabad, India, 2-6 September 2018, pp ,
3. M. Wan, G. Degottex, and M. J. F. Gales, Waveform-based speaker representations for speech synthesis, in Interspeech 2018, 19th Annual Conference of the International Speech Communication Association, Hyderabad, India, 2-6 September 2018, pp ,
4. K. Kyriakopoulos, K. Knill, and M. J. F. Gales, A deep learning approach to assessing non-native pronunciation of English using phone distances, in Interspeech 2018, 19th Annual Conference of the International Speech Communication Association, Hyderabad, India, 2-6 September 2018, pp ,
5. K. Knill, M. J. F. Gales, K. Kyriakopoulos, A. Malinin, A. Ragni, Y. Wang, and A. Caines, Impact of ASR performance on free speaking language assessment, in Interspeech 2018, 19th Annual Conference of the International Speech Communication Association, Hyderabad, India, 2-6 September 2018, pp ,
6. A. Ragni and M. J. F. Gales, Automatic speech recognition system development in the wild, in Interspeech 2018, 19th Annual Conference of the International Speech Communication Association, Hyderabad, India, 2-6 September 2018, pp ,
7. O. Chen, A. Ragni, M. J. F. Gales, and X. Chen, Active memory networks for language modeling, in Interspeech 2018, 19th Annual Conference of the International Speech Communication Association, Hyderabad, India, 2-6 September 2018, pp ,
8. A. Malinin and M. J. F. Gales, Predictive uncertainty estimation via prior networks, in Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, NeurIPS 2018, 3-8 December 2018, Montréal, Canada, pp ,
9. A. Malinin, A. Ragni, K. Knill, and M. J. F. Gales, Incorporating uncertainty into deep learning for spoken language assessment, in Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, ACL 2017, Vancouver, Canada, July 30 - August 4, Volume 2: Short Papers, pp ,
10. J. H. M. Wong and M. J. F. Gales, Multi-task ensembles with teacher-student training, in 2017 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2017, Okinawa, Japan, December 16-20, 2017, pp ,
11. X. Chen, X. Liu, A. Ragni, Y. Wang, and M. J. F. Gales, Future word contexts in neural network language models, in 2017 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2017, Okinawa, Japan, December 16-20, 2017, pp ,
12. A. Malinin, K. Knill, and M. J. F. Gales, A hierarchical attention based model for off-topic spontaneous spoken response detection, in 2017 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2017, Okinawa, Japan, December 16-20, 2017, pp ,
13. M. Wan, G. Degottex, and M. J. F. Gales, Integrated speaker-adaptive speech synthesis, in 2017 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2017, Okinawa, Japan, December 16-20, 2017, pp ,
14. A. Ragni, C. Wu, M. J. F. Gales, J. Vasilakes, and K. M. Knill, Stimulated training for automatic speech recognition and keyword search in limited resource conditions, in 2017 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2017, New Orleans, LA, USA, March 5-9, 2017, pp ,
15. A. Ragni, D. Saunders, P. Zahemszky, J. Vasilakes, M. J. F. Gales, and K. M. Knill, Morph-to-word transduction for accurate and efficient automatic speech recognition and keyword search, in 2017 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2017, New Orleans, LA, USA, March 5-9, 2017, pp ,

16. X. Chen, A. Ragni, J. Vasilakes, X. Liu, K. Knill, and M. J. F. Gales, Recurrent neural network language models for keyword search, in 2017 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2017, New Orleans, LA, USA, March 5-9, 2017, pp ,
17. J. H. M. Wong and M. J. F. Gales, Student-teacher training with diverse decision tree ensembles, in Interspeech 2017, 18th Annual Conference of the International Speech Communication Association, Stockholm, Sweden, August 20-24, 2017, pp ,
18. X. Chen, A. Ragni, X. Liu, and M. J. F. Gales, Investigating bidirectional recurrent neural network language models for speech recognition, in Interspeech 2017, 18th Annual Conference of the International Speech Communication Association, Stockholm, Sweden, August 20-24, 2017, pp ,
19. C. Wu and M. J. F. Gales, Deep activation mixture model for speech recognition, in Interspeech 2017, 18th Annual Conference of the International Speech Communication Association, Stockholm, Sweden, August 20-24, 2017, pp ,
20. K. M. Knill, M. J. F. Gales, K. Kyriakopoulos, A. Ragni, and Y. Wang, Use of graphemic lexicons for spoken language assessment, in Interspeech 2017, 18th Annual Conference of the International Speech Communication Association, Stockholm, Sweden, August 20-24, 2017, pp ,
21. K. Kyriakopoulos, M. J. F. Gales, and K. Knill, Automatic characterisation of the pronunciation of non-native English speakers using phone distance features, in 7th ISCA International Workshop on Speech and Language Technology in Education, SLaTE 2017, Stockholm, Sweden, August 2017, pp ,
22. A. Malinin, K. Knill, A. Ragni, Y. Wang, and M. J. F. Gales, An attention based model for off-topic spontaneous spoken response detection: An initial study, in 7th ISCA International Workshop on Speech and Language Technology in Education, SLaTE 2017, Stockholm, Sweden, August 2017, pp ,
23. M. J. F. Gales, K. M. Knill, and A. Ragni, Low-resource speech recognition and keyword-spotting, in Speech and Computer - 19th International Conference, SPECOM 2017, Hatfield, UK, September 12-16, 2017, Proceedings, pp. 3-19,
24. A. Malinin, R. C. van Dalen, K. Knill, Y. Wang, and M. J. F. Gales, Off-topic response detection for spontaneous spoken English assessment, in Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016, August 7-12, 2016, Berlin, Germany, Volume 1: Long Papers,
25. C. Wu, P. Karanasou, and M. J. F. Gales, Combining i-vector representation and structured neural networks for rapid adaptation, in 2016 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2016, Shanghai, China, March 20-25, 2016, pp ,
26. J. Yang, C. Zhang, A. Ragni, M. J. F. Gales, and P. C. Woodland, System combination with log-linear models, in 2016 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2016, Shanghai, China, March 20-25, 2016, pp ,
27. L. Wang, C. Zhang, P. C. Woodland, M. J. F. Gales, P. Karanasou, P. Lanchantin, X. Liu, and Y. Qian, Improved DNN-based segmentation for multi-genre broadcast audio, in 2016 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2016, Shanghai, China, March 20-25, 2016, pp ,
28. X. Chen, X. Liu, Y. Qian, M. J. F. Gales, and P. C. Woodland, CUED-RNNLM - an open-source toolkit for efficient training and evaluation of recurrent neural network language models, in 2016 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2016, Shanghai, China, March 20-25, 2016, pp ,
29. C. Wu, P. Karanasou, M. J. F. Gales, and K. C. Sim, Stimulated deep neural network for speech recognition, in Interspeech 2016, 17th Annual Conference of the International Speech Communication Association, San Francisco, CA, USA, September 8-12, 2016, pp ,
30. J. Yang, A. Ragni, M. J. F. Gales, and K. M. Knill, Log-linear system combination using structured support vector machines, in Interspeech 2016, 17th Annual Conference of the International Speech Communication Association, San Francisco, CA, USA, September 8-12, 2016, pp ,

31. S. Kundu, K. C. Sim, and M. J. F. Gales, Incorporating a generative front-end layer to deep neural network for noise robust automatic speech recognition, in Interspeech 2016, 17th Annual Conference of the International Speech Communication Association, San Francisco, CA, USA, September 8-12, 2016, pp ,
32. J. H. M. Wong and M. J. F. Gales, Sequence student-teacher training of deep neural networks, in Interspeech 2016, 17th Annual Conference of the International Speech Communication Association, San Francisco, CA, USA, September 8-12, 2016, pp ,
33. A. Ragni, E. Dakin, X. Chen, M. J. F. Gales, and K. M. Knill, Multi-language neural network language models, in Interspeech 2016, 17th Annual Conference of the International Speech Communication Association, San Francisco, CA, USA, September 8-12, 2016, pp ,
34. P. Lanchantin, M. J. F. Gales, P. Karanasou, X. Liu, Y. Qian, L. Wang, P. C. Woodland, and C. Zhang, Selection of multi-genre broadcast data for the training of automatic speech recognition systems, in Interspeech 2016, 17th Annual Conference of the International Speech Communication Association, San Francisco, CA, USA, September 8-12, 2016, pp ,
35. D. J. Litman, S. J. Young, M. J. F. Gales, K. Knill, K. Ottewell, R. C. van Dalen, and D. Vandyke, Towards using conversations with spoken dialogue systems in the automated assessment of non-native speakers of English, in Proceedings of the SIGDIAL 2016 Conference, The 17th Annual Meeting of the Special Interest Group on Discourse and Dialogue, September 2016, Los Angeles, CA, USA, pp ,
36. G. Degottex, P. Lanchantin, and M. J. F. Gales, A pulse model in log-domain for a uniform synthesizer, in The 9th ISCA Speech Synthesis Workshop, Sunnyvale, CA, USA, September 2016, pp ,
37. R. C. van Dalen, J. Yang, H. Wang, A. Ragni, C. Zhang, and M. J. F. Gales, Structured discriminative models using deep neural-network features, in 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015, Scottsdale, AZ, USA, December 13-17, 2015, pp ,
38. X. Chen, X. Liu, M. J. F. Gales, and P. C. Woodland, Investigation of back-off based interpolation between recurrent neural network and n-gram language models, in 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015, Scottsdale, AZ, USA, December 13-17, 2015, pp ,
39. J. Cui, B. Kingsbury, B. Ramabhadran, A. Sethy, K. Audhkhasi, X. Cui, E. Kislal, L. Mangu, M. Nußbaum-Thom, M. Picheny, Z. Tüske, P. Golik, R. Schlüter, H. Ney, M. J. F. Gales, K. M. Knill, A. Ragni, H. Wang, and P. C. Woodland, Multilingual representations for low resource speech recognition and keyword search, in 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015, Scottsdale, AZ, USA, December 13-17, 2015, pp ,
40. S. Tan, K. C. Sim, and M. J. F. Gales, Improving the interpretability of deep neural networks with stimulated learning, in 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015, Scottsdale, AZ, USA, December 13-17, 2015, pp ,
41. P. C. Woodland, X. Liu, Y. Qian, C. Zhang, M. J. F. Gales, P. Karanasou, P. Lanchantin, and L. Wang, Cambridge University transcription systems for the multi-genre broadcast challenge, in 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015, Scottsdale, AZ, USA, December 13-17, 2015, pp ,
42. P. Lanchantin, M. J. F. Gales, P. Karanasou, X. Liu, Y. Qian, L. Wang, P. C. Woodland, and C. Zhang, The development of the Cambridge University alignment systems for the multi-genre broadcast challenge, in 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015, Scottsdale, AZ, USA, December 13-17, 2015, pp ,
43. P. Karanasou, M. J. F. Gales, P. Lanchantin, X. Liu, Y. Qian, L. Wang, P. C. Woodland, and C. Zhang, Speaker diarisation and longitudinal linking in multi-genre broadcast data, in 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015, Scottsdale, AZ, USA, December 13-17, 2015, pp ,
44. P. Bell, M. J. F. Gales, T. Hain, J. Kilgour, P. Lanchantin, X. Liu, A. McParland, S. Renals, O. Saz, M. Wester, and P. C. Woodland, The MGB challenge: Evaluating multi-genre broadcast media recognition, in 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015, Scottsdale, AZ, USA, December 13-17, 2015, pp ,

45. C. Wu and M. J. F. Gales, Multi-basis adaptive neural network for rapid adaptation in speech recognition, in 2015 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2015, South Brisbane, Queensland, Australia, April 19-24, 2015, pp ,
46. A. Ragni, M. J. F. Gales, and K. M. Knill, A language space representation for speech recognition, in 2015 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2015, South Brisbane, Queensland, Australia, April 19-24, 2015, pp ,
47. T. Drugman, Y. Stylianou, L. Chen, X. Chen, and M. J. F. Gales, Robust excitation-based features for automatic speech recognition, in 2015 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2015, South Brisbane, Queensland, Australia, April 19-24, 2015, pp ,
48. R. C. van Dalen, K. M. Knill, P. Tsiakoulis, and M. J. F. Gales, Improving multiple-crowd-sourced transcriptions using a speech recogniser, in 2015 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2015, South Brisbane, Queensland, Australia, April 19-24, 2015, pp ,
49. M. J. F. Gales, K. M. Knill, and A. Ragni, Unicode-based graphemic systems for limited resource languages, in 2015 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2015, South Brisbane, Queensland, Australia, April 19-24, 2015, pp ,
50. X. Chen, X. Liu, M. J. F. Gales, and P. C. Woodland, Improving the training and evaluation efficiency of recurrent neural network language models, in 2015 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2015, South Brisbane, Queensland, Australia, April 19-24, 2015, pp ,
51. X. Liu, X. Chen, M. J. F. Gales, and P. C. Woodland, Paraphrastic recurrent neural network language models, in 2015 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2015, South Brisbane, Queensland, Australia, April 19-24, 2015, pp ,
52. X. Chen, X. Liu, M. J. F. Gales, and P. C. Woodland, Recurrent neural network language model training with noise contrastive estimation for speech recognition, in 2015 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2015, South Brisbane, Queensland, Australia, April 19-24, 2015, pp ,
53. G. Mendels, E. Cooper, V. Soto, J. Hirschberg, M. J. F. Gales, K. M. Knill, A. Ragni, and H. Wang, Improving speech recognition and keyword search for low resource languages using web data, in INTERSPEECH 2015, 16th Annual Conference of the International Speech Communication Association, Dresden, Germany, September 6-10, 2015, pp ,
54. P. Lanchantin, C. Veaux, M. J. F. Gales, S. King, and J. Yamagishi, Reconstructing voices within the multiple-average-voice-model framework, in INTERSPEECH 2015, 16th Annual Conference of the International Speech Communication Association, Dresden, Germany, September 6-10, 2015, pp ,
55. R. C. van Dalen and M. J. F. Gales, Annotating large lattices with the exact word error, in INTERSPEECH 2015, 16th Annual Conference of the International Speech Communication Association, Dresden, Germany, September 6-10, 2015, pp ,
56. P. Karanasou, M. J. F. Gales, and P. C. Woodland, I-vector estimation using informative priors for adaptation of deep neural networks, in INTERSPEECH 2015, 16th Annual Conference of the International Speech Communication Association, Dresden, Germany, September 6-10, 2015, pp ,
57. X. Liu, F. Flego, L. Wang, C. Zhang, M. J. F. Gales, and P. C. Woodland, The Cambridge University 2014 BOLT conversational telephone Mandarin Chinese LVCSR system for speech translation, in INTERSPEECH 2015, 16th Annual Conference of the International Speech Communication Association, Dresden, Germany, September 6-10, 2015, pp ,
58. X. Chen, T. Tan, X. Liu, P. Lanchantin, M. Wan, M. J. F. Gales, and P. C. Woodland, Recurrent neural network language model adaptation for multi-genre broadcast speech recognition, in INTERSPEECH 2015, 16th Annual Conference of the International Speech Communication Association, Dresden, Germany, September 6-10, 2015, pp ,
59. H. Wang, A. Ragni, M. J. F. Gales, K. M. Knill, P. C. Woodland, and C. Zhang, Joint decoding of tandem and hybrid systems for improved keyword spotting on low resource languages, in INTERSPEECH 2015, 16th Annual Conference of the International Speech Communication Association, Dresden, Germany, September 6-10, 2015, pp ,

60. R. C. van Dalen, K. M. Knill, and M. J. F. Gales, Automatically grading learners' English using a Gaussian process, in ISCA International Workshop on Speech and Language Technology in Education, SLaTE 2015, Leipzig, Germany, September 4-5, 2015, pp. 7-12,
61. V. Wan, J. Latorre, K. Yanagisawa, M. J. F. Gales, and Y. Stylianou, Cluster adaptive training of average voice models, in IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2014, Florence, Italy, May 4-9, 2014, pp ,
62. P. Lanchantin, M. J. F. Gales, S. King, and J. Yamagishi, Multiple-average-voice-based speech synthesis, in IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2014, Florence, Italy, May 4-9, 2014, pp ,
63. L. Chen, N. Braunschweiler, and M. J. F. Gales, Speaker dependent expression predictor from text: Expressiveness and transplantation, in IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2014, Florence, Italy, May 4-9, 2014, pp ,
64. J. Yang, R. C. van Dalen, S. Zhang, and M. J. F. Gales, Infinite structured support vector machines for speech recognition, in IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2014, Florence, Italy, May 4-9, 2014, pp ,
65. X. Liu, M. J. F. Gales, and P. C. Woodland, Paraphrastic neural network language models, in IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2014, Florence, Italy, May 4-9, 2014, pp ,
66. X. Liu, Y. Wang, X. Chen, M. J. F. Gales, and P. C. Woodland, Efficient lattice rescoring using recurrent neural network language models, in IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2014, Florence, Italy, May 4-9, 2014, pp ,
67. T. Yoshioka, X. Chen, and M. J. F. Gales, Impact of single-microphone dereverberation on DNN-based meeting transcription systems, in IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2014, Florence, Italy, May 4-9, 2014, pp ,
68. T. Yoshioka, A. Ragni, and M. J. F. Gales, Investigation of unsupervised adaptation of DNN acoustic models with filter bank input, in IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2014, Florence, Italy, May 4-9, 2014, pp ,
69. K. Knill, M. J. F. Gales, A. Ragni, and S. P. Rath, Language independent and unsupervised acoustic models for speech recognition and keyword spotting, in INTERSPEECH 2014, 15th Annual Conference of the International Speech Communication Association, Singapore, September 14-18, 2014, pp ,
70. X. Chen, Y. Wang, X. Liu, M. J. F. Gales, and P. C. Woodland, Efficient GPU-based training of recurrent neural network language models using spliced sentence bunch, in INTERSPEECH 2014, 15th Annual Conference of the International Speech Communication Association, Singapore, September 14-18, 2014, pp ,
71. A. Ragni, K. M. Knill, S. P. Rath, and M. J. F. Gales, Data augmentation for low resource languages, in INTERSPEECH 2014, 15th Annual Conference of the International Speech Communication Association, Singapore, September 14-18, 2014, pp ,
72. S. P. Rath, K. M. Knill, A. Ragni, and M. J. F. Gales, Combining tandem and hybrid systems for improved speech recognition and keyword spotting on low resource languages, in INTERSPEECH 2014, 15th Annual Conference of the International Speech Communication Association, Singapore, September 14-18, 2014, pp ,
73. X. Chen, M. J. F. Gales, K. M. Knill, C. Breslin, L. Chen, K. K. Chin, and V. Wan, An initial investigation of long-term adaptation for meeting transcription, in INTERSPEECH 2014, 15th Annual Conference of the International Speech Communication Association, Singapore, September 14-18, 2014, pp ,
74. B. Kolluru, V. Wan, J. Latorre, K. Yanagisawa, and M. J. F. Gales, Generating multiple-accent pronunciations for TTS using joint sequence model interpolation, in INTERSPEECH 2014, 15th Annual Conference of the International Speech Communication Association, Singapore, September 14-18, 2014, pp ,
75. K. Yanagisawa, L. Chen, and M. J. F. Gales, Noise-robust TTS speaker adaptation with statistics smoothing, in INTERSPEECH 2014, 15th Annual Conference of the International Speech Communication Association, Singapore, September 14-18, 2014, pp ,

76. P. Karanasou, Y. Wang, M. J. F. Gales, and P. C. Woodland, Adaptation of deep neural network acoustic models using factorised i-vectors, in INTERSPEECH 2014, 15th Annual Conference of the International Speech Communication Association, Singapore, September 14-18, 2014, pp .
77. J. Latorre, K. Yanagisawa, V. Wan, B. Kolluru, and M. J. F. Gales, Speech intonation for TTS: study on evaluation methodology, in INTERSPEECH 2014, 15th Annual Conference of the International Speech Communication Association, Singapore, September 14-18, 2014, pp .
78. M. J. F. Gales, K. M. Knill, A. Ragni, and S. P. Rath, Speech recognition and keyword spotting for low-resource languages: Babel project research at CUED, in 4th Workshop on Spoken Language Technologies for Under-resourced Languages, SLTU 2014, St. Petersburg, Russia, May 14-16, 2014, pp .
79. K. Knill, M. J. F. Gales, S. P. Rath, P. C. Woodland, C. Zhang, and S. Zhang, Investigation of multilingual deep neural networks for spoken term detection, in 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, Olomouc, Czech Republic, December 8-12, 2013, pp .
80. J. Latorre, M. J. F. Gales, K. Knill, and M. Akamine, Training a supra-segmental parametric F0 model without interpolating F0, in IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2013, Vancouver, BC, Canada, May 26-31, 2013, pp .
81. S. Zhang and M. J. F. Gales, Kernelized log linear models for continuous speech recognition, in IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2013, Vancouver, BC, Canada, May 26-31, 2013, pp .
82. R. C. van Dalen, A. Ragni, and M. J. F. Gales, Efficient decoding with generative score-spaces using the expectation semiring, in IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2013, Vancouver, BC, Canada, May 26-31, 2013, pp .
83. Y. Wang and M. J. F. Gales, Tandem system adaptation using multiple linear feature transforms, in IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2013, Vancouver, BC, Canada, May 26-31, 2013, pp .
84. R. Maia, M. Akamine, and M. J. F. Gales, Complex cepstrum analysis based on the minimum mean squared error, in IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2013, Vancouver, BC, Canada, May 26-31, 2013, pp .
85. L. Chen, M. J. F. Gales, N. Braunschweiler, M. Akamine, and K. Knill, Integrated automatic expression prediction and speech synthesis from text, in IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2013, Vancouver, BC, Canada, May 26-31, 2013, pp .
86. J. Mamou, J. Cui, X. Cui, M. J. F. Gales, B. Kingsbury, K. Knill, L. Mangu, D. Nolden, M. Picheny, B. Ramabhadran, R. Schlüter, A. Sethy, and P. C. Woodland, System combination and score normalization for spoken term detection, in IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2013, Vancouver, BC, Canada, May 26-31, 2013, pp .
87. B. Kingsbury, J. Cui, X. Cui, M. J. F. Gales, K. Knill, J. Mamou, L. Mangu, D. Nolden, M. Picheny, B. Ramabhadran, R. Schlüter, A. Sethy, and P. C. Woodland, A high-performance Cantonese keyword search system, in IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2013, Vancouver, BC, Canada, May 26-31, 2013, pp .
88. X. Liu, M. J. F. Gales, and P. C. Woodland, Paraphrastic language models and combination with neural network language models, in IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2013, Vancouver, BC, Canada, May 26-31, 2013, pp .
89. M. S. Seigel, P. C. Woodland, and M. J. F. Gales, A confidence-based approach for improving keyword hypothesis scores, in IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2013, Vancouver, BC, Canada, May 26-31, 2013, pp .
90. P. Lanchantin, P. Bell, M. J. F. Gales, T. Hain, X. Liu, Y. Long, J. Quinnell, S. Renals, O. Saz, M. S. Seigel, P. Swietojanski, and P. C. Woodland, Automatic transcription of multi-genre media archives, in Proceedings of the First Workshop on Speech, Language and Audio in Multimedia, Marseille, France, August 22-23, 2013, pp .

91. Y. Wang and M. J. F. Gales, An explicit independence constraint for factorised adaptation in speech recognition, in INTERSPEECH 2013, 14th Annual Conference of the International Speech Communication Association, Lyon, France, August 25-29, 2013, pp .
92. Y. Long, M. J. F. Gales, P. Lanchantin, X. Liu, M. S. Seigel, and P. C. Woodland, Improving lightly supervised training for broadcast transcription, in INTERSPEECH 2013, 14th Annual Conference of the International Speech Communication Association, Lyon, France, August 25-29, 2013, pp .
93. R. Maia, M. J. F. Gales, Y. Stylianou, and M. Akamine, Minimum mean squared error based warped complex cepstrum analysis for statistical parametric speech synthesis, in INTERSPEECH 2013, 14th Annual Conference of the International Speech Communication Association, Lyon, France, August 25-29, 2013, pp .
94. V. Wan, R. Anderson, A. Blokland, N. Braunschweiler, L. Chen, B. Kolluru, J. Latorre, R. Maia, B. Stenger, K. Yanagisawa, Y. Stylianou, M. Akamine, M. J. F. Gales, and R. Cipolla, Photorealistic expressive text to talking head synthesis, in INTERSPEECH 2013, 14th Annual Conference of the International Speech Communication Association, Lyon, France, August 25-29, 2013, pp .
95. J. Yang, R. C. van Dalen, and M. J. F. Gales, Infinite support vector machines in speech recognition, in INTERSPEECH 2013, 14th Annual Conference of the International Speech Communication Association, Lyon, France, August 25-29, 2013, pp .
96. X. Liu, M. J. F. Gales, and P. C. Woodland, Cross-domain paraphrasing for improving language modelling using out-of-domain data, in INTERSPEECH 2013, 14th Annual Conference of the International Speech Communication Association, Lyon, France, August 25-29, 2013, pp .
97. K. Yanagisawa, J. Latorre, V. Wan, M. J. F. Gales, and S. King, Noise robustness in HMM-TTS speaker adaptation, in The Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, Barcelona, Spain, August 31-September 2, 2013, pp .
98. F. Eyben, S. Buchholz, N. Braunschweiler, J. Latorre, V. Wan, M. J. F. Gales, and K. Knill, Unsupervised clustering of emotion and voice styles for expressive TTS, in 2012 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2012, Kyoto, Japan, March 25-30, 2012, pp .
99. A. Ragni and M. J. F. Gales, Inference algorithms for generative score-spaces, in 2012 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2012, Kyoto, Japan, March 25-30, 2012, pp .
100. R. Maia, M. Akamine, and M. J. F. Gales, Complex cepstrum as phase information in statistical parametric speech synthesis, in 2012 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2012, Kyoto, Japan, March 25-30, 2012, pp .
101. F. Flego and M. J. F. Gales, Factor analysis based VTS discriminative adaptive training, in 2012 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2012, Kyoto, Japan, March 25-30, 2012, pp .
102. L. Chen, M. J. F. Gales, V. Wan, J. Latorre, and M. Akamine, Exploring rich expressive information from audiobook data using cluster adaptive training, in INTERSPEECH 2012, 13th Annual Conference of the International Speech Communication Association, Portland, Oregon, USA, September 9-13, 2012, pp .
103. J. Latorre, V. Wan, M. J. F. Gales, L. Chen, K. K. Chin, K. Knill, and M. Akamine, Speech factorization for HMM-TTS based on cluster adaptive training, in INTERSPEECH 2012, 13th Annual Conference of the International Speech Communication Association, Portland, Oregon, USA, September 9-13, 2012, pp .
104. V. Wan, J. Latorre, K. K. Chin, L. Chen, M. J. F. Gales, H. Zen, K. Knill, and M. Akamine, Combining multiple high quality corpora for improving HMM-TTS, in INTERSPEECH 2012, 13th Annual Conference of the International Speech Communication Association, Portland, Oregon, USA, September 9-13, 2012, pp .
105. Y. Wang and M. J. F. Gales, Model-based approaches to adaptive training in reverberant environments, in INTERSPEECH 2012, 13th Annual Conference of the International Speech Communication Association, Portland, Oregon, USA, September 9-13, 2012, pp .

106. M. J. F. Gales and F. Flego, Model-based approaches for degraded channel modelling in robust ASR, in INTERSPEECH 2012, 13th Annual Conference of the International Speech Communication Association, Portland, Oregon, USA, September 9-13, 2012, pp .
107. X. Liu, M. J. F. Gales, and P. C. Woodland, Paraphrastic language models, in INTERSPEECH 2012, 13th Annual Conference of the International Speech Communication Association, Portland, Oregon, USA, September 9-13, 2012, pp .
108. Z. Roupakia, A. Ragni, and M. J. F. Gales, Rapid nonlinear speaker adaptation for large-vocabulary continuous speech recognition, in INTERSPEECH 2012, 13th Annual Conference of the International Speech Communication Association, Portland, Oregon, USA, September 9-13, 2012, pp .
109. M. J. F. Gales, A. Ragni, A. Zhang, and R. C. van Dalen, Structured discriminative models for speech recognition, in 2012 Symposium on Machine Learning in Speech and Language Processing, MLSLP 2012, Portland, Oregon, USA, September 14, 2012.
110. P. Bell, M. J. F. Gales, P. Lanchantin, X. Liu, Y. Long, S. Renals, P. Swietojanski, and P. C. Woodland, Transcription of multi-genre media archives using out-of-domain data, in 2012 IEEE Spoken Language Technology Workshop (SLT), Miami, FL, USA, December 2-5, 2012, pp .
111. S. Zhang and M. J. F. Gales, Extending noise robust structured support vector machines to larger vocabulary tasks, in 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, ASRU 2011, Waikoloa, HI, USA, December 11-15, 2011, pp .
112. Y. Wang and M. J. F. Gales, Improving reverberant VTS for hands-free robust speech recognition, in 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, ASRU 2011, Waikoloa, HI, USA, December 11-15, 2011, pp .
113. A. Ragni and M. J. F. Gales, Derivative kernels for noise robust ASR, in 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, ASRU 2011, Waikoloa, HI, USA, December 11-15, 2011, pp .
114. R. C. van Dalen and M. J. F. Gales, A variational perspective on noise-robust speech recognition, in 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, ASRU 2011, Waikoloa, HI, USA, December 11-15, 2011, pp .
115. H. Zen and M. J. F. Gales, Decision tree-based context clustering based on cross validation and hierarchical priors, in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011, May 22-27, 2011, Prague Congress Center, Prague, Czech Republic, pp .
116. Y. Wang and M. J. F. Gales, Speaker and noise factorisation on the AURORA4 task, in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011, May 22-27, 2011, Prague Congress Center, Prague, Czech Republic, pp .
117. J. Latorre, M. J. F. Gales, S. Buchholz, K. Knill, M. Tamura, Y. Ohtani, and M. Akamine, Continuous F0 in the source-excitation generation for HMM-based TTS: do we need voiced/unvoiced classification?, in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011, May 22-27, 2011, Prague Congress Center, Prague, Czech Republic, pp .
118. A. Ragni and M. J. F. Gales, Structured discriminative models for noise robust continuous speech recognition, in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011, May 22-27, 2011, Prague Congress Center, Prague, Czech Republic, pp .
119. F. Flego and M. J. F. Gales, Factor analysis based VTS and JUD noise estimation and compensation, in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011, May 22-27, 2011, Prague Congress Center, Prague, Czech Republic, pp .
120. X. Liu, M. J. F. Gales, J. L. Hieronymus, and P. C. Woodland, Investigation of acoustic units for LVCSR systems, in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011, May 22-27, 2011, Prague Congress Center, Prague, Czech Republic, pp .

121. L. Chen, M. J. F. Gales, and K. K. Chin, Constrained discriminative mapping transforms for unsupervised speaker adaptation, in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011, May 22-27, 2011, Prague Congress Center, Prague, Czech Republic, pp .
122. K. K. Chin, H. Xu, M. J. F. Gales, C. Breslin, and K. Knill, Rapid joint speaker and noise compensation for robust speech recognition, in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011, May 22-27, 2011, Prague Congress Center, Prague, Czech Republic, pp .
123. F. Diehl, M. J. F. Gales, X. Liu, M. Tomalin, and P. C. Woodland, Word boundary modelling and full covariance Gaussians for Arabic speech-to-text systems, in INTERSPEECH 2011, 12th Annual Conference of the International Speech Communication Association, Florence, Italy, August 27-31, 2011, pp .
124. S. Zhang and M. J. F. Gales, Structured support vector machines for noise robust continuous speech recognition, in INTERSPEECH 2011, 12th Annual Conference of the International Speech Communication Association, Florence, Italy, August 27-31, 2011, pp .
125. C. Breslin, K. K. Chin, M. J. F. Gales, and K. Knill, Integrated online speaker clustering and adaptation, in INTERSPEECH 2011, 12th Annual Conference of the International Speech Communication Association, Florence, Italy, August 27-31, 2011, pp .
126. R. Maia, H. Zen, K. Knill, M. J. F. Gales, and S. Buchholz, Multipulse sequences for residual signal modeling, in INTERSPEECH 2011, 12th Annual Conference of the International Speech Communication Association, Florence, Italy, August 27-31, 2011, pp .
127. T. Li, P. C. Woodland, F. Diehl, and M. J. F. Gales, Graphone model interpolation and Arabic pronunciation generation, in INTERSPEECH 2011, 12th Annual Conference of the International Speech Communication Association, Florence, Italy, August 27-31, 2011, pp .
128. N. Pilkington, H. Zen, and M. J. F. Gales, Gaussian process experts for voice conversion, in INTERSPEECH 2011, 12th Annual Conference of the International Speech Communication Association, Florence, Italy, August 27-31, 2011, pp .
129. X. Liu, M. J. F. Gales, and P. C. Woodland, Improving LVCSR system combination using neural network language model cross adaptation, in INTERSPEECH 2011, 12th Annual Conference of the International Speech Communication Association, Florence, Italy, August 27-31, 2011, pp .
130. H. Zen, M. J. F. Gales, Y. Nankaku, and K. Tokuda, Statistical parametric speech synthesis based on product of experts, in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2010, March 2010, Sheraton Dallas Hotel, Dallas, Texas, USA, pp .
131. M. Tomalin, F. Diehl, M. J. F. Gales, J. Park, and P. C. Woodland, Recent improvements to the Cambridge Arabic speech-to-text systems, in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2010, March 2010, Sheraton Dallas Hotel, Dallas, Texas, USA, pp .
132. X. Liu, M. J. F. Gales, J. L. Hieronymus, and P. C. Woodland, Language model combination and adaptation using weighted finite state transducers, in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2010, March 2010, Sheraton Dallas Hotel, Dallas, Texas, USA, pp .
133. M. J. F. Gales and K. Yu, Canonical state models for automatic speech recognition, in INTERSPEECH 2010, 11th Annual Conference of the International Speech Communication Association, Makuhari, Chiba, Japan, September 26-30, 2010, pp .
134. X. Liu, M. J. F. Gales, and P. C. Woodland, Language model cross adaptation for LVCSR system combination, in INTERSPEECH 2010, 11th Annual Conference of the International Speech Communication Association, Makuhari, Chiba, Japan, September 26-30, 2010, pp .
135. R. C. van Dalen and M. J. F. Gales, Asymptotically exact noise-corrupted speech likelihoods, in INTERSPEECH 2010, 11th Annual Conference of the International Speech Communication Association, Makuhari, Chiba, Japan, September 26-30, 2010, pp .

SRIRAM GANAPATHY. Indian Institute of Science, Phone: +91-(80) Bangalore, India, Fax: +91-(80)

SRIRAM GANAPATHY. Indian Institute of Science, Phone: +91-(80) Bangalore, India, Fax: +91-(80) SRIRAM GANAPATHY Assistant Professor, Email: sriram@ee.iisc.ernet.in Electrical Engineering, Web: http://www.leap.ee.iisc.ac.in/sriram Indian Institute of Science, Phone: +91-(80)-2293-2433 Bangalore,

More information

Noise-Robust Speech Recognition Technologies in Mobile Environments

Noise-Robust Speech Recognition Technologies in Mobile Environments Noise-Robust Speech Recognition echnologies in Mobile Environments Mobile environments are highly influenced by ambient noise, which may cause a significant deterioration of speech recognition performance.

More information

CONSTRUCTING TELEPHONE ACOUSTIC MODELS FROM A HIGH-QUALITY SPEECH CORPUS

CONSTRUCTING TELEPHONE ACOUSTIC MODELS FROM A HIGH-QUALITY SPEECH CORPUS CONSTRUCTING TELEPHONE ACOUSTIC MODELS FROM A HIGH-QUALITY SPEECH CORPUS Mitchel Weintraub and Leonardo Neumeyer SRI International Speech Research and Technology Program Menlo Park, CA, 94025 USA ABSTRACT

More information

Enhanced Feature Extraction for Speech Detection in Media Audio

Enhanced Feature Extraction for Speech Detection in Media Audio INTERSPEECH 2017 August 20 24, 2017, Stockholm, Sweden Enhanced Feature Extraction for Speech Detection in Media Audio Inseon Jang 1, ChungHyun Ahn 1, Jeongil Seo 1, Younseon Jang 2 1 Media Research Division,

More information

THE MGB-2 CHALLENGE: ARABIC MULTI-DIALECT BROADCAST MEDIA RECOGNITION

THE MGB-2 CHALLENGE: ARABIC MULTI-DIALECT BROADCAST MEDIA RECOGNITION THE MGB-2 CHALLENGE: ARABIC MULTI-DIALECT BROADCAST MEDIA RECOGNITION Ahmed Ali 1,2, Peter Bell 2, James Glass 3, Yacine Messaoui 4, Hamdy Mubarak 1, Steve Renals 2, Yifan Zhang 1 1 Qatar Computing Research

More information

Online Speaker Adaptation of an Acoustic Model using Face Recognition

Online Speaker Adaptation of an Acoustic Model using Face Recognition Online Speaker Adaptation of an Acoustic Model using Face Recognition Pavel Campr 1, Aleš Pražák 2, Josef V. Psutka 2, and Josef Psutka 2 1 Center for Machine Perception, Department of Cybernetics, Faculty

More information

VIRTUAL ASSISTANT FOR DEAF AND DUMB

VIRTUAL ASSISTANT FOR DEAF AND DUMB VIRTUAL ASSISTANT FOR DEAF AND DUMB Sumanti Saini, Shivani Chaudhry, Sapna, 1,2,3 Final year student, CSE, IMSEC Ghaziabad, U.P., India ABSTRACT For the Deaf and Dumb community, the use of Information

More information

Single-Channel Sound Source Localization Based on Discrimination of Acoustic Transfer Functions

Single-Channel Sound Source Localization Based on Discrimination of Acoustic Transfer Functions 3 Single-Channel Sound Source Localization Based on Discrimination of Acoustic Transfer Functions Ryoichi Takashima, Tetsuya Takiguchi and Yasuo Ariki Graduate School of System Informatics, Kobe University,

More information

HISTOGRAM EQUALIZATION BASED FRONT-END PROCESSING FOR NOISY SPEECH RECOGNITION

HISTOGRAM EQUALIZATION BASED FRONT-END PROCESSING FOR NOISY SPEECH RECOGNITION 2 th May 216. Vol.87. No.2 25-216 JATIT & LLS. All rights reserved. HISTOGRAM EQUALIZATION BASED FRONT-END PROCESSING FOR NOISY SPEECH RECOGNITION 1 IBRAHIM MISSAOUI, 2 ZIED LACHIRI National Engineering

More information

Gender Based Emotion Recognition using Speech Signals: A Review

Gender Based Emotion Recognition using Speech Signals: A Review 50 Gender Based Emotion Recognition using Speech Signals: A Review Parvinder Kaur 1, Mandeep Kaur 2 1 Department of Electronics and Communication Engineering, Punjabi University, Patiala, India 2 Department

More information

SRIRAM GANAPATHY. Indian Institute of Science, Phone: +91-(80) Bangalore, India, Fax: +91-(80)

SRIRAM GANAPATHY. Indian Institute of Science, Phone: +91-(80) Bangalore, India, Fax: +91-(80) SRIRAM GANAPATHY Assistant Professor, Email: sriramg@iisc.ac.in Electrical Engineering, Web: http://www.leap.ee.iisc.ac.in/sriram Indian Institute of Science, Phone: +91-(80)-2293-2433 Bangalore, India,

More information

The SRI System for the NIST OpenSAD 2015 Speech Activity Detection Evaluation

The SRI System for the NIST OpenSAD 2015 Speech Activity Detection Evaluation INTERSPEECH 2016 September 8 12, 2016, San Francisco, USA The SRI System for the NIST OpenSAD 2015 Speech Activity Detection Evaluation Martin Graciarena 1, Luciana Ferrer 2, Vikramjit Mitra 1 1 SRI International,

More information

Acoustic Signal Processing Based on Deep Neural Networks

Acoustic Signal Processing Based on Deep Neural Networks Acoustic Signal Processing Based on Deep Neural Networks Chin-Hui Lee School of ECE, Georgia Tech chl@ece.gatech.edu Joint work with Yong Xu, Yanhui Tu, Qing Wang, Tian Gao, Jun Du, LiRong Dai Outline

More information

A PUBLIC DOMAIN SPEECH-TO-TEXT SYSTEM

A PUBLIC DOMAIN SPEECH-TO-TEXT SYSTEM A PUBLIC DOMAIN SPEECH-TO-TEXT SYSTEM M. Ordowski+, N. Deshmukh*, A. Ganapathiraju*, J. Hamaker*, and J. Picone* *Institute for Signal and Information Processing Department for Electrical and Computer

More information

Full Utilization of Closed-captions in Broadcast News Recognition

Full Utilization of Closed-captions in Broadcast News Recognition Full Utilization of Closed-captions in Broadcast News Recognition Meng Meng, Wang Shijin, Liang Jiaen, Ding Peng, Xu Bo. Hi-Tech Innovation Center, Institute of Automation Chinese Academy of Sciences,

More information

Lecturer: T. J. Hazen. Handling variability in acoustic conditions. Computing and applying confidence scores

Lecturer: T. J. Hazen. Handling variability in acoustic conditions. Computing and applying confidence scores Lecture # 20 Session 2003 Noise Robustness and Confidence Scoring Lecturer: T. J. Hazen Handling variability in acoustic conditions Channel compensation Background noise compensation Foreground noises

More information

Advanced Audio Interface for Phonetic Speech. Recognition in a High Noise Environment

Advanced Audio Interface for Phonetic Speech. Recognition in a High Noise Environment DISTRIBUTION STATEMENT A Approved for Public Release Distribution Unlimited Advanced Audio Interface for Phonetic Speech Recognition in a High Noise Environment SBIR 99.1 TOPIC AF99-1Q3 PHASE I SUMMARY

More information

Adaptation of Classification Model for Improving Speech Intelligibility in Noise

Adaptation of Classification Model for Improving Speech Intelligibility in Noise 1: (Junyoung Jung et al.: Adaptation of Classification Model for Improving Speech Intelligibility in Noise) (Regular Paper) 23 4, 2018 7 (JBE Vol. 23, No. 4, July 2018) https://doi.org/10.5909/jbe.2018.23.4.511

More information

Broadband Wireless Access and Applications Center (BWAC) CUA Site Planning Workshop

Broadband Wireless Access and Applications Center (BWAC) CUA Site Planning Workshop Broadband Wireless Access and Applications Center (BWAC) CUA Site Planning Workshop Lin-Ching Chang Department of Electrical Engineering and Computer Science School of Engineering Work Experience 09/12-present,

More information

EECS 433 Statistical Pattern Recognition

EECS 433 Statistical Pattern Recognition EECS 433 Statistical Pattern Recognition Ying Wu Electrical Engineering and Computer Science Northwestern University Evanston, IL 60208 http://www.eecs.northwestern.edu/~yingwu 1 / 19 Outline What is Pattern

More information

Speech Enhancement Using Deep Neural Network

Speech Enhancement Using Deep Neural Network Speech Enhancement Using Deep Neural Network Pallavi D. Bhamre 1, Hemangi H. Kulkarni 2 1 Post-graduate Student, Department of Electronics and Telecommunication, R. H. Sapat College of Engineering, Management

More information

Speech recognition in noisy environments: A survey

Speech recognition in noisy environments: A survey T-61.182 Robustness in Language and Speech Processing Speech recognition in noisy environments: A survey Yifan Gong presented by Tapani Raiko Feb 20, 2003 About the Paper Article published in Speech Communication

More information

Speech Enhancement Based on Deep Neural Networks

Speech Enhancement Based on Deep Neural Networks Speech Enhancement Based on Deep Neural Networks Chin-Hui Lee School of ECE, Georgia Tech chl@ece.gatech.edu Joint work with Yong Xu and Jun Du at USTC 1 Outline and Talk Agenda In Signal Processing Letter,

More information

Performance of Gaussian Mixture Models as a Classifier for Pathological Voice

Performance of Gaussian Mixture Models as a Classifier for Pathological Voice PAGE 65 Performance of Gaussian Mixture Models as a Classifier for Pathological Voice Jianglin Wang, Cheolwoo Jo SASPL, School of Mechatronics Changwon ational University Changwon, Gyeongnam 64-773, Republic

More information

Full text available at: The Application of Hidden Markov Models in Speech Recognition

Full text available at:  The Application of Hidden Markov Models in Speech Recognition The Application of Hidden Markov Models in Speech Recognition The Application of Hidden Markov Models in Speech Recognition Mark Gales Cambridge University Engineering Department Cambridge CB2 1PZ UK mjfg@eng.cam.ac.uk

More information

Using Source Models in Speech Separation

Using Source Models in Speech Separation Using Source Models in Speech Separation Dan Ellis Laboratory for Recognition and Organization of Speech and Audio Dept. Electrical Eng., Columbia Univ., NY USA dpwe@ee.columbia.edu http://labrosa.ee.columbia.edu/

More information

COMPARISON BETWEEN GMM-SVM SEQUENCE KERNEL AND GMM: APPLICATION TO SPEECH EMOTION RECOGNITION

COMPARISON BETWEEN GMM-SVM SEQUENCE KERNEL AND GMM: APPLICATION TO SPEECH EMOTION RECOGNITION Journal of Engineering Science and Technology Vol. 11, No. 9 (2016) 1221-1233 School of Engineering, Taylor s University COMPARISON BETWEEN GMM-SVM SEQUENCE KERNEL AND GMM: APPLICATION TO SPEECH EMOTION

More information

Hybrid HMM and HCRF model for sequence classification

Hybrid HMM and HCRF model for sequence classification Hybrid HMM and HCRF model for sequence classification Y. Soullard and T. Artières University Pierre and Marie Curie - LIP6 4 place Jussieu 75005 Paris - France Abstract. We propose a hybrid model combining

More information

The MIT Mobile Device Speaker Verification Corpus: Data Collection and Preliminary Experiments

The MIT Mobile Device Speaker Verification Corpus: Data Collection and Preliminary Experiments The MIT Mobile Device Speaker Verification Corpus: Data Collection and Preliminary Experiments Ram H. Woo, Alex Park, and Timothy J. Hazen MIT Computer Science and Artificial Intelligence Laboratory 32

More information

Recognition & Organization of Speech & Audio

Recognition & Organization of Speech & Audio Recognition & Organization of Speech & Audio Dan Ellis http://labrosa.ee.columbia.edu/ Outline 1 2 3 Introducing Projects in speech, music & audio Summary overview - Dan Ellis 21-9-28-1 1 Sound organization

More information

Analysis of Emotion Recognition using Facial Expressions, Speech and Multimodal Information

Analysis of Emotion Recognition using Facial Expressions, Speech and Multimodal Information Analysis of Emotion Recognition using Facial Expressions, Speech and Multimodal Information C. Busso, Z. Deng, S. Yildirim, M. Bulut, C. M. Lee, A. Kazemzadeh, S. Lee, U. Neumann, S. Narayanan Emotion

More information

A Comparison of Deep Neural Network Training Methods for Large Vocabulary Speech Recognition

A Comparison of Deep Neural Network Training Methods for Large Vocabulary Speech Recognition A Comparison of Deep Neural Network Training Methods for Large Vocabulary Speech Recognition LászlóTóth and Tamás Grósz MTA-SZTE Research Group on Artificial Intelligence Hungarian Academy of Sciences

More information

General Soundtrack Analysis

General Soundtrack Analysis General Soundtrack Analysis Dan Ellis oratory for Recognition and Organization of Speech and Audio () Electrical Engineering, Columbia University http://labrosa.ee.columbia.edu/

More information

Speech as HCI. HCI Lecture 11. Human Communication by Speech. Speech as HCI(cont. 2) Guest lecture: Speech Interfaces

Speech as HCI. HCI Lecture 11. Human Communication by Speech. Speech as HCI(cont. 2) Guest lecture: Speech Interfaces HCI Lecture 11 Guest lecture: Speech Interfaces Hiroshi Shimodaira Institute for Communicating and Collaborative Systems (ICCS) Centre for Speech Technology Research (CSTR) http://www.cstr.ed.ac.uk Thanks

More information

EXEMPLAR SELECTION TECHNIQUES FOR SPARSE REPRESENTATIONS OF SPEECH USING MULTIPLE DICTIONARIES. Emre Yılmaz, Jort F. Gemmeke, and Hugo Van hamme

EXEMPLAR SELECTION TECHNIQUES FOR SPARSE REPRESENTATIONS OF SPEECH USING MULTIPLE DICTIONARIES. Emre Yılmaz, Jort F. Gemmeke, and Hugo Van hamme EXEMPLAR SELECTION TECHNIQUES FOR SPARSE REPRESENTATIONS OF SPEECH USING MULTIPLE DICTIONARIES Emre Yılmaz, Jort F. Gemmeke, and Hugo Van hamme Dept. ESAT, KU Leuven, Leuven, Belgium ABSTRACT This paper

More information

SPEECH TO TEXT CONVERTER USING GAUSSIAN MIXTURE MODEL(GMM)

SPEECH TO TEXT CONVERTER USING GAUSSIAN MIXTURE MODEL(GMM) SPEECH TO TEXT CONVERTER USING GAUSSIAN MIXTURE MODEL(GMM) Virendra Chauhan 1, Shobhana Dwivedi 2, Pooja Karale 3, Prof. S.M. Potdar 4 1,2,3B.E. Student 4 Assitant Professor 1,2,3,4Department of Electronics

More information

Speech Compression for Noise-Corrupted Thai Dialects

Speech Compression for Noise-Corrupted Thai Dialects American Journal of Applied Sciences 9 (3): 278-282, 2012 ISSN 1546-9239 2012 Science Publications Speech Compression for Noise-Corrupted Thai Dialects Suphattharachai Chomphan Department of Electrical
