Advanced Topics for Class Presentation

(Once you form a group, choose a topic as soon as you can. I assign topics on a first-ask-first-get basis.) (Marked * means the topic is taken by others. 1-person group is allowed only for last person if total number is odd.)

* [1] MAP estimation of HMM: J.-L. Gauvain and C.-H. Lee, “Maximum a posteriori Estimation for Multivariate Gaussian mixture Observation of Markov Chains,” IEEE Trans. on Speech and Audio Processing, pp.291-298, Vol.2, 1994.

[2] On-Line Bayesian Learning of HMM: Q.Huo and C.-H. Lee, “On-line Adaptive Learning of the Continuous Density Hidden Markov Model based on Approximate Recursive Bayes Estimate,” IEEE Trans. on Speech and Audio Processing, pp.161-172, Vol. 5, No. 2, 1997.

[3] MMIE of HMM: Y. Normandin, R. Cardin and R. De Mori, “High-Performance Connected Digit Recognition Using Maximum Mutual Information Estimation,” IEEE Trans. on Speech and Audio Processing, pp. Vol. 2, No. 2, Apr. 1994.

[4] MCE of HMM: B.-H. Juang, W. Chou and C.-H. Lee, “Minimum Classification Error Rate Methods for Speech Recognition,” IEEE Trans. on Speech and Audio Processing, pp.257-265, Vol. 5, No. 3, May 1997.

[5] Large Marge Estimation of HMM: H. Jiang, X. Li and C.-J. Liu, “Large Margin Hidden Markov Models for Speech Recognition,” IEEE Trans. On Audio, Speech and Language Processing,? pp.1584-1595, Vol. 14, No. 5, September 2006.

[6] Decision Tree for HMM tying: W. Reichl and W. Chou, “Robust Decision Tree State Tying for Continuous Speech Recognition,” IEEE Trans. on Speech and Audio Processing, pp.555-566, Vol. 8, No. 5, Sep. 2000.

[7] Search in large vocabulary ASR: S. Ortmanns, H. Ney and X. Aubert, “A word graph algorithm for large vocabulary continuous speech recognition,” Computer Speech and Language, pp.43-72, Vol. 11, No.1, 1997.

[8] Adaptive Statistical Language Modeling: R. Rosenfeld, “A maximum entropy approach to adaptive statistical language modeling,” Computer Speech and Language, Vol. 10, pp.187-228, 1996.

[9] Transformation-based speaker adaptation: C. J. Leggetter and P. C. Woodland, “Maximum Likelihood Linear Regression for Speaker Adaptation of Continuous Density Hidden Markov Models,” Computer Speech and Language, pp. 171-185, Vol. 9, 1995.

*[10] Speaker Verification: Q. Li, B.-H. Juang, C.-H. Lee, “Automatic verbal information verification for user authentication” IEEE Trans. on Speech and Audio Processing, pp.585-596, Vol. 8, No. 5, Sep. 2000.

[11] Utterance Verification (outlier rejection): M. G. Rahim, C.-H. Lee and B.-H. Juang, “Discriminative Utterance Verification for Connected Digits Recognition,” IEEE Trans. on Speech and Audio Processing, pp.266-277, Vol. 5, No.3, May 1997.

[12] Bayesian Approach to Speaker Verification: H. Jiang and L. Deng, “A Bayesian Approach to the Verification problem: Applications to Speaker Verification,” IEEE Trans. on Speech and Audio Processing, pp.874-884, Vol. 9, No. 8, Nov. 2001.

[13] Latent Semantic Analysis for LM: J. R. Bellegarda, “Exploiting Latent Semantic Information in Statistical Language Modeling,” Proceedings of IEEE, pp.1279-1296, Vol.88, No. 8, August 2000.

*[14] Speech Understanding: T. Kawahara, C.-H. Lee and B.-H. Juang, “Flexible speech understanding based on combined key-phrase detection and verification,” IEEE Trans. on Speech and Audio Processing, pp.558-568, Vol. 6, No. 6, November 1998.

[15] Statistical machine translation: P. F. Brown, et. al., “The Mathematics of Statistical Machine Translation: Parameter Estimation,” Computational Linguistics, pp.263-297, Vol. 19, No. 2, 1993.

[16] Support Vector Machines: C. J. Burges, “A Tutorial on Support Vector Machines for Pattern Recognition,” Data Mining and Knowledge Discovery, 2, 121-167 (1998).

*[17] Weighted Finite State Transducer (WFST) Optimization: M. Mohri, F. Pereira and M. Riley, “Speech Recognition with Weighted Finite-State Transducers,” In Larry Rabiner and Fred Juang, editors, Handbook on Speech Processing and Speech Communication, Part E: Speech recognition. Springer-Verlag, Heidelberg, Germany, 2007.

*[18] Graphical models: Graphical models, Chapter 8 of Pattern Recognition and Machine Learning by Christopher M. Bishop.

[19] Boosting: Y. Freund and R. E. Schapire, “A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting,” Journal of computer and system sciences 55, 119-139 (1997).