READING LIST for COSC 6328 (Winter 2006)

 

Items marked with * are required for all students. The others are optional, for further exploration.

                                                                                                         

Category A: Applications and Background

 

*[A1] R. V. Cox, C. A. Kamm, L. R. Rabiner, J. Schroeter and J. G. Wilpon, “Speech and Language Processing for Next-Millennium Communications Services”, Proceedings of the IEEE, Vol. 88, No. 8, pp. 1314–1337, August 2000.

 

[A2] J. Makhoul, F. Kubala, T. Leek, D. Liu, L. Nguyen, R. Schwartz and A. Srivastava, “Speech and Language Technologies for Audio Indexing and Retrieval”, Proceedings of the IEEE, Vol. 88, No. 8, pp. 1338–1353, August 2000.

 

 

Category B: Math Foundation

 

[B1] Appendix “Mathematical Foundations”, excerpt from Pattern Classification (2nd Edition) by R. O. Duda, P. E. Hart and D. G. Stork (John Wiley & Sons, Inc., ISBN 0-471-05669-3).

 

Category C: Model Estimation

 

[C1] T. K. Moon, “The Expectation-Maximization Algorithm”, IEEE Signal Processing Magazine, pp. 47–60, Nov. 1996.

 

Category D: Pattern Verification

 

[D1] C.-H. Lee, “A Tutorial on Speaker and Speech Verification”, Proc. of NORSIG, 1998.

 

Category E: HMM

 

[E1] L. R. Rabiner, “A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition”, Proceedings of the IEEE, Vol. 77, No. 2, Feb. 1989. (Only pp. 257–267 are required.)

 

Category F: HMM Applications in Speech Recognition

 

[F1] Steve Young et al., “The HTK Book (for HTK version 3.4)”, the manual for the HMM Toolkit (HTK), Microsoft Corporation.

 

[F2] Steve Young, “Large Vocabulary Continuous Speech Recognition: A Review”, 1996.