CS26001 Intelligent Speech Technology 智能语音技术

 

课程名称 (Course Name) Intelligent Speech Technology

课程代码 (Course Code) CS26001

学分/学时 (Credits/Credit Hours) 3.0/48

开课时间 (Course Term )  Spring

开课学院(School Providing the Course:  Computer Science Department

任课教师(Teacher:  Kai Yu

课程讨论时数(Course Discussion Hours:  4

课程实验数(Lab Hours:   4

课程内容简介(Course Introduction):

This course introduces the basic theory and tools for intelligent speech technology, with a focus on automatic speech recognition. It includes fundamental theories of pattern recognition and machine learning, acoustic modeling (hidden Markov model), language model (N-gram model), and advanced techniques of large vocabulary continuous speech recognition.

教学大纲(Course Teaching Outline):

1.    Overview Intelligent Speech Technology

2.    Probability and Stochastic Process

3.    Pattern Recognition

4.    Bayesian decision theory

5.    Expectation Algorithm and Gaussian Mixture Model

6.    Speech Processing and Feature Extraction

7.    Hidden Markov Model

8.    Statistical language model

9.    Decoding algorithm

10. Issues of large vocabulary continuous speech recognition

课程进度计划(Course Schedule):

Week 1-5: Pattern Recognition and Bayesian decision theory

Week 6: EM algorithm and Gaussian mixture model

Week 7-8: Speech processing and feature extraction

Week 9-10: hidden Markov model

Week 11: Statistical language model

Week 12: Advanced techniques of LVCSR

课程考核要求(Course Assessment Requirements)

Evaluation will be based on two course works and one group presentation.

参考文献(Course References)

Xuedong Huang, Alex Acero, Raj Reddy and Hsiao-Wuen Hon, Spoken Language Processing: A Guide to Theory, Algorithm and System Development. Prentice Hall, 2001.

S. J. Young, D. Kershaw, J. J. Odell, D. Ollason, V. Valtchev, and P. C. Woodland. The HTK Book (for HTK version 3.0). Cambridge University Engineering Department, 2000.

预修课程(Prerequisite Course

Calculus, Probability, Linear Algebra

[ 2015-11-26 ]