This course features a complete set of lecture notes. In addition, an extensive bibliography of assigned and recommended readings is provided in the readings section.
This course is a graduate-level introduction to natural language processing, the primary concern of which is the study of human language from a computational perspective.
The class will cover models at the level of syntactic, semantic, and discourse processing. The emphasis will be on corpus-based methods and algorithms, such as Hidden Markov Models and probabilistic context-free grammars. We will discuss the use of these methods and models in a variety of applications, including syntactic parsing, information extraction, statistical machine translation, and summarization.
This subject qualifies as an Artificial Intelligence and Applications concentration subject.
Technical Requirements
File decompression software, such as Winzip® or StuffIt®, is required to open the .gz and .tar files found on this course site. Postscript viewer software, such as Ghostscript/Ghostview, can be used to view the .ps files found on this course site.
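For readers who prefer a script to a desktop tool, the .gz and .tar files can also be unpacked with Python's standard library. The following is a minimal sketch, not part of the course materials: the function name `unpack` and the file names in the usage note are hypothetical.

```python
import gzip
import shutil
import tarfile
from pathlib import Path


def unpack(archive: Path, dest: Path) -> None:
    """Unpack a .tar, .tar.gz, or plain .gz file into the directory dest.

    `unpack` is a hypothetical helper written for illustration only.
    """
    dest.mkdir(parents=True, exist_ok=True)
    if tarfile.is_tarfile(archive):
        # Mode "r:*" lets tarfile detect compression (none, gzip, ...) itself.
        with tarfile.open(archive, "r:*") as tar:
            # Use the safer "data" extraction filter where this Python has it.
            kwargs = {"filter": "data"} if hasattr(tarfile, "data_filter") else {}
            tar.extractall(dest, **kwargs)
    elif archive.suffix == ".gz":
        # A single gzip-compressed file, e.g. paper.ps.gz -> paper.ps
        target = dest / archive.stem
        with gzip.open(archive, "rb") as src, open(target, "wb") as out:
            shutil.copyfileobj(src, out)
    else:
        raise ValueError(f"Unsupported archive type: {archive}")
```

Assuming a downloaded file called `notes.tar.gz` (a made-up name), `unpack(Path("notes.tar.gz"), Path("notes"))` would place its contents in a `notes` directory. Any extracted .ps files still need a Postscript viewer such as Ghostscript/Ghostview.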