Corpus Research in China: A Bird’s-Eye View of Corpus Linguistics in China
Speaker: Prof. Zhiwei FENG
University: Institute of Applied Linguistics, Hangzhou Normal University, China
Time: 4:00pm (coffee: 3:30)
A brief review of the development of corpus research.
The development and present state of corpus linguistics in China: early corpora, large-scale authentic text corpora, national corpora, speech corpora, bilingual corpora, corpora of minority languages in China, and treebanks.
Corpus processing techniques: automatic word segmentation of Chinese text, automatic POS tagging, automatic phrase-structure annotation, automatic alignment of bilingual corpora, and complex networks.
Several problems in current corpus research: standardization of corpus specifications, sharing of language resources, intellectual property rights, etc.