CYUT-III System at Chinese Grammatical Error Diagnosis Task

Po-Lin Chen, Shih-Hung Wu, Liang-Pu Chen, Ping-Che Yang


Abstract
This paper describe the CYUT-III system on grammar error detection in the 2016 NLP-TEA Chinese Grammar Error Detection shared task CGED. In this task a system has to detect four types of errors, in-cluding redundant word error, missing word error, word selection error and word ordering error. Based on the conditional random fields (CRF) model, our system is a linear tagger that can detect the errors in learners’ essays. Since the system performance depends on the features heavily, in this paper, we are going to report how to integrate the collocation feature into the CRF model. Our system presents the best detection accuracy and Identification accuracy on the TOCFL dataset, which is in traditional Chi-nese. The same system also works well on the simplified Chinese HSK dataset.
Anthology ID:
W16-4909
Volume:
Proceedings of the 3rd Workshop on Natural Language Processing Techniques for Educational Applications (NLPTEA2016)
Month:
December
Year:
2016
Address:
Osaka, Japan
Venues:
NLP-TEA | WS
SIG:
Publisher:
The COLING 2016 Organizing Committee
Note:
Pages:
63–72
Language:
URL:
https://www.aclweb.org/anthology/W16-4909
DOI:
Bib Export formats:
BibTeX MODS XML EndNote
PDF:
http://aclanthology.lst.uni-saarland.de/W16-4909.pdf