Chinese Spelling Check based on N-gram and String Matching Algorithm

Jui-Feng Yeh, Li-Ting Chang, Chan-Yi Liu, Tsung-Wei Hsu


Abstract
This paper presents a Chinese spelling check approach based on language models combined with string match algorithm to treat the problems resulted from the influence caused by Cantonese mother tone. N-grams first used to detecting the probability of sentence constructed by the writers, a string matching algorithm called Knuth-Morris-Pratt (KMP) Algorithm is used to detect and correct the error. According to the experimental results, the proposed approach can detect the error and provide the corresponding correction.
Anthology ID:
W17-5906
Volume:
Proceedings of the 4th Workshop on Natural Language Processing Techniques for Educational Applications (NLPTEA 2017)
Month:
December
Year:
2017
Address:
Taipei, Taiwan
Venues:
NLP-TEA | WS
SIG:
Publisher:
Asian Federation of Natural Language Processing
Note:
Pages:
35–38
Language:
URL:
https://www.aclweb.org/anthology/W17-5906
DOI:
Bib Export formats:
BibTeX MODS XML EndNote
PDF:
http://aclanthology.lst.uni-saarland.de/W17-5906.pdf