Chunyu Kit

Also published as: Chun-yu Kit


2020

pdf bib
Multi-choice Relational Reasoning for Machine Reading Comprehension
Wuya Chen | Xiaojun Quan | Chunyu Kit | Zhengcheng Min | Jiahai Wang
Proceedings of the 28th International Conference on Computational Linguistics

This paper presents our study of cloze-style reading comprehension by imitating human reading comprehension, which normally involves tactical comparing and reasoning over candidates while choosing the best answer. We propose a multi-choice relational reasoning (McR2) model with an aim to enable relational reasoning on candidates based on fusion representations of document, query and candidates. For the fusion representations, we develop an efficient encoding architecture by integrating the schemes of bidirectional attention flow, self-attention and document-gated query reading. Then, comparing and inferring over candidates are executed by a novel relational reasoning network. We conduct extensive experiments on four datasets derived from two public corpora, Children’s Book Test and Who DiD What, to verify the validity and advantages of our model. The results show that it outperforms all baseline models significantly on the four benchmark datasets. The effectiveness of its key components is also validated by an ablation study.

2013

pdf bib
Finding More Bilingual Webpages with High Credibility via Link Analysis
Chengzhi Zhang | Xuchen Yao | Chunyu Kit
Proceedings of the Sixth Workshop on Building and Using Comparable Corpora

pdf bib
Non-Monotonic Sentence Alignment via Semisupervised Learning
Xiaojun Quan | Chunyu Kit | Yan Song
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)

2012

pdf bib
Higher-order Constituent Parsing and Parser Combination
Xiao Chen | Chunyu Kit
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)

pdf bib
Semi-automatic Annotation of Chinese Word Structure
Jianqiang Ma | Chunyu Kit | Dale Gerdemann
Proceedings of the Second CIPS-SIGHAN Joint Conference on Chinese Language Processing

pdf bib
Entropy-based Training Data Selection for Domain Adaptation
Yan Song | Prescott Klassen | Fei Xia | Chunyu Kit
Proceedings of COLING 2012: Posters

pdf bib
Extending Machine Translation Evaluation Metrics with Lexical Cohesion to Document Level
Billy T. M. Wong | Chunyu Kit
Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning

2011

pdf bib
Improving Part-of-speech Tagging for Context-free Parsing
Xiao Chen | Chunyu Kit
Proceedings of 5th International Joint Conference on Natural Language Processing

2010

pdf bib
The Parameter-Optimized ATEC Metric for MT Evaluation
Billy Wong | Chunyu Kit
Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR

pdf bib
Reranking with Multiple Features for Better Transliteration
Yan Song | Chunyu Kit | Hai Zhao
Proceedings of the 2010 Named Entities Workshop

pdf bib
Bigram HMM with Context Distribution Clustering for Unsupervised Chinese Part-of-Speech tagging
Lidan Zhang | Kwok-Ping Chan | Chunyu Kit | Dongfeng Cai
CIPS-SIGHAN Joint Conference on Chinese Language Processing

pdf bib
Automatic Identification of Predicate Heads in Chinese Sentences
Xiaona Ren | Qiaoli Zhou | Chunyu Kit | Dongfeng Cai
CIPS-SIGHAN Joint Conference on Chinese Language Processing

pdf bib
Active Learning Based Corpus Annotation
Hongyan Song | Tianfang Yao | Chunyu Kit | Dongfeng Cai
CIPS-SIGHAN Joint Conference on Chinese Language Processing

pdf bib
Combine Person Name and Person Identity Recognition and Document Clustering for Chinese Person Name Disambiguation
Ruifeng Xu | Jun Xu | Xiangying Dai | Chunyu Kit
CIPS-SIGHAN Joint Conference on Chinese Language Processing

pdf bib
HITSZ_CITYU: Combine Collocation, Context Words and Neighboring Sentence Sentiment in Sentiment Adjectives Disambiguation
Ruifeng Xu | Jun Xu | Chunyu Kit
Proceedings of the 5th International Workshop on Semantic Evaluation

pdf bib
How Large a Corpus Do We Need: Statistical Method Versus Rule-based Method
Hai Zhao | Yan Song | Chunyu Kit
Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)

We investigate the impact of input data scale in corpus-based learning using a study style of Zipf’s law. In our research, Chinese word segmentation is chosen as the study case and a series of experiments are specially conducted for it, in which two types of segmentation techniques, statistical learning and rule-based methods, are examined. The empirical results show that a linear performance improvement in statistical learning requires an exponential increasing of training corpus size at least. As for the rule-based method, an approximate negative inverse relationship between the performance and the size of the input lexicon can be observed.

2009

pdf bib
Semantic Dependency Parsing of NomBank and PropBank: An Efficient Integrated Approach via a Large-scale Feature Selection
Hai Zhao | Wenliang Chen | Chunyu Kit
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing

pdf bib
Multilingual Dependency Learning: A Huge Feature Engineering Method to Semantic Dependency Parsing
Hai Zhao | Wenliang Chen | Chunyu Kit | Guodong Zhou
Proceedings of the Thirteenth Conference on Computational Natural Language Learning (CoNLL 2009): Shared Task

pdf bib
Transliteration of Name Entity via Improved Statistical Translation on Character Sequences
Yan Song | Chunyu Kit | Xiao Chen
Proceedings of the 2009 Named Entities Workshop: Shared Task on Transliteration (NEWS 2009)

pdf bib
Cross Language Dependency Parsing using a Bilingual Lexicon
Hai Zhao | Yan Song | Chunyu Kit | Guodong Zhou
Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP

2008

pdf bib
An Empirical Comparison of Goodness Measures for Unsupervised Chinese Word Segmentation with a Unified Framework
Hai Zhao | Chunyu Kit
Proceedings of the Third International Joint Conference on Natural Language Processing: Volume-I

pdf bib
Unsupervised Segmentation Helps Supervised Learning of Character Tagging for Word Segmentation and Named Entity Recognition
Hai Zhao | Chunyu Kit
Proceedings of the Sixth SIGHAN Workshop on Chinese Language Processing

pdf bib
An Improved Corpus Comparison Approach to Domain Specific Term Recognition
Xiaoyue Liu | Chunyu Kit
Proceedings of the 22nd Pacific Asia Conference on Language, Information and Computation

pdf bib
Parsing Syntactic and Semantic Dependencies with Two Single-Stage Maximum Entropy Models
Hai Zhao | Chunyu Kit
CoNLL 2008: Proceedings of the Twelfth Conference on Computational Natural Language Learning

2005

pdf bib
Period Disambiguation with Maxent Model
Chunyu Kit | Xiaoyue Liu
Second International Joint Conference on Natural Language Processing: Full Papers

pdf bib
An Example-Based Chinese Word Segmentation System for CWSB-2
Chunyu Kit | Xiaoyue Liu
Proceedings of the Fourth SIGHAN Workshop on Chinese Language Processing

pdf bib
Harvesting the Bitexts of the Laws of Hong Kong From the Web
Chunyu Kit | Xiaoyue Liu | KingKui Sin | Jonathan J. Webster
Proceedings of the Fifth Workshop on Asian Language Resources (ALR-05) and First Symposium on Asian Language Resources Network (ALRN)

2003

pdf bib
Integrating Ngram Model and Case-based Learning for Chinese Word Segmentation
Chunyu Kit | Zhiming Xu | Jonathan J. Webster
Proceedings of the Second SIGHAN Workshop on Chinese Language Processing

2002

pdf bib
Learning Case-based Knowledge for Disambiguating Chinese Word Segmentation: A Preliminary Study
Chunyu Kit | Haihua Pan | Hongbiao Chen
COLING-02: The First SIGHAN Workshop on Chinese Language Processing

1999

pdf bib
Unsupervised Learning of Word Boundary with Description Length Gain
Chunyu Kit | Yorick Wilks
EACL 1999: CoNLL-99 Computational Natural Language Learning

1994

pdf bib
Automatic Terminology Extraction For Thematic Corpus Based On Subterm Co-Occurrence
Chun-yu Kit
Proceedings of Rocling VII Computational Linguistics Conference VII

1992

pdf bib
Tokenization as the Initial Phase in NLP
Jonathan J. Webster | Chunyu Kit
COLING 1992 Volume 4: The 15th International Conference on Computational Linguistics

1991

pdf bib
Automatic Chinese Text Generation Based On Inference Trees
Hing-Lung Lin | Benjamin K. T’sou | Hing-Cheung Ho | Bong-Yeung Lai | Suen Caesar Lun | Chi-Yuen Choi | Chun-yu Kit
Proceedings of Rocling IV Computational Linguistics Conference IV