The DARE Corpus: A Resource for Anaphora Resolution in Dialogue Based Intelligent Tutoring Systems

Nobal Niraula, Vasile Rus, Rajendra Banjade, Dan Stefanescu, William Baggett, Brent Morgan


Abstract
We describe the DARE corpus, an annotated data set focusing on pronoun resolution in tutorial dialogue. Although data sets for general purpose anaphora resolution exist, they are not suitable for dialogue based Intelligent Tutoring Systems. To the best of our knowledge, no data set is currently available for pronoun resolution in dialogue based intelligent tutoring systems. The described DARE corpus consists of 1,000 annotated pronoun instances collected from conversations between high-school students and the intelligent tutoring system DeepTutor. The data set is publicly available.
Anthology ID:
L14-1320
Volume:
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
Month:
May
Year:
2014
Address:
Reykjavik, Iceland
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
3199–3203
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/372_Paper.pdf
DOI:
Bib Export formats:
BibTeX MODS XML EndNote
PDF:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/372_Paper.pdf