The Effect of Linguistic Parameters in CLIR Performance

Carl Rubino


Abstract
This paper will detail how IARPA’s MATERIAL Cross-Language Information Retrieval (CLIR) program investigated certain linguistic parameters to guide language choice, data collection and partitioning, and understand evaluation results. Discerning which linguistic parameters correlated with overall performance enabled the evaluation of progress when different languages were measured, and also was an important factor in determining the most effective CLIR pipeline design, customized to handle language-specific properties deemed necessary to address.
Anthology ID:
2020.clssts-1.1
Volume:
Proceedings of the workshop on Cross-Language Search and Summarization of Text and Speech (CLSSTS2020)
Month:
May
Year:
2020
Address:
Marseille, France
Venues:
CLSSTS | LREC | WS
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
1–6
Language:
English
URL:
https://www.aclweb.org/anthology/2020.clssts-1.1
DOI:
Bib Export formats:
BibTeX MODS XML EndNote
PDF:
http://aclanthology.lst.uni-saarland.de/2020.clssts-1.1.pdf