Feature Discovery for Diachronic Register Analysis: a Semi-Automatic Approach

Stefania Degaetano-Ortlieb, Ekaterina Lapshinova-Koltunski, Elke Teich


Abstract
In this paper, we present corpus-based procedures to semi-automatically discover features relevant for the study of recent language change in scientific registers. First, linguistic features potentially adherent to recent language change are extracted from the SciTex Corpus. Second, features are assessed for their relevance for the study of recent language change in scientific registers by means of correspondence analysis. The discovered features will serve for further investigations of the linguistic evolution of newly emerged scientific registers.
Anthology ID:
L12-1111
Volume:
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
Month:
May
Year:
2012
Address:
Istanbul, Turkey
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
2786–2790
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/268_Paper.pdf
DOI:
Bib Export formats:
BibTeX MODS XML EndNote
PDF:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/268_Paper.pdf