Comparing Methods for Measuring Dialect Similarity in Norwegian
Janne Johannessen | Andre Kåsen | Kristin Hagen | Anders Nøklestad | Joel Priestley
Proceedings of the 12th Language Resources and Evaluation Conference
This article presents four experiments with two methods for measuring dialect similarity in Norwegian: the Levenshtein method and a neural long short-term memory (LSTM) autoencoder network, a machine learning algorithm. The visual output, in the form of dialect maps, is then compared with canonical maps found in the dialect literature. The results indicate that fine-grained transcriptions of speech are not needed to replicate classical classification patterns.
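As a rough illustration of the Levenshtein method named in the abstract, the sketch below computes a length-normalized edit distance between aligned word forms from two dialects and averages it into a single dialect distance. The abstract does not specify the variant used in the experiments, so the normalization by the longer form, the function names `levenshtein` and `dialect_distance`, and the example word forms are all assumptions made for illustration, not the authors' implementation or data.

```python
from typing import Sequence


def levenshtein(a: str, b: str) -> int:
    """Return the minimum number of single-character insertions,
    deletions, and substitutions needed to turn a into b."""
    if len(a) < len(b):
        a, b = b, a  # keep the inner row as short as possible
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # deletion
                current[j - 1] + 1,      # insertion
                previous[j - 1] + cost,  # substitution or match
            ))
        previous = current
    return previous[-1]


def dialect_distance(forms_a: Sequence[str], forms_b: Sequence[str]) -> float:
    """Average length-normalized Levenshtein distance over aligned word forms.

    Assumption: forms_a[i] and forms_b[i] are realizations of the same word
    in two dialects. Normalizing by the longer form is one common choice,
    not necessarily the one used in the paper.
    """
    if len(forms_a) != len(forms_b) or not forms_a:
        raise ValueError("word lists must be non-empty and of equal length")
    total = 0.0
    for x, y in zip(forms_a, forms_b):
        longest = max(len(x), len(y))
        total += levenshtein(x, y) / longest if longest else 0.0
    return total / len(forms_a)


# Hypothetical orthographic forms, chosen only to illustrate the computation.
place_a = ["ikke", "jeg"]
place_b = ["ikkje", "eg"]
print(round(dialect_distance(place_a, place_b), 3))
```

Computing this value for every pair of measurement sites yields a distance matrix, which could then be clustered or projected to produce a dialect map of the kind the abstract compares against the canonical maps.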