Encoder-decoder models for latent phonological representations of words
Cassandra L. Jacobs | Fred Mailhot
Proceedings of the 16th Workshop on Computational Research in Phonetics, Phonology, and Morphology
We use sequence-to-sequence networks trained on sequential phonetic encoding tasks to construct compositional phonological representations of words. We show that the output of an encoder network can predict the phonetic durations of American English words better than a number of alternative forms. We also show that the model’s learned representations map onto existing measures of words’ phonological structure (phonological neighborhood density and phonotactic probability).