Comparing the Intrinsic Performance of Clinical Concept Embeddings by Their Field of Medicine
John-Jose Nunez | Giuseppe Carenini
Proceedings of the Tenth International Workshop on Health Text Mining and Information Analysis (LOUHI 2019)
Pre-trained word embeddings are becoming increasingly popular for natural language processing tasks. This includes medical applications, where embeddings are trained for clinical concepts using specific medical data. Recent work continues to improve on these embeddings. However, no one has yet sought to determine whether these embeddings work as well for one field of medicine as they do in others. In this work, we use intrinsic methods to evaluate embeddings from the various fields of medicine as defined by their ICD-9 systems. We find significant differences between fields, and motivate future work to investigate whether extrinsic tasks will follow a similar pattern.