Carolyn Haberkern


pdf bib
Using BERT for Qualitative Content Analysis in Psychosocial Online Counseling
Philipp Grandeit | Carolyn Haberkern | Maximiliane Lang | Jens Albrecht | Robert Lehmann
Proceedings of the Fourth Workshop on Natural Language Processing and Computational Social Science

Qualitative content analysis is a systematic method commonly used in the social sciences to analyze textual data from interviews or online discussions. However, this method usually requires high expertise and manual effort because human coders need to read, interpret, and manually annotate text passages. This is especially true if the system of categories used for annotation is complex and semantically rich. Therefore, qualitative content analysis could benefit greatly from automated coding. In this work, we investigate the usage of machine learning-based text classification models for automatic coding in the area of psycho-social online counseling. We developed a system of over 50 categories to analyze counseling conversations, labeled over 10.000 text passages manually, and evaluated the performance of different machine learning-based classifiers against human coders.