bidirectional encoder representations from transformers
deep learning
multi-view learning
neural machine translation
weak supervision
Beschreibung:
Text-based emotion recognition plays a crucial role in various domains and applications due to its significance in understanding human behavior and improving communication systems. Although extensive research has been conducted for texts in English, only few studies deal with non-English languages. Especially for the German language, previous work is limited due to the focus on specific text domains and the usage of lexicon-based approaches, which lack the ability to consider contextual information. In this paper, an approach for emotion recognition on German text corpora is presented, which addresses these challenges with domain-generalized and context-sensitive methodologies. To achieve domain generalization, neural machine translation and weak supervision techniques are combined with multi-view learning, in which various data sources from different domains are utilized to improve the generalizability and to overcome the problem of lack of data. Using the BERT model as the primary architecture to capture contextual information, an overall F1-score of 65.5% and an accuracy of 68.1% are achieved while maintaining well-balanced results on the metrics over all considered domains and emotions. To ensure the legitimacy and reliability of the findings, further comprehensive evaluations and comparisons are conducted. Given the absence of an established benchmark specifically for emotion recognition on German text corpora, the used datasets, evaluations, and results can serve for benchmarking purposes.