Automating Violence Detection and Categorization from Ancient Texts

Link:
Autor/in:
Beteiligte Personen:
  • Kazantseva, Anna
  • Szpakowicz, Stan
  • Degaetano-Ortlieb, Stefania
  • Bizzoni, Yuri
  • Pagel, Janis
Verlag/Körperschaft:
Association for Computational Linguistics
Erscheinungsjahr:
2025
Medientyp:
Text
Beschreibung:
  • Violence descriptions in literature offer valuable insights for a wide range of research in the humanities. For historians, depictions of violence are of special interest for analyzing the societal dynamics surrounding large wars and individual conflicts of influential people. Harvesting data for violence research manually is laborious and time-consuming. This study is the first one to evaluate the effectiveness of large language models (LLMs) in identifying violence in ancient texts and categorizing it across multiple dimensions. Our experiments identify LLMs as a valuable tool to scale up the accurate analysis of historical texts and show the effect of fine-tuning and data augmentation, yielding an F1-score of up to 0.93 for violence detection and 0.86 for fine-grained violence categorization.
Lizenz:
  • info:eu-repo/semantics/openAccess
Quellsystem:
Forschungsinformationssystem der UHH

Interne Metadaten
Quelldatensatz
oai:www.edit.fis.uni-hamburg.de:publications/7ee2a297-3464-4505-a57a-bd10bd3e6b8f