Download
Abstract
We introduce a new dataset for multi-class emotion analysis from long-form narratives in English. The Dataset for Emotions of Narrative Sequences (DENS) was collected from both classic literature available on Project Gutenberg and modern online narratives avail- able on Wattpad, annotated using Amazon Mechanical Turk. A number of statistics and baseline benchmarks are provided for the dataset. Of the tested techniques, we find that the fine-tuning of a pre-trained BERT model achieves the best results, with an average micro-F1 score of 60.4%. Our results show that the dataset provides a novel opportunity in emotion analysis that requires moving beyond existing sentence-level techniques.
Citation
Chen Liu, Osama Muhammad, & Anderson de Andrade. (2019). “DENS: A dataset for multi-class emotion analysis.” EMNLP.
@inproceedings{DBLP:conf/emnlp/LiuOA19,
author = {Chen Liu and
Muhammad Osama and
Anderson de Andrade},
editor = {Kentaro Inui and
Jing Jiang and
Vincent Ng and
Xiaojun Wan},
title = {{DENS:} {A} Dataset for Multi-class Emotion Analysis},
booktitle = {Proceedings of the 2019 Conference on Empirical Methods in Natural
Language Processing and the 9th International Joint Conference on
Natural Language Processing, {EMNLP-IJCNLP} 2019, Hong Kong, China,
November 3-7, 2019},
pages = {6292--6297},
publisher = {Association for Computational Linguistics},
year = {2019},
url = {https://doi.org/10.18653/v1/D19-1656},
doi = {10.18653/V1/D19-1656},
timestamp = {Tue, 22 Aug 2023 20:03:10 +0200},
biburl = {https://dblp.org/rec/conf/emnlp/LiuOA19.bib},
bibsource = {dblp computer science bibliography, https://dblp.org}
}