Skip to content

Latest commit

 

History

History
59 lines (36 loc) · 4.93 KB

README.md

File metadata and controls

59 lines (36 loc) · 4.93 KB





EmotionArcs Static Badge

This is the repository for the data for our paper "EmotionArcs: Emotion Arcs for 9,000 Literary Texts", Öhman et al. (2024), at the 8th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature (LaTeCH-CLfL 2024), St. Julians, Malta.

It is a dataset comprising emotional arcs and intensities from over 9,000 English novels (1880-2000), assembled to understand the dynamics of emotions represented in text and how these emotions may influence a novel’s reception and perceived quality.

  • NRC emotions (anger, disgust, fear, anticipation, surprise, trust, sadness, joy)
  • NRC emotion dictionary expanded for the literary domain using Word2Vec

⚡ Data included

  • Data for 9,000 titles
  • Author, title & year
  • Emotion intensities
  • Emotion arcs

Available formats: .xlsx, .json


🔬 Example

BOOK_ID TITLE AUTH_FIRST AUTH_LAST PUBL_DATE ARC_ang ARC_fea ... INTENS_ang INTENS_joy ...
25732 Infinite Jest David Foster Wallace 1996 0.758,0.901... 1.451,1.601... ... 70.44 102.37 ...
20636 Dune Frank Herbert 1965 2.918,5.031... 4.164,4.231... ... 89.95 92.39 ...
22741 Beloved Toni Morrison 1987 7.603,5.461... 7.806,6.235... ... 63.46 136.63 ...
21974 The Gunslinger Stephen King 1982 2.627,0.581... 3.308,1.764... ... 84.02 102.07 ...
86 The Portrait of a Lady Henry James 1881 0.792,4.212... 2.381,4.672... ... 40.59 169.74 ...

📖 Documentation

Documentation
👩‍💻 Notebook A notebook showing an example usage of the arcs.
📄 Paper The EmotionArcs paper.
📚 Citation Bibtex citation.
🏆 The Chicago Corpus Data The Chicago Corpus (a linked dataset with additional data about the books).