Computational methods for the analysis of fiction genres

Andreas van Cranenburgh, Laura Allen, Serge Sharoff, K.H. van Dalen-Oskam

Research output: Chapter in book/volumeChapterScientificpeer-review

Abstract

This chapter presents a multimethod, multidisciplinary analysis of genre in a large dataset of 9,800 English novels, in order to deepen our understanding of aspects of fiction genres and subgenres. We specifically focus on applying well-established, interpretable methods, in order to benefit scholars from a variety of disciplines. Objects of our analysis are written texts and the linguistic features of the texts. We approach the analysis from two directions: data-driven, with topic modeling of content words, and theory-driven, with features Douglas Biber selected for his research on register, for example, in his 1988 book Variation across Speech and Writing, as well as simple readability metrics. We illustrate these methods by applying them to a corpus of fiction (novels). The texts in our corpora are English, but our methods aim to be also applicable to corpora in other languages. The research questions we try to answer with the proposed methods are whether different kinds of novels (“subgenres”) can be distinguished from each other in their use of linguistic features, and what the results of the computational methods can reveal to researchers to assist them in a renewed qualitative analysis of the texts or in phrasing new hypotheses for further research. Code and data are available at https://github.com/andreasvc/fictiongenres/

Original languageEnglish
Title of host publicationMultidisciplinary Views on Discourse Genre
Subtitle of host publicationA Research Agenda
EditorsNinke Stukker, John A. Bateman, Danielle McNamara, Wilbert Spooren
PublisherRoutledge
Chapter6
Pages135-167
Number of pages33
ISBN (Electronic)9781003335603
ISBN (Print)9781032371610
Publication statusPublished - 01 Oct 2024

Publication series

NameRoutledge research in language and communication

Keywords

  • digital humanities
  • genre
  • fiction
  • computational literary studies

Fingerprint

Dive into the research topics of 'Computational methods for the analysis of fiction genres'. Together they form a unique fingerprint.

Cite this