Document-term matrix for fiction genres from Underwood, Distant Horizons, chap. 2
genre_features.Rd
Word-frequency and other features extracted from fiction volumes, used by Underwood to model fiction genre in Distant Horizons, chap. 2. Only the features used in the regularized logistic regression models of science fiction, detective fiction, and Gothic that Underwood supplies in his reproduction repository are included; the accompanying source data files contain additional features that are omitted here.
Source
https://www.ideals.illinois.edu/items/105528 for the data; the files in https://github.com/tedunderwood/horizon/blob/master/chapter2/modeloutput determine which features are included.