flair.datasets.document_classification#
The AG's News Topic Classification Corpus, classifying news into 4 coarse-grained topics.  | 
|
A very large corpus of Amazon reviews with positivity ratings.  | 
|
The Communicative Functions Classification Corpus.  | 
|
Classification corpus instantiated from CSV data files.  | 
|
Dataset for text classification from CSV column formatted data.  | 
|
A classification corpus from FastText-formatted text files.  | 
|
Dataset for classification instantiated from a single FastText-formatted file.  | 
|
GermEval 2018 corpus for identification of offensive language.  | 
|
Corpus of Linguistic Acceptability from GLUE benchmark.  | 
|
GoEmotions dataset containing 58k Reddit comments labeled with 27 emotion categories.  | 
|
Corpus of IMDB movie reviews labeled by sentiment (POSITIVE, NEGATIVE).  | 
|
20 newsgroups corpus, classifying news items into one of 20 categories.  | 
|
The customer reviews dataset of SentEval, classified into NEGATIVE or POSITIVE sentiment.  | 
|
The opinion-polarity dataset of SentEval, classified into NEGATIVE or POSITIVE polarity.  | 
|
The movie reviews dataset of SentEval, classified into NEGATIVE or POSITIVE sentiment.  | 
|
The Stanford sentiment treebank dataset of SentEval, classified into NEGATIVE or POSITIVE sentiment.  | 
|
The Stanford sentiment treebank dataset of SentEval, classified into 5 sentiment classes.  | 
|
The subjectivity dataset of SentEval, classified into SUBJECTIVE or OBJECTIVE sentiment.  | 
|
Twitter sentiment corpus.  | 
|
Stackoverflow corpus classifying questions into one of 20 labels.  | 
|
The TREC Question Classification Corpus, classifying questions into 50 fine-grained answer types.  | 
|
The TREC Question Classification Corpus, classifying questions into 6 coarse-grained answer types.  | 
|
WASSA-2017 anger emotion-intensity corpus.  | 
|
WASSA-2017 fear emotion-intensity corpus.  | 
|
WASSA-2017 joy emotion-intensity dataset corpus.  | 
|
WASSA-2017 sadness emotion-intensity corpus.  | 
|
The YAHOO Question Classification Corpus, classifying questions into 10 coarse-grained answer types.  | 
|