flair.datasets.base.StringDataset
- class flair.datasets.base.StringDataset(texts, use_tokenizer=<flair.tokenization.SpaceTokenizer object>)
Bases: FlairDataset
A dataset that takes a string or a list of strings as input and returns a Sentence for each item during iteration.
- __init__(texts, use_tokenizer=<flair.tokenization.SpaceTokenizer object>)
Instantiate StringDataset.
- Parameters:
texts (Union[str, list[str]]) – a string or list of strings that make up the StringDataset
use_tokenizer (Union[bool, Tokenizer]) – Custom tokenizer to use. If this parameter is set to True instead of a tokenizer instance, flair.tokenization.SegtokTokenizer will be used.
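The contract described by these parameters can be sketched in plain Python without importing flair. This is an illustrative stand-in, not flair's implementation: the names `StringDatasetSketch` and `space_tokenize` are hypothetical, items are returned as token lists rather than Sentence objects, and wrapping a single string into a one-element list is an assumption about how a bare `str` is handled.

```python
from typing import Callable, List, Union


def space_tokenize(text: str) -> List[str]:
    """Hypothetical stand-in for a tokenizer: split on spaces, drop empty pieces."""
    return [tok for tok in text.split(" ") if tok]


class StringDatasetSketch:
    """Illustrative stand-in for the documented behaviour (NOT flair's code)."""

    def __init__(
        self,
        texts: Union[str, List[str]],
        tokenizer: Callable[[str], List[str]] = space_tokenize,
    ) -> None:
        # Assumption: a single string is treated as a one-element dataset.
        self.texts = [texts] if isinstance(texts, str) else list(texts)
        self.tokenizer = tokenizer

    def __len__(self) -> int:
        return len(self.texts)

    def __getitem__(self, index: int) -> List[str]:
        # Only raw strings are stored; tokenization happens on access.
        return self.tokenizer(self.texts[index])


dataset = StringDatasetSketch(["The grass is green.", "Berlin is a city."])
tokens = [dataset[i] for i in range(len(dataset))]

# A bare string is accepted as well and yields a dataset of length one.
single = StringDatasetSketch("Hello world")
```

Storing only the raw strings and tokenizing on access keeps construction cheap. The trade-off is that each access tokenizes the text again.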
Methods
__init__(texts[, use_tokenizer]) – Instantiate StringDataset.
- abstract is_in_memory()
- Return type:
bool