Tagging other things#

This tutorial gives you a tour of other crazy models shipped with Flair. These include:

tagging semantic frames
chunking text
relation extraction
others

Let’s get started!

Semantic Frame Detection#

For English, we provide a pre-trained model that detects semantic frames in text, trained using Propbank 3.0 frames. This provides a sort of word sense disambiguation for frame evoking words, and we are curious what researchers might do with this.

Here’s an example:

from flair.nn import Classifier
from flair.data import Sentence

# load model
tagger = Classifier.load('frame')

# make English sentence
sentence = Sentence('George returned to Berlin to return his hat.')

# predict NER tags
tagger.predict(sentence)

# go through tokens and print predicted frame (if one is predicted)
for token in sentence:
    print(token)

This should print:

Token[0]: "George"
Token[1]: "returned" → return.01 (0.9951)
Token[2]: "to"
Token[3]: "Berlin"
Token[4]: "to"
Token[5]: "return" → return.02 (0.6361)
Token[6]: "his"
Token[7]: "hat"
Token[8]: "."

As we can see, the frame detector makes a distinction in the sentence between two different meanings of the word ‘return’. ‘return.01’ means returning to a location, while ‘return.02’ means giving something back.

Syntactic Chunking#

For English, we provide a model for chunking verb and noun phrases, trained using CoNLL 2000.

from flair.nn import Classifier
from flair.data import Sentence

# load model
tagger = Classifier.load('chunk')

# make English sentence
sentence = Sentence('The quick brown fox jumps over the lazy dog.')

# predict NER tags
tagger.predict(sentence)

# print the chunks
for chunk in sentence.get_labels():
  print(chunk)

This should print:

Span[0:4]: "The quick brown fox" → NP (0.9914)
Span[4:5]: "jumps" → VP (1.0)
Span[5:6]: "over" → PP (0.9967)
Span[6:9]: "the lazy dog" → NP (0.9991)

This tells us for instance that “the quick brown fox” and “the lazy dog” form syntactic units in this sentence.

Tagging Relations#

Relations hold between two entities. For instance, a text like “George was born in Washington” names two entities and also expresses that there is a born_in relationship between both.

We added an experimental relation extraction model trained over a modified version of TACRED. You must use this model together with an entity tagger. Here is an example:

from flair.data import Sentence
from flair.nn import Classifier

# 1. make example sentence
sentence = Sentence("George was born in Washington")

# 2. load entity tagger and predict entities
tagger = Classifier.load('ner-fast')
tagger.predict(sentence)

# check which named entities have been found in the sentence
entities = sentence.get_labels('ner')
for entity in entities:
    print(entity)

# 3. load relation extractor
extractor = Classifier.load('relations')

# predict relations
extractor.predict(sentence)

# check which relations have been found
relations = sentence.get_labels('relation')
for relation in relations:
    print(relation)
print("")

# Use the `get_labels()` method with parameter 'relation' to iterate over all relation predictions. 
for label in sentence.get_labels('relation'):
    print(label)

This should print:

Span[0:1]: "George" → PER (0.9971)
Span[4:5]: "Washington" → LOC (0.9847)

Relation[0:1][4:5]: "George -> Washington" → born_in (1.0)

Indicating that a born_in relationship holds between “George” and “Washington”!

List of Other Models#

We end this section with a list of all other models we currently ship with Flair:

ID	Task	Language	Training Dataset	Accuracy	Contributor / Notes
‘chunk’	Chunking	English	Conll-2000	96.47 (F1)
‘chunk-fast’	Chunking	English	Conll-2000	96.22 (F1)	(fast model)
‘frame’	Frame Detection	English	Propbank 3.0	97.54 (F1)
‘frame-fast’	Frame Detection	English	Propbank 3.0	97.31 (F1)	(fast model)
‘negation-speculation’	Negation / speculation	English	Bioscope	80.2 (F1)
‘de-historic-indirect’	historical indirect speech	German	@redewiedergabe project	87.94 (F1)	redewiedergabe
‘de-historic-direct’	historical direct speech	German	@redewiedergabe project	87.94 (F1)	redewiedergabe
‘de-historic-reported’	historical reported speech	German	@redewiedergabe project	87.94 (F1)	redewiedergabe
‘de-historic-free-indirect’	historical free-indirect speech	German	@redewiedergabe project	87.94 (F1)	redewiedergabe

Next#

Congrats, you learned about some other models we have in Flair!

So far, we only focused on predicting for single sentences. Next, let’s discuss how to create predictions for a whole corpus of documents.

Table of Contents

Tagging other things#

Semantic Frame Detection#

Syntactic Chunking#

Tagging Relations#

List of Other Models#

Next#