Understanding Spacy's Scorer Output

Tags:

I'm evaluating a custom NER model that I built using Spacy. I'm evaluating the training sets using Spacy's Scorer class.

    def Eval(examples):
    # test the saved model
    print("Loading from", './model6/')
    ner_model = spacy.load('./model6/')

    scorer = Scorer()
    try:
        for input_, annot in examples:
            doc_gold_text = ner_model.make_doc(input_)
            gold = GoldParse(doc_gold_text, entities=annot['entities'])
            pred_value = ner_model(input_)
            scorer.score(pred_value, gold)
    except Exception as e: print(e)

    print(scorer.scores)

It works fine but I don't understand the output. Here's what I get for each training set.

{'uas': 0.0, 'las': 0.0, 'ents_p': 90.14084507042254, 'ents_r': 92.7536231884058, 'ents_f': 91.42857142857143, 'tags_acc': 0.0, 'token_acc': 100.0}

{'uas': 0.0, 'las': 0.0, 'ents_p': 91.12227805695142, 'ents_r': 93.47079037800687, 'ents_f': 92.28159457167091, 'tags_acc': 0.0, 'token_acc': 100.0}

{'uas': 0.0, 'las': 0.0, 'ents_p': 92.45614035087719, 'ents_r': 92.9453262786596, 'ents_f': 92.70008795074759, 'tags_acc': 0.0, 'token_acc': 100.0}

{'uas': 0.0, 'las': 0.0, 'ents_p': 94.5993031358885, 'ents_r': 94.93006993006993, 'ents_f': 94.76439790575917, 'tags_acc': 0.0, 'token_acc': 100.0}

{'uas': 0.0, 'las': 0.0, 'ents_p': 92.07920792079209, 'ents_r': 93.15525876460768, 'ents_f': 92.61410788381743, 'tags_acc': 0.0, 'token_acc': 100.0}

Does anyone know what the keys are? I've looked over Spacy's documentation and could not find anything.

Thanks!

871

asked Jun 01 '18 13:06

Evan Lalo

1 Answers

UAS (Unlabelled Attachment Score) and LAS (Labelled Attachment Score) are standard metrics to evaluate dependency parsing. UAS is the proportion of tokens whose head has been correctly assigned, LAS is the proportion of tokens whose head has been correctly assigned with the right dependency label (subject, object, etc).
ents_p, ents_r, ents_f are the precision, recall and fscore for the NER task.
tags_acc is the POS tagging accuracy.
token_acc seems to be the precision for token segmentation.

124

answered Sep 27 '22 20:09

mcoav

Related questions
                            
                                Importing text file : No Columns to parse from file
                            
                                ImportError: No module named 'botocore.parameters'
                            
                                hdf5 file to pandas dataframe
                            
                                what is the quickest way to iterate through a numpy array
                            
                                python 2.7 set and list remove time complexity
                            
                                pandas dataframe sort by date
                            
                                Is it possible to visualize keras embeddings in tensorboard?
                            
                                Sum along axis in numpy array
                            
                                Train multi-class image classifier in Keras
                            
                                Difference between subprocess.Popen preexec_fn and start_new_session in python
                            
                                Running scrapy with PyCharm - Debug works but Run does not work
                            
                                How can I re-upload package to pypi?
                            
                                Efficient pairwise DTW calculation using numpy or cython
                            
                                What is the effect of using pip to install python packages on anaconda?
                            
                                python 3.6 and ValueError: loop argument must agree with Future
                            
                                Python `socket.getaddrinfo` taking 5 seconds about 0.1% of requests
                            
                                How to check whether all values in a column satisfy a condition in Data Frame?
                            
                                RuntimeError: module compiled against API version 0xc but this version of numpy is 0xb
                            
                                How to prevent alphabetical sorting for python bars with matplotlib?
                            
                                pipenv: only works in a installed folder?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Understanding Spacy's Scorer Output

Tags:

python

named-entity-recognition

spacy

Evan Lalo

People also ask

1 Answers

mcoav

Recent Activity

Donate For Us