How to get Index of an Entity in a Sentence in Spacy?

Tags: python, nlp, spacy

Is there an elegant way to get the index of an entity relative to the sentence that contains it? I know I can get the character offsets of an entity with ent.start_char and ent.end_char, but those values are relative to the entire string.

import spacy

nlp = spacy.load("en_core_web_sm")
doc = nlp(u"Apple is looking at buying U.K. startup for $1 billion. Apple just launched a new Credit Card.")

for ent in doc.ents:
    print(ent.text, ent.start_char, ent.end_char, ent.label_)

I want the entity Apple in both sentences to have start and end indexes 0 and 5, respectively. How can I do that?


asked Aug 22 '19 10:08

iCHAIT

1 Answer

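You need to subtract the sentence start position from both the entity start and end positions. ent.sent is the sentence span that contains the entity, so ent.sent.start_char is that sentence's offset in the whole string: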

for ent in doc.ents:
    print(ent.text, ent.start_char-ent.sent.start_char, ent.end_char-ent.sent.start_char, ent.label_)
#                                 ^^^^^^^^^^^^^^^^^^^^              ^^^^^^^^^^^^^^^^^^^^

Output:

Apple 0 5 ORG
U.K. 27 31 GPE
$1 billion 44 54 MONEY
Apple 0 5 ORG
Credit Card 26 37 ORG
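If you need this in more than one place, the subtraction can be wrapped in a small helper. The sketch below is only an illustration: sentence_relative_offsets is a made-up name, and the demo uses hand-built stand-in objects (not real spaCy spans) that carry only the attributes the helper reads, so it runs without a model installed. It also checks that the relative offsets slice the entity text back out of its sentence.

```python
from types import SimpleNamespace


def sentence_relative_offsets(ents):
    """Yield (text, start, end, label) with offsets relative to each entity's sentence."""
    for ent in ents:
        offset = ent.sent.start_char  # where the containing sentence begins in the document
        yield ent.text, ent.start_char - offset, ent.end_char - offset, ent.label_


# Hand-built stand-ins (NOT spaCy objects) that mimic the attributes used above.
text = "Apple is looking at buying U.K. startup for $1 billion. Apple just launched a new Credit Card."
second_start = text.index("Apple", 1)  # start of the second sentence
second_sent = SimpleNamespace(start_char=second_start, text=text[second_start:])
card_start = text.index("Credit Card")
fake_ents = [
    SimpleNamespace(text="Apple", start_char=second_start,
                    end_char=second_start + 5, label_="ORG", sent=second_sent),
    SimpleNamespace(text="Credit Card", start_char=card_start,
                    end_char=card_start + 11, label_="ORG", sent=second_sent),
]

results = list(sentence_relative_offsets(fake_ents))
for ent_text, start, end, label in results:
    # The relative offsets should slice the entity back out of its sentence.
    assert second_sent.text[start:end] == ent_text
    print(ent_text, start, end, label)

# With a real pipeline, the call would be: list(sentence_relative_offsets(doc.ents))
```

With a real doc, the same slice check would be ent.sent.text[start:end] == ent.text.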


answered Nov 14 '22 23:11

Wiktor Stribiżew
