Stanford CoreNLP OpenIE annotator

Tags:

stanford-nlp

I have a question regarding Stanford CoreNLP OpenIE annotator.

I am using Stanford CoreNLP version stanford-corenlp-full-2015-12-09 in order to extract relations using OpenIE. I don't know much Java that's why I am using the pycorenlp wrapper for Python 3.4.

I want to extract relation between all words of a sentence, below is the code I used. I am also interested in showing the confidence of each triplet:

Click to copy

import nltk
from pycorenlp import *
import collections
nlp=StanfordCoreNLP("http://localhost:9000/")
s="Twenty percent electric motors are pulled from an assembly line"
output = nlp.annotate(s, properties={"annotators":"tokenize,ssplit,pos,depparse,natlog,openie",
                                 "outputFormat": "json","triple.strict":"true"})
result = [output["sentences"][0]["openie"] for item in output]
print(result)
for i in result:
for rel in i:
    relationSent=rel['relation'],rel['subject'],rel['object']
    print(relationSent)

This is the result i got:

Click to copy

[[{'relationSpan': [4, 6], 'subject': 'Twenty percent electric motors', 'objectSpan': [8, 10], 'relation': 'are pulled from', 'object': 'assembly line', 'subjectSpan': [0, 4]}, {'relationSpan': [4, 6], 'subject': 'percent electric motors', 'objectSpan': [8, 10], 'relation': 'are pulled from', 'object': 'assembly line', 'subjectSpan': [1, 4]}, {'relationSpan': [4, 5], 'subject': 'Twenty percent electric motors', 'objectSpan': [5, 6], 'relation': 'are', 'object': 'pulled', 'subjectSpan': [0, 4]}, {'relationSpan': [4, 5], 'subject': 'percent electric motors', 'objectSpan': [5, 6], 'relation': 'are', 'object': 'pulled', 'subjectSpan': [1, 4]}]]

And the triplets are:

Click to copy

('are pulled from', 'Twenty percent electric motors', 'assembly line')
('are pulled from', 'percent electric motors', 'assembly line')
('are', 'Twenty percent electric motors', 'pulled')
('are', 'percent electric motors', 'pulled')

First problem is that the confidence is not showing in the result. Second problem is that I only want to retrieve the triplet that that includes all words of the sentence i.e this triplet:

Click to copy

('are pulled from', 'Twenty percent electric motors', 'assembly line')

What I’m getting is more than one combination of triplets. I tried to use the option "triple.strict":"true" because it extracts "triples only if they consume the entire fragment" but it is NOT working.

Can anyone advise me on this?

745

asked May 22 '16 13:05

Shany

2 Answers

You should try this setting:

Click to copy

"openie.triple.strict":"true"

Looking through the code it appears at this time the confidence is not stored with the returned json, so you cannot get that from the CoreNLP server.

Since you bring this up I will push a change that will add those to the output json and let you know when that is live on the GitHub.

130

answered Oct 19 '22 20:10

StanfordNLPHelp

Thanks a lot, it is working now i added both: "openie.triple.strict":"true" and "openie.max_entailments_per_clause":"1" the code now is:

Click to copy

output = nlp.annotate(chunkz, properties={"annotators":"tokenize,ssplit,pos,depparse,natlog,openie",
                                "outputFormat": "json",
                                 "openie.triple.strict":"true",
                                 "openie.max_entailments_per_clause":"1"})

answered Oct 19 '22 22:10

Shany

Related questions
                            
                                Majority Element Python
                            
                                Get non-duplicate rows from numpy array
                            
                                How to properly numref table in Sphinx?
                            
                                Avoiding infinite recursion with os.walk
                            
                                How to calculate the inverse of the log normal cumulative distribution function in python?
                            
                                which python neo4j drivers are stable/production ready?
                            
                                Can i press two keys simultaneously for a single event using Pygame?
                            
                                How can I use threading in Python to parallelize AWS S3 API calls?
                            
                                Define a column type as 'list' in Pandas
                            
                                flask sqlalchemy multiple foreign keys in relationship
                            
                                Flask-SQLAlchemy - TypeError: __init__() takes only 1 position
                            
                                sklearn.tree.export_graphviz alternatives
                            
                                'exit' is not a keyword in Python, but no error occurs while using it
                            
                                Removing intersection between data frame based on multiple columns
                            
                                What is a right way for REST API response?
                            
                                Python one liner to substitute a list indices
                            
                                Pandas: Convert lists within a single column to multiple columns
                            
                                How i can disable alembic logging at runtime?
                            
                                High-dimensional data structure in Python
                            
                                How to sort a list of strings with a different order?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Stanford CoreNLP OpenIE annotator

Tags:

python

stanford-nlp

Shany

People also ask

2 Answers

StanfordNLPHelp

Shany

Recent Activity

Donate For Us