 

Stanford Dependency Parser Setup and NLTK

So I got the "standard" Stanford Parser to work thanks to danger89's answer to this previous post, Stanford Parser and NLTK.

However, I am now trying to get the dependency parser to work and it seems the method highlighted in the previous link no longer works. Here is my code:

import nltk
import os
java_path = "C:\\Program Files\\Java\\jre1.8.0_51\\bin\\java.exe" 
os.environ['JAVAHOME'] = java_path


from nltk.parse import stanford
os.environ['STANFORD_PARSER'] = 'path/jar'
os.environ['STANFORD_MODELS'] = 'path/jar'
parser = stanford.StanfordDependencyParser(model_path="path/jar/englishPCFG.ser.gz")

sentences = parser.raw_parse_sents(nltk.sent_tokenize("The iPod is expensive but pretty."))

I get the following error: AttributeError: 'module' object has no attribute 'StanfordDependencyParser'

The only thing I changed was "StanfordParser" to "StanfordDependencyParser". Any ideas how I can get this to work?

I also tried the Stanford Neural Dependency parser by importing it as shown in the documentation here: http://www.nltk.org/_modules/nltk/parse/stanford.html

This one didn't work either.

Pretty new to NLTK. Thanks in advance for any helpful input.

Max asked Dec 02 '15

People also ask

What is Stanford dependency parser?

Introduction. A dependency parser analyzes the grammatical structure of a sentence, establishing relationships between "head" words and words which modify those heads. The figure below shows a dependency parse of a short sentence.
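To make the head/modifier idea concrete, here is a minimal sketch of a dependency parse represented as plain data. The triples below are hand-built for illustration (they are not Stanford parser output), but they use standard relation labels like nsubj and det:

```python
# Toy illustration of dependency relations (not Stanford parser output):
# each modifier word is linked to its head word with a relation label.
sentence = "The iPod is expensive"

# (head, relation, modifier) triples, hand-built for illustration
dependencies = [
    ("expensive", "nsubj", "iPod"),   # "iPod" is the subject of "expensive"
    ("expensive", "cop",   "is"),     # "is" is a copula
    ("iPod",      "det",   "The"),    # "The" is the determiner of "iPod"
]

# Find the modifiers of a given head word
modifiers = [mod for head, rel, mod in dependencies if head == "expensive"]
print(modifiers)  # ['iPod', 'is']
```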

What is parsing in NLTK?

NLTK Parsers. Classes and interfaces for producing tree structures that represent the internal organization of a text. This task is known as “parsing” the text, and the resulting tree structures are called the text's “parses”.
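As a rough sketch of what such a tree structure looks like, here is a hand-built constituency tree for a short sentence, represented with plain nested tuples rather than NLTK's Tree class, plus a helper that walks it:

```python
# Toy constituency tree as nested tuples: (label, children...),
# mirroring the bracketed trees that parsers produce.
tree = ("S",
        ("NP", ("DT", "The"), ("NN", "iPod")),
        ("VP", ("VBZ", "is"), ("ADJP", ("JJ", "expensive"))))

def leaves(node):
    """Collect the words (leaf strings) of the tree, left to right."""
    if isinstance(node, str):
        return [node]
    label, *children = node
    return [word for child in children for word in leaves(child)]

print(leaves(tree))  # ['The', 'iPod', 'is', 'expensive']
```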

What is Stanford NLP parser?

The Stanford Parser can be used to generate constituency and dependency parses of sentences for a variety of languages. The package includes PCFG, Shift Reduce, and Neural Dependency parsers. To fully utilize the parser, also make sure to download the models jar for the specific language you are interested in.


1 Answer

The StanfordDependencyParser API is a new class added in NLTK version 3.1.

Ensure that you have the latest NLTK, either through pip:

pip install -U nltk

or through your Linux package manager, e.g.:

sudo apt-get install python-nltk

or, on Windows, download the installer from https://pypi.python.org/pypi/nltk and run it; it should overwrite your previous NLTK version.
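Since the class only exists from NLTK 3.1 onward, it can help to check the installed version at runtime before importing. A small, hypothetical helper for comparing a version string like nltk.__version__ against that minimum:

```python
# Hypothetical helper: check whether a version string such as
# nltk.__version__ meets a minimum (major, minor) requirement.
def meets_minimum(version, required=(3, 1)):
    parts = []
    for piece in version.split(".")[:2]:
        digits = "".join(ch for ch in piece if ch.isdigit())
        parts.append(int(digits) if digits else 0)
    return tuple(parts) >= required

print(meets_minimum("3.0.5"))  # False: StanfordDependencyParser missing
print(meets_minimum("3.1"))    # True
```

Usage would be something like `meets_minimum(nltk.__version__)` right after `import nltk`.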

Then you can use the API as shown in the documentation:

from nltk.parse.stanford import StanfordDependencyParser
dep_parser = StanfordDependencyParser(model_path="edu/stanford/nlp/models/lexparser/englishPCFG.ser.gz")
print([parse.tree() for parse in dep_parser.raw_parse("The quick brown fox jumps over the lazy dog.")])

[out]:

[Tree('jumps', [Tree('fox', ['The', 'quick', 'brown']), Tree('dog', ['over', 'the', 'lazy'])])]

(Note: Make sure your jar paths and os.environ values are correct. On Windows, paths look like something\\something\\some\\path; on Unix, something/something/some/path.)

See also https://github.com/nltk/nltk/wiki/Installing-Third-Party-Software#stanford-tagger-ner-tokenizer-and-parser and, if you need a TL;DR solution, https://github.com/alvations/nltk_cli

alvas answered Oct 18 '22