I am trying to download all the data package for the nltk
. But it is always failing while trying to download framenet_v15. It simply hangs there.
Tried multiple times from the same machine. Each time almost left for 30 mins and once more than one hour. Tried to replace the source server to google svn, but downloader gave an error.
Unfortunately, I don't have any other information. Is there way to figure what the problem is? Or is there any alternate source from where I can download the nltk
data?
Thanks.
Edit:
finally downloaded with wget -c
, it took lot of retries before finally completing the download.
Some observations
Download individual packages from https://www.nltk.org/nltk_data/ (see the “download” links). Unzip them to the appropriate subfolder. For example, the Brown Corpus, found at: https://raw.githubusercontent.com/nltk/nltk_data/gh-pages/packages/corpora/brown.zip is to be unzipped to nltk_data/corpora/brown .
Go to GitHub repo, download the package we need and unzip this file. For example, in this punkt case, we are going to download the zip file in this link. After we download it, we can then unzip it to get a folder named punkt.
EDIT: Here is a direct link that will allow you to request the data from the Frame Net project: https://framenet.icsi.berkeley.edu/fndrupal/framenet_request_data
When I downloaded the NLTK data I had to run the downloader several times since it kept hanging.
Alternatively here is a list of the individual files: http://nltk.org/nltk_data/
I just downloaded framenet_v15 from this link: http://nltk.github.com/nltk_data/packages/corpora/framenet_v15.zip
Also, see this question for more discussions on this: Installing natural language toolkit data
I tried downloading by
import nltk nltk.download('all')
And it worked for me
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With