Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

CORPUS resource

Tags:

nlp

corpus

I am designing an Automatic text summarizer. One of the major modules in this project requires TRAINING CORPUS. Can someone please help me out by providing TRAINING CORPUS or referring some link to download it. Thanks in anticipation

like image 933
Shishir Jaiswal Avatar asked Mar 01 '23 04:03

Shishir Jaiswal


2 Answers

See How to Write a Spelling Corrector by Norvig. He mentions Project Gutenberg, Wiktionary, British National Corpus, Birkbeck spelling error corpus. There's also Brown Corpus.

like image 57
Eugene Yokota Avatar answered Mar 06 '23 19:03

Eugene Yokota


Here are some Text summarization resources, including corpora. The Stanford list of NLP/Corpus linguistics resources may also help.

like image 33
Yuval F Avatar answered Mar 06 '23 19:03

Yuval F