Is there any library, preferably in python but at least open source, that can summarize and or simplify natural-language text?
The two broad categories of approaches to text summarization are extraction and abstraction.
Maybe you can try sumy. It's a quite small library that I wrote in Python. There are implemented Luhn's and Edmundson's approaches, LSA method, SumBasic, KL-Sum, LexRank and TextRank algorithms. It's Apache2 licensed and supports Czech, Slovak, English, French, Japanese, Chinese, Portuguese, Spanish and German languages.
Feel free to open an issue or send a pull request if there is something you are missing.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With