Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

A Java-based Readability-like parser

I would like to know what standalone options exist for parsing and converting normal webpages to the easy to read format, provided by services like Pocket, Readability, and Instapaper.

I am primarily looking for a Java parser library to include in my app, but in general, all kinds of standalone tools are appreciated.

Thanks.

like image 311
Preslav Rachev Avatar asked Dec 03 '25 06:12

Preslav Rachev


1 Answers

Snackatory now itself states that it's outdated and recommends using Crux.

But there's also Readability4J. It's a Kotlin port of Mozilla's Readability.js, which is used for Firefox's Reader View. So Readability4J's output matches exactly Firefox's Reader View.

Disclaimer: I am the author of Readability4J. But I just ported it, kudos for the great work go to Mozilla: https://github.com/mozilla/readability.

like image 127
dankito Avatar answered Dec 04 '25 18:12

dankito



Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!