Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Alternative to HtmlUnit

Tags:

I have been researching about the headless browsers available till to date and found HtmlUnit being used pretty extensively. Do we have any alternative to HtmlUnit with possible advantage compared to HtmlUnit?

Thanks Nayn

like image 852
Nayn Avatar asked Nov 23 '10 07:11

Nayn


2 Answers

As far as I know, HtmlUnit` is the most powerful headless browser.

What are you issues with it?

like image 161
Ahmed Ashour Avatar answered Sep 21 '22 21:09

Ahmed Ashour


There are many other libraries that you can use for this.

  • If you need to scrape xml base data use JTidy.
  • If you need to scrape specific data from HTML you can use Jsoup.

Well I use jsoup - it's pretty much faster than any other API.

like image 33
Sajid Hussain Avatar answered Sep 18 '22 21:09

Sajid Hussain