Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Perform click on Web page element before parsing in Java

Tags:

java

html

jsoup

I'm trying to parse HTML page with DOM parser and jsoup library. The problem that I'm facing is this:

On Web site there are two buttons which show two different tables. I need to parse the table which is shown when the second button is clicked. There are different attribute values set after clicking the second button.

When I do Jsoup.connect("example.com")

I get response like first button is selected and I don't need that data.

Is there a way to perform click on second button, and then start parsing and retrieving data from Web site?

like image 539
Veljko Avatar asked Jan 14 '13 13:01

Veljko


2 Answers

Jsoup is just a parser, i.e. it can't handle events such as clicking on buttons. Have a look at browser automation tools (e.g. Selenium) to perform this kind of job.

like image 118
sp00m Avatar answered Nov 09 '22 04:11

sp00m


JSoup can't control the web page, only parse the content. For manipulation and interaction, there are some tools. I recommend Geb, which uses a Groovy DSL with a JQuery like syntax, making it very fluent. It's also pretty easy to parse xml/html with it.

like image 38
Will Avatar answered Nov 09 '22 03:11

Will