How I can traverse the HTML tree using Jsoup?

Tags:

I think this question has been asked, but I not found anything.

From the Document element in Jsoup, how I can traverse for all elements in the HTML content?

I was reading the documentation and I was thinking about using the childNodes() method, but it only takes the nodes from one leval below (what I understand). I think I can use some recursion with this method, but I want to know if there is a more appropriate/native way to do this.

976

asked Apr 11 '12 18:04

Renato Dinhani

2 Answers

From Document (and any Node subclass), you can use the traverse(NodeVisitor) method.

For example:

document.traverse(new NodeVisitor() {
    public void head(Node node, int depth) {
        System.out.println("Entering tag: " + node.nodeName());
    }
    public void tail(Node node, int depth) {
        System.out.println("Exiting tag: " + node.nodeName());
    }
});

186

answered Sep 21 '22 13:09

Vivien Barousse

1) You can select all elements of the document using * selector.

Elements elements = document.body().select("*");

2) For retrieve text of each individually using Element.ownText() method.

for (Element element : elements) {
  System.out.println(element.ownText());
}

3) For modify the text of each individually using Element.html(String strHtml). (clears any existing inner HTML in an element, and replaces it with parsed HTML.)

element.html(strHtml);

Hope this will help you. Thank you!

answered Sep 21 '22 13:09

Gaurav Darji

Related questions
                            
                                How to make font bold in java dialogue box?
                            
                                Why are compiled Java class files smaller than C compiled files?
                            
                                How to invoke a servlet without mapping in web.xml?
                            
                                Conversion of .class to jar and .class to exe
                            
                                How can I load java class from database?
                            
                                Inflated layout's buttons onClick listener not working
                            
                                Writing a method to accept interface type instead of class type
                            
                                what's wrong with e.printStackTrace() for an unknown exception
                            
                                Loading JDBC driver
                            
                                Java - extending a class and reusing the methods?
                            
                                JTable - Boolean Cell Type - Background
                            
                                Why are Iterable<E> and Iterator<E> in different packages?
                            
                                java.lang.OutOfMemoryError: PermGen space
                            
                                How to create a println/print method for a custom class
                            
                                How do we achieve "substring-match" under O(n) time?
                            
                                Generating very large random numbers java
                            
                                Getting Null Pointer Exception when using isEmpty() method
                            
                                Issue with Java Regex \b
                            
                                Configuring EHCache for Spring3.1.1 and Hibernate
                            
                                Spring security not hitting default-target-url after successful authtication

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How I can traverse the HTML tree using Jsoup?

Tags:

java

traversal

jsoup

Renato Dinhani

People also ask

2 Answers

Vivien Barousse

Gaurav Darji

Recent Activity

Donate For Us