How to extract separate text nodes with Jsoup?

I have an element like this :

<td> TextA <br/> TextB </td>

How can I extract TextA and TextB separately?

292

asked Aug 23 '11 16:08

M.M

1 Answer

There are several ways; which one fits depends on the document itself and on whether the HTML markup is consistent. In this particular example you could get the td's child nodes with Element#childNodes() and then check each node individually to see whether it is a TextNode.

E.g.

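import org.jsoup.nodes.Element;
import org.jsoup.nodes.Node;
import org.jsoup.nodes.TextNode;

// getItSomehow() stands in for however you obtain the <td> element.
Element td = getItSomehow();

// Walk the direct children and print only the text nodes.
for (Node child : td.childNodes()) {
    if (child instanceof TextNode) {
        System.out.println(((TextNode) child).text());
    }
}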

which results in

 TextA 
 TextB
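
For completeness, here is a self-contained sketch of the same loop that can be run as-is, assuming Jsoup is on the classpath. The class name ChildTextNodes and the helper directTexts are made up for illustration, and so is the trimming and blank-skipping, which the snippet above does not do. Note that the sample cell is wrapped in a table: the HTML parser tends to drop a bare <td> that has no table around it.

```java
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;
import org.jsoup.nodes.Node;
import org.jsoup.nodes.TextNode;

import java.util.ArrayList;
import java.util.List;

class ChildTextNodes {

    // Hypothetical helper: collects the trimmed text of each direct
    // TextNode child and skips children that are blank after trimming.
    static List<String> directTexts(Element element) {
        List<String> texts = new ArrayList<>();
        for (Node child : element.childNodes()) {
            if (child instanceof TextNode) {
                String text = ((TextNode) child).text().trim();
                if (!text.isEmpty()) {
                    texts.add(text);
                }
            }
        }
        return texts;
    }

    public static void main(String[] args) {
        // The <td> is placed inside a table so the parser keeps the cell.
        Document doc = Jsoup.parse(
                "<table><tr><td> TextA <br/> TextB </td></tr></table>");
        Element td = doc.select("td").first();
        System.out.println(directTexts(td));
    }
}
```

With the sample markup this should print [TextA, TextB]. Returning a list instead of printing inside the loop keeps the two values separately addressable, which is what the question asks for.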

I think it would be nice if Jsoup offered an Element#textNodes() method or something similar to get the child text nodes, just as Element#children() gets the child elements (which would have returned the <br /> element in your example).
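
For what it's worth, later Jsoup releases do ship an Element#textNodes() method that returns the direct child text nodes. The following is a minimal sketch assuming your Jsoup version provides it; the class name TextNodesDemo and the helper directTexts are hypothetical, and the trimming and blank filtering are additions for convenience.

```java
import org.jsoup.Jsoup;
import org.jsoup.nodes.Element;

import java.util.List;
import java.util.stream.Collectors;

class TextNodesDemo {

    // Hypothetical helper: same result as looping over childNodes(),
    // but relies on Element#textNodes() to do the type filtering.
    static List<String> directTexts(Element element) {
        return element.textNodes().stream()
                .map(node -> node.text().trim())
                .filter(text -> !text.isEmpty())
                .collect(Collectors.toList());
    }

    public static void main(String[] args) {
        // The <td> is placed inside a table so the parser keeps the cell.
        Element td = Jsoup.parse(
                "<table><tr><td> TextA <br/> TextB </td></tr></table>")
                .select("td").first();
        System.out.println(directTexts(td));
    }
}
```

If textNodes() is not available in the version you use, the childNodes() loop shown earlier in this answer remains the fallback.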

182

answered Sep 23 '22 05:09

BalusC

Tags: java, html-parsing, jsoup
