<p>There's some work in progress related to adding xpath support to jsoup https://github.com/jhy/jsoup/pull/80.</p> <ul> <li>Is it working?</li> <li>How can I use it?</li> </ul>

<p><strong>JSoup</strong> doesn't support <em>XPath</em> yet, but you may try <strong>XSoup</strong> - <em>"Jsoup with XPath"</em>.</p> <p>Here's an example quoted from the projects Github site (link):</p> <pre class="prettyprint"><code>@Test public void testSelect() { String html = "<html><div><a href='https://github.com'>github.com</a></div>" + "<table><tr><td>a</td><td>b</td></tr></table></html>"; Document document = Jsoup.parse(html); String result = Xsoup.compile("//a/@href").evaluate(document).get(); Assert.assertEquals("https://github.com", result); List<String> list = Xsoup.compile("//tr/td/text()").evaluate(document).list(); Assert.assertEquals("a", list.get(0)); Assert.assertEquals("b", list.get(1)); } </code></pre> <p>There you'll also find a list of features and expressions of XPath that are supported by XSoup.</p>

Does jsoup support xpath?

2 Answers

JSoup doesn't support XPath yet, but you may try XSoup - "Jsoup with XPath".

Here's an example quoted from the projects Github site (link):

@Test public void testSelect() {      String html = "<html><div><a href='https://github.com'>github.com</a></div>" +             "<table><tr><td>a</td><td>b</td></tr></table></html>";      Document document = Jsoup.parse(html);      String result = Xsoup.compile("//a/@href").evaluate(document).get();     Assert.assertEquals("https://github.com", result);      List<String> list = Xsoup.compile("//tr/td/text()").evaluate(document).list();     Assert.assertEquals("a", list.get(0));     Assert.assertEquals("b", list.get(1)); }

There you'll also find a list of features and expressions of XPath that are supported by XSoup.

answered Oct 03 '22 00:10

ollo

Not yet,but the project JsoupXpath has make it.For example,

String html = "<html><body><script>console.log('aaaaa')</script><div class='test'>some body</div><div class='xiao'>Two</div></body></html>"; JXDocument underTest = JXDocument.create(html); String xpath = "//div[contains(@class,'xiao')]/text()"; JXNode node = underTest.selNOne(xpath); Assert.assertEquals("Two",node.asString());

By the way,it supports the complete W3C XPATH 1.0 standard syntax.Such as

//ul[@class='subject-list']/li[./div/div/span[@class='pl']/num()>(1000+90*(2*50))][last()][1]/div/h2/allText() //ul[@class='subject-list']/li[not(contains(self::li/div/div/span[@class='pl']//text(),'14582'))]/div/h2//text()

answered Oct 02 '22 23:10

xiaohuo

Related questions
                            
                                Xpath expression to find values that start with
                            
                                How to get HTML5 data attribute using xpath?
                            
                                Xpath for button having text as 'New'
                            
                                Xpath: select node based in a condition (with local-name())
                            
                                XPath.evaluate performance slows down (absurdly) over multiple calls
                            
                                How do I retrieve element text inside CDATA markup via XPath?
                            
                                Trim function in XPath 1.0?
                            
                                How can I select an element with multiple classes with Xpath?
                            
                                How to use document.evaluate() and XPath to get a list of elements?
                            
                                How to set "value" to input web element using selenium?
                            
                                Python Selenium - get href value
                            
                                Performant parsing of HTML pages with Node.js and XPath
                            
                                How can I use XPath to find the minimum value of an attribute in a set of elements?
                            
                                DOM Level 3 XPath in Internet Explorer
                            
                                XPath search by "id" attribute , giving NPE - Java
                            
                                How to get the preceding element?
                            
                                How to select element using XPATH syntax on Selenium for Python?
                            
                                Get the inner HTML of a element in lxml
                            
                                Need to remove <?xml version="1.0" encoding="utf-16"?> from the xml
                            
                                Retrieve an xpath text contains using text()

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Does jsoup support xpath?

Tags:

xpath

jsoup

gguardin

People also ask

2 Answers

ollo

xiaohuo

Recent Activity

Donate For Us