How to get node value / innerHTML with XPath?

Tags: parsing, xml, html-parsing, xpath

I have an XPath that selects the element with the class I want: //div[@class='myclass']. But it returns the whole div (including the <div class='myclass'> tag itself), whereas I would like to get only the contents of this tag, without the tag itself. How can I do it?

985

asked Jun 05 '12 13:06

Tom Smykowski

2 Answers

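node() = innerXml
text() = innerText

Both return node sets (think of arrays), so text()[1] is the first child text node. Append them as the last step of your path, e.g. //div[@class='myclass']/node(). Note that text() selects only the direct text children, so it matches innerText only when the div contains no nested elements.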
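As a rough illustration, here is a minimal Python sketch of the two ideas. It assumes the standard-library ElementTree module, which implements only a subset of XPath and does not support the text() or node() steps, so the div is selected with the path and its contents are then read through the element API. The sample document and the helper names child_text_nodes and inner_xml are made up for this example.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample document, not taken from the question.
SAMPLE = '<root><div class="myclass">Hello <b>world</b>!</div></root>'


def child_text_nodes(element):
    """Direct text children of element, comparable to XPath's element/text()."""
    # ElementTree stores text before the first child in .text and text
    # following each child in that child's .tail.
    pieces = [element.text] + [child.tail for child in element]
    return [piece for piece in pieces if piece]


def inner_xml(element):
    """Serialized children without the element's own tag, comparable to element/node()."""
    # ET.tostring() also writes each child's tail, so text that follows
    # a child element is preserved.
    return (element.text or "") + "".join(
        ET.tostring(child, encoding="unicode") for child in element
    )


root = ET.fromstring(SAMPLE)
# ElementTree needs a relative path on an element, hence ".//" instead of "//".
div = root.find(".//div[@class='myclass']")

texts = child_text_nodes(div)
content = inner_xml(div)
print(texts)    # the separate text nodes; texts[0] corresponds to text()[1]
print(content)  # the div's contents without the <div> tag itself
```

With a full XPath 1.0 engine you would not need the helpers: evaluating //div[@class='myclass']/node() or //div[@class='myclass']/text() returns the same nodes directly.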

164

answered Oct 02 '22 03:10

Nikola Bogdanović

With XPath, what you get back is the last thing in the path that is not a condition. What does that mean? Conditions (predicates) are the parts between [ and ] (but you already knew that), and yours reads as pathElement[that has a 'class' attribute with value 'myclass']. The pathElement comes directly before the [.

Everything outside the []'s is the path, so in //a/b/c[@blah='bleh']/d, a, b, c and d are all path elements, blah is an attribute and bleh is a literal value. If this path matches, it returns the d elements, d being the last non-condition step.
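To make the "last non-condition step" rule concrete, here is a small sketch using Python's standard-library ElementTree, which supports this kind of path and attribute predicate. The sample document is invented for the example, and the path is written with a leading ".//" because ElementTree only accepts relative paths on an element.

```python
import xml.etree.ElementTree as ET

# Hypothetical document: two c elements, only one satisfies the predicate.
SAMPLE = (
    "<root><a><b>"
    '<c blah="bleh"><d>one</d></c>'
    '<c blah="other"><d>two</d></c>'
    "</b></a></root>"
)
root = ET.fromstring(SAMPLE)

# The predicate on c only filters; the result is the final step, d.
d_nodes = root.findall(".//a/b/c[@blah='bleh']/d")

# Drop the trailing /d and the final step, and thus the result, becomes c.
c_nodes = root.findall(".//a/b/c[@blah='bleh']")

d_tags = [node.tag for node in d_nodes]
d_texts = [node.text for node in d_nodes]
c_tags = [node.tag for node in c_nodes]
print(d_tags, d_texts, c_tags)
```

Only the d under the matching c comes back from the first query, while the second query returns that c itself, with its d child still attached underneath it.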

Your particular path returns a div (or a series of divs), since div is the last step in your XPath's path. The return value thus includes the top-level node(s), div in your case, together with all of their children. Nodes can be elements or text (or comments, processing instructions, ...).

Underneath a node there can be multiple text nodes, hence the array pOcHa talks about. x/text() returns all text nodes that are direct children of x; x/node() returns all child nodes, including text.

answered Oct 02 '22 03:10

jos
