While researching details for an answer to an XPath question here on Stack Overflow, I ran into a difference between XPath 1.0 and 2.0 that I can find no rationale for. I tried to understand what <code>.</code> really means. <ul> <li>In XPath 1.0, <code>.</code> is an abbreviation for <code>self::node()</code>. Both <code>self</code> and <code>node</code> are crystal-clear to me.</li> <li>In XPath 2.0, <code>.</code> is a primary expression, the "context item expression". The Abbreviated Syntax section explicitly states that in a note.</li> </ul> What was the rationale for the change? Is there a difference between <code>.</code> and <code>self::node()</code> in XPath 2.0? From the spec itself, the intent of the change is not clear to me. I tried googling keywords like dot or period, primary expression, and rationale.


Why did the definition of dot (.) change between XPath 1.0 and 2.0?

2 Answers

XPath 1.0 had four data types: string, number, boolean, and node-set. There was no way of handling collections of values other than nodes. This meant, for example, that there was no way of summing over derived values (if elements had attributes of the form price='$23.95', there was no way of summing over the numbers obtained by stripping off the $ sign because the result of such stripping would be a set of numbers, and there was no such data type).
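To make the limitation concrete, here is a minimal sketch of what summing derived values tends to look like when the query language can only hand back nodes: the nodes are selected with a path expression, and the stripping and summing happen in the host language. It uses Python's standard `xml.etree.ElementTree`, which implements only a small XPath subset; the `order`/`item` document and the `total_price` helper are hypothetical names chosen for illustration, not anything from the question.

```python
import xml.etree.ElementTree as ET
from decimal import Decimal

# Hypothetical sample document; element and attribute names are assumptions.
SAMPLE = """<order>
  <item price="$23.95"/>
  <item price="$5.00"/>
  <item price="$1.05"/>
</order>"""


def total_price(xml_text: str) -> Decimal:
    """Sum the price attributes after stripping the leading '$'."""
    root = ET.fromstring(xml_text)
    # The path expression can only select nodes (here: item elements).
    items = root.findall(".//item[@price]")
    # The derived numbers have no place in a node-set, so they are
    # computed outside the path expression, in the host language.
    amounts = [Decimal(item.get("price").lstrip("$")) for item in items]
    return sum(amounts, Decimal("0"))


print(total_price(SAMPLE))
```

With sequences of atomic values available, I believe XPath 2.0 can express the same total in a single expression along the lines of `sum(//item/@price/xs:decimal(substring-after(., '$')))`, where `.` refers first to an attribute node and the intermediate result is a sequence of decimals.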

So XPath 2.0 introduced more general sequences, and that meant that the facilities for manipulating sequences had to be generalised; for example if $X is a sequence of numbers, then $X[. > 0] filters the sequence to include only the positive numbers. But that only works if "." can refer to a number as well as to a node.
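As a rough analogy only, the filtering described above can be modelled in Python as a predicate applied to each item of a sequence, with the lambda parameter playing the role of "." (the context item). The `filter_sequence` helper and the parameter name `dot` are hypothetical, and the sketch covers only boolean predicates, not XPath's positional ones.

```python
def filter_sequence(items, predicate):
    """Keep the items for which predicate(context_item) is true.

    Models a boolean XPath 2.0 predicate: the context item may be
    any value, not just a node.
    """
    return [item for item in items if predicate(item)]


# Analogue of $X[. > 0] for a sequence of numbers.
positives = filter_sequence([-2, 0, 3.5, 7], lambda dot: dot > 0)

# Analogue of (1 to 100)[. mod 5 eq 0].
multiples_of_five = filter_sequence(range(1, 101), lambda dot: dot % 5 == 0)

print(positives)
print(multiples_of_five[:3], len(multiples_of_five))
```

The point of the model is that nothing in `filter_sequence` cares whether an item is a node; defining "." as `self::node()` would tie the context item to nodes and rule out the numeric cases.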

146

answered Oct 17 '22 04:10

Michael Kay

In short: self::node() filters out atomic items, while . does not. Atomic items (numbers, strings, and many other XML Schema types) are not nodes (unlike elements, attributes, comments, etc.).

Consider the example from the spec: (1 to 100)[. mod 5 eq 0]. If the . is replaced by self::node(), the expression is not valid XPath, because mod requires both arguments to be numeric and atomization does not help in this case.

For those scanning the spec: XPath 2.0 defines the item() type-matching construct, but it has nothing to do with node tests, as atomics are not nodes and axis steps always return just nodes. Therefore, dot cannot be defined as self::item(). It really needs to be a special language construct.

answered Oct 17 '22 06:10

Palec
