XPath: Default to 'Node A', select 'Node B' instead if 'Node B' is not empty

Question

I need to create an XPath expression that does the following:

Returns the element inside of 'NodeA' by default
Returns the element inside of 'NodeB' if it is not empty.

Here is some sample XML so that my target structure can be clearly seen (I am using MS InfoPath):

<?xml version="1.0" encoding="UTF-8"?><?mso-infoPathSolution solutionVersion="1.0.0.10" productVersion="14.0.0" PIVersion="1.0.0.0" href="file:///C:\Documents%20and%20Settings\Chris\Local%20Settings\Application%20Data\Microsoft\InfoPath\Designer3\9016384cab6148f6\manifest.xsf" ?><?mso-application progid="InfoPath.Document" versionProgid="InfoPath.Document.3"?>
<my:myFields xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xhtml="http://www.w3.org/1999/xhtml" xmlns:my="http://schemas.microsoft.com/office/infopath/2003/myXSD/2012-09-07T14:19:10" xmlns:xd="http://schemas.microsoft.com/office/infopath/2003" xml:lang="en-us">
<my:NodeASection>
    <my:NodeA>2012-09-13</my:NodeA>
</my:NodeASection>
    <my:NodeBSection>
        <my:NodeBGroup>
            <my:NodeB>2012-09-14</my:NodeB>
        </my:NodeBGroup>
    </my:NodeBSection>
</my:myFields>

This XPath expression can be used to evaluate NodeB for the existence of text: boolean(//my:NodeB[(text())])

I have heard of the "Becker Method" but I'm not sure how that applies when both nodes exist. I'm very new to XPath and appreciate any help that can be offered.

David Pärsson · Accepted Answer

This XPath expression returns NodeB if it exists (and has text content) and NodeA in the other case:

//my:NodeB[text()] | //my:NodeA[text() and not(//my:NodeB[text()])]

If you want to get all sub-elements you can append /* after the selected node, like this

//my:NodeB[text()]/* | //my:NodeA[text() and not(//my:NodeB[text()])]/*

Dimitre Novatchev · Answer

A correct XPath expression is:

(//my:NodeB[node()] | //my:NodeA[not(//my:NodeB/node())])/node()

As the conditions in the predicates are mutually exclusive, only one of them can be true() and this guarantees that only one of the two nodes is selected by the expression within the brackets.

So, the expression above selects any node that is a child of: my:NodeB if it has children, or my:NodeA -- otherwize.

Here we assume as given that at most one element named my:NodeA and at most one element named my:NodeB exist in the XML document.

Another assumption is that the namespace to which the prefix my is bound has been "registered" with the XPath expression evaluator (the specific XPath implementation you are using).

Do note that in the provided XML document neither of the elements my:NodeA and my:NodeB has any element children (they both have just a text node child) -- so I assume that by "element" you actually mean "node".

Sean B. Durkin · Answer

If it is safe to rely on the fact that any NodeA's will come before NodeB in document order (as implied by your sample), then a simpler and much more efficient XPATH expression to select the required element is...

(//my:NodeA[text()]|//my:NodeB)[1]

The above selects the element. If you want to select the text node of the element, then use instead...

(//my:NodeA[text()]|//my:NodeB)[1]/text()

If there is no positional relationship between NodeA and NodeB (they can come in any relative order), and you are using XPATH 2.0, then the following expression will select the required text node..

(//my:NodeA[text()],//my:NodeB)[1]/text()

XPath: Default to 'Node A', select 'Node B' instead if 'Node B' is not empty

Tags:

xml

if-statement

xpath

infopath

Shrout1

3 Answers

David Pärsson

Dimitre Novatchev

Sean B. Durkin

Recent Activity

Donate For Us

XPath: Default to 'Node A', select 'Node B' instead if 'Node B' is not empty

Tags:

xml

if-statement

xpath

infopath

Shrout1

3 Answers

David Pärsson

Dimitre Novatchev

Sean B. Durkin

Related questions

Recent Activity

Donate For Us