Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

XPath: Default to 'Node A', select 'Node B' instead if 'Node B' is not empty

I need to create an XPath expression that does the following:

  • Returns the element inside of 'NodeA' by default
  • Returns the element inside of 'NodeB' if it is not empty.

Here is some sample XML so that my target structure can be clearly seen (I am using MS InfoPath):

<?xml version="1.0" encoding="UTF-8"?><?mso-infoPathSolution solutionVersion="1.0.0.10" productVersion="14.0.0" PIVersion="1.0.0.0" href="file:///C:\Documents%20and%20Settings\Chris\Local%20Settings\Application%20Data\Microsoft\InfoPath\Designer3\9016384cab6148f6\manifest.xsf" ?><?mso-application progid="InfoPath.Document" versionProgid="InfoPath.Document.3"?>
<my:myFields xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xhtml="http://www.w3.org/1999/xhtml" xmlns:my="http://schemas.microsoft.com/office/infopath/2003/myXSD/2012-09-07T14:19:10" xmlns:xd="http://schemas.microsoft.com/office/infopath/2003" xml:lang="en-us">
<my:NodeASection>
    <my:NodeA>2012-09-13</my:NodeA>
</my:NodeASection>
    <my:NodeBSection>
        <my:NodeBGroup>
            <my:NodeB>2012-09-14</my:NodeB>
        </my:NodeBGroup>
    </my:NodeBSection>
</my:myFields>

This XPath expression can be used to evaluate NodeB for the existence of text: boolean(//my:NodeB[(text())])

I have heard of the "Becker Method" but I'm not sure how that applies when both nodes exist. I'm very new to XPath and appreciate any help that can be offered.

like image 484
Shrout1 Avatar asked Sep 11 '12 13:09

Shrout1


3 Answers

This XPath expression returns NodeB if it exists (and has text content) and NodeA in the other case:

//my:NodeB[text()] | //my:NodeA[text() and not(//my:NodeB[text()])]

If you want to get all sub-elements you can append /* after the selected node, like this

//my:NodeB[text()]/* | //my:NodeA[text() and not(//my:NodeB[text()])]/*
like image 111
David Pärsson Avatar answered Nov 02 '22 05:11

David Pärsson


A correct XPath expression is:

(//my:NodeB[node()] | //my:NodeA[not(//my:NodeB/node())])/node()

As the conditions in the predicates are mutually exclusive, only one of them can be true() and this guarantees that only one of the two nodes is selected by the expression within the brackets.

So, the expression above selects any node that is a child of: my:NodeB if it has children, or my:NodeA -- otherwize.

Here we assume as given that at most one element named my:NodeA and at most one element named my:NodeB exist in the XML document.

Another assumption is that the namespace to which the prefix my is bound has been "registered" with the XPath expression evaluator (the specific XPath implementation you are using).

Do note that in the provided XML document neither of the elements my:NodeA and my:NodeB has any element children (they both have just a text node child) -- so I assume that by "element" you actually mean "node".

like image 33
Dimitre Novatchev Avatar answered Nov 02 '22 06:11

Dimitre Novatchev


If it is safe to rely on the fact that any NodeA's will come before NodeB in document order (as implied by your sample), then a simpler and much more efficient XPATH expression to select the required element is...

(//my:NodeA[text()]|//my:NodeB)[1]

The above selects the element. If you want to select the text node of the element, then use instead...

(//my:NodeA[text()]|//my:NodeB)[1]/text()

If there is no positional relationship between NodeA and NodeB (they can come in any relative order), and you are using XPATH 2.0, then the following expression will select the required text node..

(//my:NodeA[text()],//my:NodeB)[1]/text()
like image 36
Sean B. Durkin Avatar answered Nov 02 '22 05:11

Sean B. Durkin