Skip to main content

XPath query result order


For another question I have created some XML related code that works on my development machine but not on viper codepad where I tested it before adding it to my answer.



I could reduce my problem to the point that the order of nodes returned by DOMXPath::query() differs between my system and the codepad.



XML: <test>This is some <span>text</span>, fine.</test>



When I query all textnodes //child::text() the result differs:



Viper Codepad:




#0: This is some
#1: , fine.
#2: text



My Machine:




#0: This is some
#1: text
#2: , fine.



I'm not that experienced with xpath that I do understand why this happens and how it's probably possible to influence the return order with the PHP implementation.



Edit:



Further testing has revealed that LIBXML_VERSION differs between the two systems:




Viper Codepad: 20626 (2.6.26; 6 Jun 2006)
My Machine...: 20707 (2.7.7; 15 Mar 2010)


Source: Tips4allCCNA FINAL EXAM

Comments

  1. Technically XPath 1.0 returns node-sets rather than node sequences. In the XPath 1.0 specification there is no statement about the order of these node-sets - indeed, being sets, they have no intrinsic order.

    However, XSLT 1.0 always processes the node-sets returned by XPath 1.0 in document order, and because of that precedent, there is a widespread expectation that XPath results will be in document order when XPath is invoked from languages other than XSLT. However, there is nothing in the spec to guarantee this. In XPath 2.0 the user expectation becomes part of the spec, and the results of a path expression MUST be in document order.

    ReplyDelete
  2. It looks like an bug in 20626 version:

    It process first all child text nodes in document order, then content of child element nodes. Should be as result on your machine

    ReplyDelete
  3. I could find the following bug-report which looks like the issue: Bug 363252 - proximity position in libxml2's xmlXPathEvalExpression() reported 18 Oct 2006 and confirmed dating back since May 2006 which is before the 2.6.26 version in question.

    This should have been fixed in libxml2 2.6.27.

    ReplyDelete
  4. It appears that Viper Codepad is not returning the selected text() nodes in depth first document order, but doing a breadth first evaluation.

    It is supposed to be a depth first traversal.

    Saxon, MSXML, Altova XML each returned the results in a depth-first order.

    ReplyDelete
  5. XPath is a query language, thus it should only read the structure of the .xml document as is and never modify it. This includes the node order. In your first example however this is not true. So this is definitely a bug according to this.

    ReplyDelete

Post a Comment

Popular posts from this blog

Why is this Javascript much *slower* than its jQuery equivalent?

I have a HTML list of about 500 items and a "filter" box above it. I started by using jQuery to filter the list when I typed a letter (timing code added later): $('#filter').keyup( function() { var jqStart = (new Date).getTime(); var search = $(this).val().toLowerCase(); var $list = $('ul.ablist > li'); $list.each( function() { if ( $(this).text().toLowerCase().indexOf(search) === -1 ) $(this).hide(); else $(this).show(); } ); console.log('Time: ' + ((new Date).getTime() - jqStart)); } ); However, there was a couple of seconds delay after typing each letter (particularly the first letter). So I thought it may be slightly quicker if I used plain Javascript (I read recently that jQuery's each function is particularly slow). Here's my JS equivalent: document.getElementById('filter').addEventListener( 'keyup', function () { var jsStart = (new Date).getTime()...

Is it possible to have IF statement in an Echo statement in PHP

Thanks in advance. I did look at the other questions/answers that were similar and didn't find exactly what I was looking for. I'm trying to do this, am I on the right path? echo " <div id='tabs-".$match."'> <textarea id='".$match."' name='".$match."'>". if ($COLUMN_NAME === $match) { echo $FIELD_WITH_COLUMN_NAME; } else { } ."</textarea> <script type='text/javascript'> CKEDITOR.replace( '".$match."' ); </script> </div>"; I am getting the following error message in the browser: Parse error: syntax error, unexpected T_IF Please let me know if this is the right way to go about nesting an IF statement inside an echo. Thank you.