Skip to main content

Getting content of partial html in DomDocument



I have a string:







$string = 'some text <img src="www">';







I want to get the image source and the text.


Here is what I have:







$doc= new DOMDocument();

$doc->loadHTML($string);

$nodes=$doc->getElementsByTagName ('img');







From $nodes->item(0) I get the image source.


How can I get the the "some text"?


Comments

  1. For simple cases like this, try:

    $doc->documentElement->textContent

    ReplyDelete
  2. textContent, or with DOMXPaths $xpath->query('//text()')

    ReplyDelete
  3. You could make it like jQuery in javascript. Wrap the whole string with anything, and get this. Then you can get the TextNode, which contains this text.

    $string = 'some text <img src="www">';
    $string = '<div id="wrapper">' . $string . '</div>';

    $nodes = $doc->getElementById('wrapper');

    ReplyDelete

Post a Comment

Popular posts from this blog

Slow Android emulator

I have a 2.67 GHz Celeron processor, 1.21 GB of RAM on a x86 Windows XP Professional machine. My understanding is that the Android emulator should start fairly quickly on such a machine, but for me it does not. I have followed all instructions in setting up the IDE, SDKs, JDKs and such and have had some success in staring the emulator quickly but is very particulary. How can I, if possible, fix this problem?