
Why does an infinitely recursive function in PHP cause a segfault?



A hypothetical question for you all to chew on...





I recently answered another question on SO where a PHP script was segfaulting, and it reminded me of something I have always wondered, so let's see if anyone can shed any light on it.





Consider the following:







<?php

function segfault ($i = 1) {
    echo "$i\n";
    segfault($i + 1);
}

segfault();

?>







Obviously, this (useless) function recurses infinitely, and it will eventually run out of memory because each call makes the next call before it has itself finished. Sort of like a fork bomb without the forking.





But eventually, on POSIX platforms, the script dies with SIGSEGV (it also dies on Windows, but more gracefully, as far as my extremely limited low-level debugging skills can tell). The number of calls it manages varies with the system configuration (memory allocated to PHP, 32-bit vs. 64-bit, etc.) and the OS, but my real question is: why does it end in a segfault?





  • Is this simply how PHP handles "out-of-memory" errors? Surely there must be a more graceful way of handling this?



  • Is this a bug in the Zend engine?



  • Is there any way this can be controlled or handled more gracefully from within a PHP script?



  • Is there any setting that generally controls the maximum number of recursive calls that can be made in a function?




Comments

  1. If you use XDebug, there is a maximum function nesting depth, which is controlled by an ini setting (xdebug.max_nesting_level). For example, this code:

    $foo = function() use (&$foo) {
        $foo();
    };
    $foo();


    Produces the following error:


    Fatal error: Maximum function nesting level of '100' reached, aborting!


    This IMHO is a far better alternative than a segfault, since it only kills the current script, not the whole process.
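To see which limit applies on a given installation, the setting can be read at runtime. The following is only a minimal sketch: `xdebug_nesting_limit()` is a hypothetical helper name, and it assumes the limit is exposed through the `xdebug.max_nesting_level` ini setting. It returns null when XDebug is not loaded at all.

```php
<?php

// Hypothetical helper (not part of XDebug): report the configured nesting
// limit, or null when XDebug is not loaded or the setting is unavailable.
function xdebug_nesting_limit(): ?int
{
    if (!extension_loaded('xdebug')) {
        return null;
    }
    // ini_get() returns false when the setting does not exist.
    $value = ini_get('xdebug.max_nesting_level');
    return $value === false ? null : (int) $value;
}

$limit = xdebug_nesting_limit();
echo $limit === null
    ? "XDebug nesting limit not available\n"
    : "XDebug nesting limit: $limit\n";
```

The default value differs between XDebug versions, so the '100' in the error message above should not be assumed everywhere.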

    This was discussed in a thread on the internals list a few years ago (2006). One of the comments there was:


    So far nobody had proposed a solution for endless loop problem that
    would satisfy these conditions:

    1. No false positives (i.e. good code always works)
    2. No slowdown for execution
    3. Works with any stack size

    Thus, this problem remains unsolved.


    Now, #1 is quite literally impossible to solve due to the halting problem. #2 is trivial if you keep a counter of stack depth (since you're just checking the incremented stack level on stack push).

    Finally, #3 is a much harder problem to solve. Considering that some operating systems allocate stack space in a non-contiguous manner, it cannot be implemented with 100% accuracy, since there is no portable way to get the stack size or usage (for a specific platform it may be possible or even easy, but not in general).

    Instead, PHP should take the hint from XDebug and other languages (Python, etc) and make a configurable nesting level (Python's is set to 1000 by default)....

    Either that, or trap memory allocation errors on the stack to check for the segfault before it happens and convert that into a RecursionLimitException so that you may be able to recover....
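Until the engine offers something like this, the depth-counter idea can be approximated in userland. The following is only a rough sketch under stated assumptions: `RecursionLimitException`, `MAX_DEPTH` and `descend()` are hypothetical names, PHP does not define any of them, and the limit of 50 is deliberately low so the example stays well under XDebug's own limit. The function passes its depth along explicitly and throws a catchable exception instead of recursing until the process crashes.

```php
<?php

// Hypothetical exception type; PHP itself does not define this class.
class RecursionLimitException extends RuntimeException
{
}

// Illustrative limit only; pick a value that suits the real workload.
const MAX_DEPTH = 50;

// Recurse $n times, tracking the current depth explicitly.
// Returns the depth reached, or throws once MAX_DEPTH is hit.
function descend(int $n, int $depth = 0): int
{
    if ($depth >= MAX_DEPTH) {
        throw new RecursionLimitException(
            'Nesting level of ' . MAX_DEPTH . ' reached'
        );
    }
    if ($n <= 0) {
        return $depth;
    }
    return descend($n - 1, $depth + 1);
}

try {
    descend(PHP_INT_MAX); // effectively unbounded recursion
    $result = 'finished';
} catch (RecursionLimitException $e) {
    // The script survives and can report the problem or fall back.
    $result = 'aborted: ' . $e->getMessage();
}

echo $result, "\n";
```

This only protects functions that cooperate by carrying the counter, so it is a workaround rather than a substitute for an engine-level limit.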

  2. I could be totally wrong about this since my testing was fairly brief. It seems that PHP will only segfault if it runs out of memory (and presumably tries to access an invalid address). If the memory limit is set and low enough, you will get an out-of-memory error first. Otherwise, the code segfaults and is handled by the OS.

    Can't say whether this is a bug or not, but the script should probably not be allowed to get out of control like this.

    See the script below. Behavior is practically identical regardless of options. Without a memory limit, it also slows my computer down severely before it's killed.

    <?php
    // Flags: -i iterative, -r recursive, -l apply a 64M memory limit, -v verbose
    $opts = getopt('ilrv');
    $type = null;
    if (isset($opts['i'])) {
        // iterative
        $type = 'i';
    } elseif (isset($opts['r'])) {
        // recursive
        $type = 'r';
    }
    if (isset($opts['i']) && isset($opts['r'])) {
        // Both flags given: nothing special happens, so -i takes precedence.
    }

    if (isset($opts['l'])) {
        ini_set('memory_limit', '64M');
    }

    define('VERBOSE', isset($opts['v']));

    // Print current memory usage, but only in verbose mode.
    function print_memory_usage() {
        if (VERBOSE) {
            echo memory_get_usage() . "\n";
        }
    }

    switch ($type) {
        case 'r':
            // Unbounded recursion: exhausts the stack.
            function segf() {
                print_memory_usage();
                segf();
            }
            segf();
            break;
        case 'i':
            // Unbounded loop: exhausts the heap by growing an array.
            $a = array();
            for ($x = 0; $x >= 0; $x++) {
                print_memory_usage();
                $a[] = $x;
            }
            break;
        default:
            die("Usage: " . __FILE__ . " <-i | -r> [-l] [-v]\n");
    }
    ?>

  3. I know nothing about PHP's implementation, but it's not uncommon for a language runtime to leave pages unallocated at the "top" of the stack so that a segfault occurs if the stack overflows. Usually this is handled inside the runtime, and either the stack is extended or a more elegant error is reported, but there could be implementations (and situations in others) where the segfault is simply allowed to propagate (or escapes).
