Skip to main content

Why does an infinitely recursive function in PHP cause a segfault?



A hypothetical question for you all to chew on...





I recently answered another question on SO where a PHP script was segfaulting, and it reminded me of something I have always wondered, so let's see if anyone can shed any light on it.





Consider the following:







<?php



function segfault ($i = 1) {

echo "$i\n";

segfault($i + 1);

}



segfault();



?>







Obviously, this (useless) function loops infinitely. And eventually, will run out of memory because each call to the function executes before the previous one has finished. Sort of like a fork bomb without the forking.





But... eventually, on POSIX platforms, the script will die with SIGSEGV (it also dies on Windows, but more gracefully - so far as my extremely limited low-level debugging skills can tell). The number of loops varies depending on the system configuration (memory allocated to PHP, 32bit/64bit, etc etc) and the OS but my real question is - why does it happen with a segfault?





  • Is this simply how PHP handles "out-of-memory" errors? Surely there must be a more graceful way of handling this?



  • Is this a bug in the Zend engine?



  • Is there any way this can be controlled or handled more gracefully from within a PHP script?



  • Is there any setting that generally controls that maximum number of recursive calls that can be made in a function?




Comments

  1. If you use XDebug, there is a maximum function nesting depth which is controlled by an ini setting:

    $foo = function() use (&$foo) {
    $foo();
    };
    $foo();


    Produces the following error:


    Fatal error: Maximum function nesting level of '100' reached, aborting!


    This IMHO is a far better alternative than a segfault, since it only kills the current script, not the whole process.

    There is this thread that was on the internals list a few years ago (2006). His comments are:


    So far nobody had proposed a solution for endless loop problem that
    would satisfy these conditions:


    No false positives (i.e. good code always works)
    No slowdown for execution
    Works with any stack size


    Thus, this problem remains unsloved.


    Now, #1 is quite literally impossible to solve due to the halting problem. #2 is trivial if you keep a counter of stack depth (since you're just checking the incremented stack level on stack push).

    Finally, #3 Is a much harder problem to solve. Considering that some operating systems will allocate stack space in a non-contiguous manner, it's not going to be possible to implement with 100% accuracy, since it's impossible to portably get the stack size or usage (for a specific platform it may be possible or even easy, but not in general).

    Instead, PHP should take the hint from XDebug and other languages (Python, etc) and make a configurable nesting level (Python's is set to 1000 by default)....

    Either that, or trap memory allocation errors on the stack to check for the segfault before it happens and convert that into a RecursionLimitException so that you may be able to recover....

    ReplyDelete
  2. I could be totally wrong about this since my testing was fairly brief. It seems that Php will only seg fault if it runs out of memory (and presumably tries to access an invalid address). If the memory limit is set and low enough, you will get an out of memory error beforehand. Otherwise, the code seg faults and is handled by the OS.

    Can't say whether this is a bug or not, but the script should probably not be allowed to get out of control like this.

    See the script below. Behavior is practically identical regardless of options. Without a memory limit, it also slows my computer down severely before it's killed.

    <?php
    $opts = getopt('ilrv');
    $type = null;
    //iterative
    if (isset($opts['i'])) {
    $type = 'i';
    }
    //recursive
    else if (isset($opts['r'])) {
    $type = 'r';
    }
    if (isset($opts['i']) && isset($opts['r'])) {
    }

    if (isset($opts['l'])) {
    ini_set('memory_limit', '64M');
    }

    define('VERBOSE', isset($opts['v']));

    function print_memory_usage() {
    if (VERBOSE) {
    echo memory_get_usage() . "\n";
    }
    }

    switch ($type) {
    case 'r':
    function segf() {
    print_memory_usage();
    segf();
    }
    segf();
    break;
    case 'i':
    $a = array();
    for ($x = 0; $x >= 0; $x++) {
    print_memory_usage();
    $a[] = $x;
    }
    break;
    default:
    die("Usage: " . __FILE__ . " <-i-or--r> [-l]\n");
    break;
    }
    ?>

    ReplyDelete
  3. Know nothing about PHP implementation, but it's not uncommon in a language runtime to leave pages unallocated at the "top" of the stack so that a segfault will occur if the stack overflows. Usually this is handled inside the runtime and either the stack is extended or a more elegant error is reported, but there could be implementations (and situations in others) where the segfault is simply allowed to rise (or escapes).

    ReplyDelete

Post a Comment

Popular posts from this blog

Slow Android emulator

I have a 2.67 GHz Celeron processor, 1.21 GB of RAM on a x86 Windows XP Professional machine. My understanding is that the Android emulator should start fairly quickly on such a machine, but for me it does not. I have followed all instructions in setting up the IDE, SDKs, JDKs and such and have had some success in staring the emulator quickly but is very particulary. How can I, if possible, fix this problem?

CCNA 3 Final Exam => latest version

1 . Which security protocol or measure would provide the greatest protection for a wireless LAN? WPA2 cloaking SSIDs shared WEP key MAC address filtering   2 . Refer to the exhibit. All trunk links are operational and all VLANs are allowed on all trunk links. An ARP request is sent by computer 5. Which device or devices will receive this message? only computer 4 computer 3 and RTR-A computer 4 and RTR-A computer 1, computer 2, computer 4, and RTR-A computer 1, computer 2, computer 3, computer 4, and RTR-A all of the computers and the router   3 . Refer to the exhibit. Hosts A and B, connected to hub HB1, attempt to transmit a frame at the same time but a collision occurs. Which hosts will receive the collision jamming signal? only hosts A and B only hosts A, B, and C only hosts A, B, C, and D only hosts A, B, C, and E   4 . Refer to the exhibit. Router RA receives a packet with a source address of 192.168.1.65 and a destination address of 192.168.1.161...