New subject: Resuming on BlockCannotReturn exception

18 Nov 2023

      Hi Eliot,
How about to nil the pc just before making the return:
```
Context >> #cannotReturn: result
self push: self pc.  "backup the pc for the sake of debugging"
     closureOrNil ifNotNil: [^self cannotReturn: result to: self home 
sender; pc: nil].
     Processor debugWithTitle: 'Computation has been terminated!' 
translated full: false
```
The nilled pc should not even potentially interfere with the #isDead 
now.
I hope this is at least a step in the right direction :)
However, there's still a problem when debugging the resumption of 
#cannotReturn because the encoders expect a reasonable index. I haven't 
figured out yet where to place a nil check - #step, 
#stepToSendOrReturn... ?
Thanks again,
Jaromir
------ Original Message ------
From "Eliot Miranda" eliot.miranda@gmail.com
To "Jaromir Matas" mail@jaromir.net
Date 11/17/2023 8:36:50 PM
Subject Re: [squeak-dev] Re: Resuming on BlockCannotReturn exception
...
Hi Jaromir,
...
On Nov 17, 2023, at 7:05 AM, Jaromir Matas mail@jaromir.net wrote:

Eliot, hi again,
Please disregard my previous comment about nilling the contexts that 
have returned. We are indeed talking about the context directly under 
the #cannotReturn context which is totally different from the home 
context in another thread that's gone.
I may still be confused but would nilling the pc of the context 
directly under the cannotReturn context help? Here's what I mean:
Context >> #cannotReturn: result

    closureOrNil ifNotNil: [^self pc: nil; cannotReturn: result to: 
self home sender].
    Processor debugWithTitle: 'Computation has been terminated!' 
translated full: false.

Instead of crashing the VM invokes the debugger with the 'Computation 
has been terminated!' message.
Does this make sense?
Nearly. But it loses the information on what the pc actually is, and 
that’s potentially vital information.  So IMO the ox should only be 
nilled between the BlockCannotReturn exception being created and 
raised.
[But if you try this don’t be surprised if it causes a few temporary 
problems.  It looks to me that without a little refactoring this could 
easily cause an infinite recursion around the sending of isDead.  I’m 
sure you’ll be able to fix the code to work correctly]
...
Thanks,
Jaromir
------ Original Message ------
From "Jaromir Matas" mail@jaromir.net
To "Eliot Miranda" eliot.miranda@gmail.com; "The general-purpose 
Squeak developers list" squeak-dev@lists.squeakfoundation.org
Date 11/17/2023 10:15:17 AM
Subject [squeak-dev] Re: Resuming on BlockCannotReturn exception
...
Hi Eliot,
------ Original Message ------
From "Eliot Miranda" eliot.miranda@gmail.com
To "Jaromir Matas" mail@jaromir.net
Cc "The general-purpose Squeak developers list" 
squeak-dev@lists.squeakfoundation.org
Date 11/16/2023 11:52:45 PM
Subject Re: Re[2]: [squeak-dev] Re: Resuming on BlockCannotReturn 
exception
...
Hi Jaromir,
On Thu, Nov 16, 2023 at 2:22 PM Jaromir Matas mail@jaromir.net 
wrote:
...
Hi Nicolas, Eliot,
here's what I understand is happening (see the enclosed 
screenshot):

we fork a new process to evaluate [^1]
the new process evaluates [^1] which means instruction 18 is

being evaluated, hence pc points to instruction 19 now
3) however, the home context where ^1 should return to is gone by 
this time (the process that executed the fork has already returned

notice the two up arrows in the debugger screenshot)

the VM can't finish the instruction and returns control to the

image via placing the #cannotReturn: context on top of the [^1] 
context
5) #cannotReturn: evaluation results in signalling the BCR 
exception which is then handled by the #resume handler
    (in our debugged case the [:ex | self halt. ex resume] handler)
6) ex resume is evaluated, however, this means requesting the VM to 
evaluate instruction 19 of the [^1] context - which is past the 
last instruction of the context and the crash ensues
I wonder whether such situations could/should be prevented inside 
the VM or whether such an expectation is wrong for some reason.
As Nicolas says, IMO this is best done at the image level.
It could be prevented in the VM, but at great cost, and only 
partially.  The performance issue is that the last bytecode in a 
method is not marked in any way, and that to determine the last 
bytecode the bytecodes must be symbolically evaluated from the start 
of the method.  See implementors of endPC at the image level (which 
defer to the method trailer) and implementors of endPCOf: in the 
VMMaker code. Doing this every time execution commences is 
prohibitively expensive.  The "only partially" issue is that 
following the return instruction may be other valid bytecodes, but 
these are not a continuation.
Consider the following code in some block:
        [self expression ifTrue:
            [^1].
        ^2
The bytecodes for this are
    pushReceiver
    send #expression
    jumpFalse L1
    push 1
    methodReturnTop
L1
    push 2
    methodReturnTop
Clearly if expression is true these should be *no* continuation in 
which ^2 is executed.
Well, in that case there's a bug because the computation in the 
following example shouldn't continue past the [^1] block but it 
silently does:
`[[true ifTrue: [^ 1]] on: BlockCannotReturn do: #resume ] fork`
The bytecodes are
    push true
    jumpFalse L1
    push 1
    returnTop
L1
    push nil
    blockReturn
...
So even if the VM did try and detect whether the return was at the 
last block method, it would only work for special cases.
It seems to me the issue is simply that the context that cannot be 
returned from should be marked as dead (see Context>>isDead) by 
setting its pc to nil at some point, presumably after copying the 
actual return pc into the BlockCannotReturn exception, to avoid ever 
trying to resume the context.
Does this mean, in other words, that every context that returns 
should nil its pc to avoid being "wrongly" reused/executed in the 
future, which concerns primarily those being referenced somewhere 
hence potentially executable in the future, is that right?
Hypothetical question: would nilling the pc during returns "fix" the 
example?
Thanks a lot for helping me understand this.
Best,
Jaromir
...
...
Thanks,
Jaromir
<bdxuqalu.png>
------ Original Message ------
From "Eliot Miranda" eliot.miranda@gmail.com
To "Jaromir Matas" mail@jaromir.net; "The general-purpose Squeak 
developers list" squeak-dev@lists.squeakfoundation.org
Date 11/16/2023 6:48:43 PM
Subject Re: [squeak-dev] Re: Resuming on BlockCannotReturn 
exception
...
Hi Jaromir,
>On Nov 16, 2023, at 3:23 AM, Jaromir Matas mail@jaromir.net 
>wrote:
>
>
>Hi Nicloas,
>No no, I don't have any practical scenario in mind, I'm just 
>trying to understand why the VM is implemented like this, whether 
>there were a reason to leave this possibility of a crash, e.g. it 
>would slow down the VM to try to prevent such a dumb situation 
>(who would resume from BCR in his right mind? :) ) - or perhaps I 
>have overlooked some good reason to even keep this behavior in 
>the VM. That's all.
Let’s first understand what’s really happening. Presumably at tone 
point a context is resumed those pc is already at the block return 
bytecode (effectively, because it crashes in JITted code, but I 
bet the stack vm will crash also, but not as cleanly - it will try 
and execute the bytes in the encoded method trailer). So which 
method actually sends resume, and to what, and what state is 
resume’s receiver when resume is sent?
>
>Thanks for your reply.
>Regards,
>Jaromir
>
>
>
>
>------ Original Message ------
>From "Nicolas Cellier" nicolas.cellier.aka.nice@gmail.com
>To "Jaromir Matas" mail@jaromir.net; "The general-purpose 
>Squeak developers list" squeak-dev@lists.squeakfoundation.org
>Date 11/16/2023 7:20:20 AM
>Subject Re: [squeak-dev] Resuming on BlockCannotReturn exception
>
>>Hi Jaromir,
>>Is there a scenario where it would make sense to resume a 
>>BlockCannotReturn?
>>If not, I would suggest to protect at image side and override 
>>#resume.
>>
>>Le mer. 15 nov. 2023, 23:42, Jaromir Matas mail@jaromir.net a 
>>écrit :
>>>Hi Eliot, Christoph, All,
>>>
>>>It's known the following example crashes the VM. Is this an 
>>>intended behavior or a "tolerated bug"?
>>>
>>>`[[^ 1] on: BlockCannotReturn do: #resume] fork`
>>>
>>>I understand why it crashes: the non-local return has nowhere 
>>>to return to and so resuming the computation leads to a crash. 
>>>But why not raise another BCR exception to prevent the crash? 
>>>Potential infinite loop? Perhaps I'm just missing the purpose 
>>>of this behavior...
>>>
>>>Thanks for an explanation.
>>>
>>>Best,
>>>Jaromir
>>>
>>>--
>>>
>>>Jaromir Matas
>>>
>>>
>
--
_,,,^..^,,,_
best, Eliot
<Context-cannotReturn.st>

Re: Resuming on BlockCannotReturn exception

PS: re tests - I'll reserve some time this weekend :)

tim

tim

tim