[Vm-dev] Re: Cog segmentation fault on Linux
Rob Withers
reefedjib at yahoo.com
Tue Jul 20 08:51:07 UTC 2010
Hi Eliot,
(I forgot to CC the mailing list - added)
I made a few things happen.
First, I found that the argument to -leakcheck is an integer that gets masked to determine whether to leak check an incremental or full GC. I made the call with '-leakcheck 7'.
Second, I added a -leakcheck section to the COGVM section:
#if COGVM
else if (!strcmp(argv[0], "-leakcheck")) {
extern sqInt checkForLeaks;
checkForLeaks = atoi(argv[1]);
return 2; }
I compiled and ran it. I am unsure where any output from the leak checker goes. If it is to stdout or stderr, I forget the magic incantation to redirect these to files. I think it is '2> stderr.txt 1> stdout.txt' for /bin/sh. Is that right?
So when I ran it, it runs (the new image with stepping button bar - takes 30% cpu). When I send 'kill -USR1 <pid>' it seg faults guaranteed. This may or may not be the original seg fault - it may be the leakchecker?
The only stuff I am doing that calls out of the image is socket stuff. This may or may not be in the middle of a call when it seg faults. I will work to turn off all the socket activity and see if it still seg faults.
Am I activating the leakchecker ok?
Regards,
Rob
From: Rob Withers
Sent: Monday, July 19, 2010 9:22 PM
To: Eliot Miranda
Subject: Re: [Vm-dev] Re: Cog segmentation fault on Linux
hey Eliot,
It looks like this command line argument, -leakcheck, is for the STACKVM, not the COGVM. Is this an issue?
Thanks,
Rob
#if STACKVM
else if (!strcmp(argv[0], "-eden")) {
extern sqInt desiredEdenBytes;
desiredEdenBytes = strtobkm(argv[1]);
return 2; }
else if (!strcmp(argv[0], "-leakcheck")) {
extern sqInt checkForLeaks;
checkForLeaks = atoi(argv[1]);
return 2; }
else if (!strcmp(argv[0], "-stackpages")) {
extern sqInt desiredNumStackPages;
desiredNumStackPages = atoi(argv[1]);
return 2; }
else if (!strcmp(argv[0], "-breaksel")) {
extern void setBreakSelector(char *);
setBreakSelector(argv[1]);
return 2; }
else if (!strcmp(argv[0], "-noheartbeat")) {
extern sqInt suppressHeartbeatFlag;
suppressHeartbeatFlag = 1;
return 1; }
#endif /* STACKVM */
From: Rob Withers
Sent: Monday, July 19, 2010 9:19 PM
To: Eliot Miranda
Cc: Squeak VM Dev
Subject: [Vm-dev] Re: Cog segmentation fault on Linux
--------------------------------------------------------------------------------
Hi Eliot,
Got home from my new job and started looking into this. It turns out that this morning I found that I had a button bar that was stepping and part of the step was a Smalltalk garbageCollect to force collection before checking for instances. It may be something I don't need to do anymore, however it helps expose this seg fault. Both stack dumps were in the garbageCollect. I removed the button bar, uploaded the image, and ran it. CPU% dropped from 33% to 2%. I let it run all day. At some point it exited, for an unknown reason, as it was gone when I returned tonight.
I have reinstated the button bar, to help this bug occur, and uploaded it to the server.
Now I just need to enable -leakcheck. From sqUnixMain.c it looks like it takes an argument. What is that argument?
Thanks,
Rob
From: Eliot Miranda
Sent: Monday, July 19, 2010 1:47 PM
To: Rob Withers
Cc: Squeak Virtual Machine Development Discussion
Subject: Re: Cog segmentation fault on Linux
Hi Rob,
On Mon, Jul 19, 2010 at 3:10 AM, Rob Withers <reefedjib at yahoo.com> wrote:
Eliot,
I am getting a segmentation fault running Cog headless on linux. Here is the stack dump. Below is a second stack dump that looks different.
While the heap corruption might be a bug in Cog it might also be heap corruption from external code (e.g. objects passed through FFI calls to external code that overwrites those objects' bounds).
There's a leak checker in Cog (see the -leakcheck argument in platforms/unix/vm/sqUnixMain.c) that can help you localise this. Its best to distrust your code before you distrust the VM, simply because thinking it's the VM can blind-side you to potential bugs in your own code or other parts of the system. The goal here is a reproducible case. If you get a reproducible case that doesn't use any external code then the bug is in the VM.
HTH
Eliot
HTH,
Rob
FIRST STACK DUMP
vawhigso at vawhigs.org [~/public_html/squeakelib/Cog]# kill -USR1 30247
vawhigso at vawhigs.org [~/public_html/squeakelib/Cog]#
Received user signal, printing active stack:
0xff940ab8 I SmalltalkImage>garbageCollect -1207282732: a(n) SmalltalkImage
0xff940ad0 M Introducer class>areVatsRunning -1144872564: a(n) Introducer class
0xff940ae8 M PluggableButtonMorph>getModelState -1134952004: a(n) PluggableButtonMorph
0xff940b00 M PluggableButtonMorph>update: -1134952004: a(n) PluggableButtonMorph
0xff940b1c M StepMessage(MessageSend)>value -1134947088: a(n) StepMessage
0xff940b38 M StepMessage(MorphicAlarm)>value: -1134947088: a(n) StepMessage
0xff940b64 M WorldState>runLocalStepMethodsIn: -1215450852: a(n) WorldState
0xff940b90 M WorldState>runStepMethodsIn: -1215450852: a(n) WorldState
0xff940bac M PasteUpMorph>runStepMethods -1215450600: a(n) PasteUpMorph
0xff940bc8 M WorldState>doOneCycleNowFor: -1215450852: a(n) WorldState
0xff940be4 M WorldState>doOneCycleFor: -1215450852: a(n) WorldState
0xff940c00 M PasteUpMorph>doOneCycle -1215450600: a(n) PasteUpMorph
0xff940c20 I [] in Project class>spawnNewProcess -1138060792: a(n) Project class
-1133485544 s [] in
Segmentation fault
Smalltalk stack dump:
0xff940ab8 I SmalltalkImage>garbageCollect -1207282732: a(n) SmalltalkImage
0xff940ad0 M Introducer class>areVatsRunning -1144872564: a(n) Introducer class
0xff940ae8 M PluggableButtonMorph>getModelState -1134952004: a(n) PluggableButtonMorph
0xff940b00 M PluggableButtonMorph>update: -1134952004: a(n) PluggableButtonMorph
0xff940b1c M StepMessage(MessageSend)>value -1134947088: a(n) StepMessage
0xff940b38 M StepMessage(MorphicAlarm)>value: -1134947088: a(n) StepMessage
0xff940b64 M WorldState>runLocalStepMethodsIn: -1215450852: a(n) WorldState
0xff940b90 M WorldState>runStepMethodsIn: -1215450852: a(n) WorldState
0xff940bac M PasteUpMorph>runStepMethods -1215450600: a(n) PasteUpMorph
0xff940bc8 M WorldState>doOneCycleNowFor: -1215450852: a(n) WorldState
0xff940be4 M WorldState>doOneCycleFor: -1215450852: a(n) WorldState
0xff940c00 M PasteUpMorph>doOneCycle -1215450600: a(n) PasteUpMorph
0xff940c20 I [] in Project class>spawnNewProcess -1138060792: a(n) Project class
SECOND STACK DUMP
vawhigso at vawhigs.org [~/public_html/squeakelib/Cog]# kill -USR1 7340
Received user signal, printing active stack:
vawhigso at vawhigs.org [~/public_html/squeakelib/Cog]# 0xffaf3398 I SmalltalkImage>garbageCollect -1207897132: a(n) SmalltalkImage
0xffaf33b0 M Introducer class>areVatsRunning -1145486964: a(n) Introducer class
0xffaf33c8 M PluggableButtonMorph>getModelState -1135564980: a(n) PluggableButtonMorph
0xffaf33e0 M PluggableButtonMorph>update: -1135564980: a(n) PluggableButtonMorph
0xffaf33fc M StepMessage(MessageSend)>value -1135561416: a(n) StepMessage
0xffaf3418 M StepMessage(MorphicAlarm)>value: -1135561416: a(n) StepMessage
0xffaf3444 M WorldState>runLocalStepMethodsIn: -1216065252: a(n) WorldState
0xffaf3470 M WorldState>runStepMethodsIn: -1216065252: a(n) WorldState
0xffaf348c M PasteUpMorph>runStepMethods -1216065000: a(n) PasteUpMorph
0xffaf34a8 M WorldState>doOneCycleNowFor: -1216065252: a(n) WorldState
0xffaf34c4 M WorldState>doOneCycleFor: -1216065252: a(n) WorldState
0xffaf34e0 M PasteUpMorph>doOneCycle -1216065000: a(n) PasteUpMorph
0xffaf3500 I [] in Project class>spawnNewProcess -1138675192: a(n) Project class
-1134099944 s [] in BlockClosure>newProcess
Received user signal, printing all processes:
Process 0xbc670278 priority 40
0xffaf3398 I SmalltalkImage>garbageCollect -1207897132: a(n) SmalltalkImage
0xffaf33b0 M Introducer class>areVatsRunning -1145486964: a(n) Introducer class
0xffaf33c8 M PluggableButtonMorph>getModelState -1135564980: a(n) PluggableButtonMorph
0xffaf33e0 M PluggableButtonMorph>update: -1135564980: a(n) PluggableButtonMorph
0xffaf33fc M StepMessage(MessageSend)>value -1135561416: a(n) StepMessage
0xffaf3418 M StepMessage(MorphicAlarm)>value: -1135561416: a(n) StepMessage
0xffaf3444 M WorldState>runLocalStepMethodsIn: -1216065252: a(n) WorldState
0xffaf3470 M WorldState>runStepMethodsIn: -1216065252: a(n) WorldState
0xffaf348c M PasteUpMorph>runStepMethods -1216065000: a(n) PasteUpMorph
0xffaf34a8 M WorldState>doOneCycleNowFor: -1216065252: a(n) WorldState
0xffaf34c4 M WorldState>doOneCycleFor: -1216065252: a(n) WorldState
0xffaf34e0 M PasteUpMorph>doOneCycle -1216065000: a(n) PasteUpMorph
0xffaf3500 I [] in Project class>spawnNewProcess -1138675192: a(n) Project class
-1134099944 s [] in BlockClosure>newProcess
Process 0xbc97fc44 priority 50
0xffafa4c0 M WeakArray class>finalizationProcess -1210174624: a(n) WeakArray class
0xffafa4e0 I [] in WeakArray class>restartFinalizationProcess -1210174624: a(n) WeakArray class
0xffafa500 I [] in BlockClosure>newProcess -1130890396: a(n) BlockClosure
Process 0xb8125518 priority 80
widowed caller frame
EventualProcess 0xbbbd6e10 priority 60
-1134095516 s [] in Delay>wait
-1134049404 s BlockClosure>ifCurtailed:
-1134095648 s Delay>wait
-1134049312 s [] in VatTPManager class>finalizationLoop
-1145210696 s BlockClosure>repeat
-1145213336 s VatTPManager class>finalizationLoop
-1145213520 s [] in VatTPManager class>?
EventualProcess 0xbc5c9174 priority 30
-1134783048 s SharedQueue>next
-1134783140 s [] in Vat>processSends
-1134751716 s BlockClosure>ifCurtailed:
-1134783276 s Vat>processSends
-1134783984 s [] in EventualProcess>setupContext
Process 0xbc86c01c priority 60
0xffaf44c0 I RFBEventSensor(InputSensor)>userInterruptWatcher -1130865472: a(n) RFBEventSensor
0xffaf44e0 I [] in RFBEventSensor(InputSensor)>installInterruptWatcher -1130865472: a(n) RFBEventSensor
0xffaf4500 I [] in BlockClosure>newProcess -1132019908: a(n) BlockClosure
Process 0xbc86c1dc priority 60
widowed caller frame 8TÅSĸ"Ä»r·Zžð·ð·Zžð·ļ§r·TTÅSĸÅr·ĻZžZžÃ,ž4Zžir·tTÅSĸT·Ã,žüYžÃ,žÄ?\žð··TÅSĸĶ·sZžðZžð·8·s·Ä?TÅSĸ(õq·$ZžĪ*îZžüYžÃ,žÄ?\ž ]Ä'·
Process 0xbc86c3c8 priority 60
0xffaf74c0 I SmalltalkImage>lowSpaceWatcher -1207897132: a(n) SmalltalkImage
0xffaf74e0 I [] in SmalltalkImage>installLowSpaceWatcher -1207897132: a(n) SmalltalkImage
0xffaf7500 I [] in BlockClosure>newProcess -1132018968: a(n) BlockClosure
Process 0xbc985ef0 priority 60
widowed caller frame HÃ"ÅSĸúq·Ä?\žð·ð·Ä?\žð· úq·lÃ"ÅSĸ(õq·Ã"\žÄ?\žD\žļ·
Segmentation fault
Can't dump Smalltalk stack. Not in VM thread
Most recent primitives
wait
signal
millisecondClockValue
wait
signal
at:put:
at:put:
at:put:
at:put:
at:put:
at:put:
at:put:
at:put:
perform:with:
basicNew:
basicNew
value:
millisecondClockValue
basicNew
basicNew
new:
at:put:
at:put:
at:put:
basicNew
basicNew
basicNew
basicNew:
at:put:
replaceFrom:to:with:startingAt:
replaceFrom:to:with:startingAt:
basicNew:
at:put:
replaceFrom:to:with:startingAt:
replaceFrom:to:with:startingAt:
species
basicNew:
replaceFrom:to:with:startingAt:
compare:with:collated:
at:put:
at:put:
at:put:
at:put:
at:put:
at:put:
at:put:
at:put:
perform:withArguments:
perform:
species
basicNew:
basicAt:put:
basicAt:put:
basicAt:put:
basicAt:put:
basicAt:put:
basicAt:put:
basicAt:put:
basicAt:put:
basicAt:put:
species
basicNew:
basicReplaceFrom:to:with:startingAt:
species
basicNew:
basicAt:put:
basicAt:put:
species
basicNew:
basicReplaceFrom:to:with:startingAt:
species
basicNew:
basicAt:put:
species
basicNew:
basicReplaceFrom:to:with:startingAt:
new:
basicNew
at:put:
at:put:
at:put:
new:
basicNew
at:put:
at:put:
at:put:
new:
basicNew
at:put:
at:put:
at:put:
new:
basicNew
at:put:
at:put:
at:put:
primitiveGarbageCollect
millisecondClockValue
signal
at:put:
at:put:
at:put:
at:put:
at:put:
at:put:
suspend
primitiveResume
at:put:
at:put:
at:put:
at:put:
suspend
primitiveResume
at:put:
at:put:
primSignal:atMilliseconds:
millisecondClockValue
wait
millisecondClockValue
millisecondClockValue
wait
signal
at:put:
at:put:
millisecondClockValue
primSignal:atMilliseconds:
millisecondClockValue
wait
value
wait
signal
wait
value
signal
millisecondClockValue
primSignal:atMilliseconds:
millisecondClockValue
wait
signal
primSocketConnectionStatus:
millisecondClockValue
basicNew:
byteAt:put:
byteAt:put:
species
basicNew:
replaceFrom:to:with:startingAt:
replaceFrom:to:with:startingAt:
species
basicNew:
replaceFrom:to:with:startingAt:
replaceFrom:to:with:startingAt:
basicNew
findNextHandlerContextStarting
tempAt:
tempAt:
tempAt:put:
valueNoContextSwitch
tempAt:
valueWithArguments:
findNextUnwindContextUpTo:
tempAt:
tempAt:put:
tempAt:
terminateTo:
value
tempAt:put:
findNextUnwindContextUpTo:
terminateTo:
primSocketConnectionStatus:
value
value
millisecondClockValue
primSocketConnectionStatus:
millisecondClockValue
millisecondClockValue
basicNew
valueNoContextSwitch
millisecondClockValue
wait
signal
at:put:
at:put:
at:put:
millisecondClockValue
primSignal:atMilliseconds:
millisecondClockValue
wait
signal
wait
basicNew
new:
someInstance
nextInstance
at:put:
species
new:
replaceFrom:to:with:startingAt:
at:put:
at:put:
at:put:
at:put:
at:put:
at:put:
at:put:
at:put:
perform:withArguments:
perform:
species
basicNew:
basicAt:put:
basicAt:put:
basicAt:put:
basicAt:put:
basicAt:put:
basicAt:put:
basicAt:put:
basicAt:put:
basicAt:put:
species
basicNew:
basicReplaceFrom:to:with:startingAt:
species
basicNew:
basicAt:put:
basicAt:put:
species
basicNew:
basicReplaceFrom:to:with:startingAt:
species
basicNew:
basicAt:put:
species
basicNew:
basicReplaceFrom:to:with:startingAt:
new:
basicNew
at:put:
at:put:
at:put:
new:
basicNew
at:put:
at:put:
at:put:
new:
basicNew
at:put:
at:put:
at:put:
new:
basicNew
at:put:
at:put:
at:put:
primitiveGarbageCollect
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.squeakfoundation.org/pipermail/vm-dev/attachments/20100720/6611618c/attachment-0001.htm
More information about the Vm-dev
mailing list