[Q] Project: Better performance for LargeIntegers

Mon Nov 1 20:41:23 UTC 1999

> -----Original Message-----
> From: Andrew C. Greenberg [mailto:werdna at gate.net]
> Sent: Samstag, 30. Oktober 1999 14:54
> To: squeak at cs.uiuc.edu
> Subject: RE: [Q] Project: Better performance for LargeIntegers
>
>
> > > >Dear Squeakers,
> > > >
> > > >I have changed my mind a little bit after studying some
> Squeak sources.
> > > >
> > > >Now I plan to implement LargeInteger primitives (20-39)
> > > _without_ changing
> > > >the representation of LargeIntegers by extending the Interpreter
> > > arithmetic
> > > >primitives using the CCodeGenerator.
> > > >This avoids some problems dealing with different platforms.
> > > >
> > > >In a later step changing the representation to bigger
> 'digits' (16,32,..
> > > >bits?) _could_ be interesting.
> > >
> > > Perhaps 16 bits, but unlikely 32.  How, for example, would you handle
> > > integer overflow when doing, say, the LargeInteger add?
> >
> >My first idea was to take one small and one big C-integer type, represent
> >digits with the small and perform the operations with the bigger
> one. If the
> >bigger one is twice as big as the smaller there are no problems
> with mult.
>
> I agree that would work for 16-bit digits, but you need stronger
> stuff for 32.   A downside is that this would not work in any of the
> simulated Smalltalk, since SmallIntegers are not full 32-bit values.
> The resulting Smalltalk code would only be useful for compilation.

I agree.

I've forgotten to say that at the time of 'my first idea' I didn't thought
of using the CCodeGenerator... I wanted to do the whole stuff in C.

>
> >I think on all machines with ANSI-C these conditions are met; at
> _least_ we
> >could take an 8 bit val for representation and make the computation in 16
> >bit, at best - so far - representation in 32 and computation in 64.
>
> Indeed, that's exactly what is done right now.  bytes are accumulated
> by addition and multiplication into SmallIntegers.
>
> >But here we introduce platform specific stuff, which has to be
> handled. By
> >using the Smalltalk to C compiler without changing the current
> >representation we are _totally_ platform independent (no #defines in C at
> >all) and in spite of this hopefully much faster than without
> compiling in C.
>
> I don't understand.  Of course we would use the Smalltalk to C
> compiler, but won't will still have to compile the C code?  It does
> raise a byte endian gender issue, that inside the C-code is
> irrelevant, even though different platforms will execute differently,
> but for the remaining smalltalk code than manipulates digits of the
> object will need to address.  A simple solution is to implement a
> digitAt: function as well, so that your plugin or core code will work
> fine across platforms, bigendian or otherwise.

'Without changing the current representation' and building the C-code in
analogue to the existing Smalltalk implementation I cannot see any byte
endian problem, this approach includes just exactly 'A simple solution is to
implement a digitAt: function as well'.

>
> I see no reason why any implementation need be platform specific.
> Done properly, the generated code should work fine with any
> conforming C compiler supporting 32-bit ints.  (Note that Squeak does
> expect the C compiler to treat  ints as 32-bitters.)
>

I've thought over the problem that there are the ANSI-C assertions (formally
lazy written)
	sizeof(char) == 1 < 2 <= sizeof(short) <= sizeof(int) <= sizeof(long)
&&	2 <= sizeof(short) <= sizeof(int)
&&	4 <= sizeof(long)
, but it's possible to have sizeof(short) == sizeof(int) == sizeof(long) for
example. OK, I don't know such a computer... But what's needed is just a
smaller data type for representation as for computing to totally do this in
C.

Yet another problem: Pi * thumb I think it could be a problem to transfer
the internal representation of a very big number by big - say 16 Bit -
'digits' in a printable 10 based one. I guess that there is needed some kind
of string computation, isn't it?

>
> > >  And how
> > > could you do the multiply without bit munging?  Is it possible that a
> > > straight loop with carry would be much faster than all the masking
> > > and shifting necessary to do a carry written in C?
> > >
> >
> >That could be, but then you'd introduce assembler, what is
> _very_ platform
> >dependent.
> >
> >
> >Because I've never compiled a Squeak so far I think it's best
> for me first
> >to take the straigthforward approach and then to think of other
> >representations...
>
> Are you planning to rewrite as numbered primitives for incorporation
> in the VM?  Why not make a pluggable primitive instead, so that folks
> who don't need speedier longs (most users) won't need to have the
> code in their VM footprint?  Done right, the code should run
> identically, albeit slower, without the plugin installed as it does
> when the plugin is installed.
>

Later about plugins versus primitives...