Weighted indexing

Chris Muller afunkyobject at yahoo.com
Thu Dec 8 16:32:47 UTC 2005


> What you want, of course, is weighted indexing. Words in #subject
> should have a higher weight than words in the message body, which
> should have a higher weight than words in attachments, which should
> have a higher weight than words in quoted messages (">" lines).
> 
> Probably the thing to do is build an index that holds two integers -
> weight and object - instead of a single one, or build an index per
> weight (probably easier to do in Magma).

Yes, an index per weight is how I have designed it so far. But adding a
#weight field to MaHashIndexRecord, so that each entry in the index holds two
Integers, one oid and one weight, as you said, would not be hard.  Is this
something that would facilitate powerful indexing algorithms?
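
To make the two layouts concrete, here is a minimal sketch in Python rather
than Smalltalk. It is not Magma code: the class names, the numeric weights,
and the choice to score a hit by summing weights are all assumptions made
only for illustration. It compares an index per weight with a single index
whose entries carry an (oid, weight) pair.

```python
from collections import defaultdict

# Hypothetical weights; the thread only fixes their relative order.
WEIGHTS = {"subject": 8, "body": 4, "attachment": 2, "quoted": 1}


def rank(scores):
    """Order (oid, score) pairs by descending score, then ascending oid."""
    return sorted(scores.items(), key=lambda item: (-item[1], item[0]))


class PerWeightIndexes:
    """One separate index per weight: weight -> word -> set of oids."""

    def __init__(self):
        self.indexes = defaultdict(lambda: defaultdict(set))

    def add(self, word, oid, weight):
        self.indexes[weight][word].add(oid)

    def search(self, word):
        scores = defaultdict(int)
        # Every per-weight index has to be consulted and the hits merged.
        for weight, index in self.indexes.items():
            for oid in index.get(word, ()):
                scores[oid] += weight
        return rank(scores)


class WeightedRecordIndex:
    """A single index whose entries hold two integers: oid and weight."""

    def __init__(self):
        self.index = defaultdict(list)

    def add(self, word, oid, weight):
        self.index[word].append((oid, weight))

    def search(self, word):
        scores = defaultdict(int)
        # One lookup returns every hit together with its weight.
        for oid, weight in self.index.get(word, ()):
            scores[oid] += weight
        return rank(scores)


def index_message(index, oid, fields):
    """Index each distinct word of each field under that field's weight."""
    for field, text in fields.items():
        for word in set(text.lower().split()):
            index.add(word, oid, WEIGHTS[field])


MESSAGES = {
    1: {"subject": "weighted indexing", "body": "magma"},
    2: {"subject": "magma", "body": "indexing in magma",
        "quoted": "weighted indexing"},
}

per_weight = PerWeightIndexes()
weighted_records = WeightedRecordIndex()
for oid, fields in MESSAGES.items():
    index_message(per_weight, oid, fields)
    index_message(weighted_records, oid, fields)

print(per_weight.search("indexing"))        # [(1, 8), (2, 5)]
print(weighted_records.search("indexing"))  # [(1, 8), (2, 5)]
```

Under these assumptions both layouts rank the same way. The difference is
where the work goes: the per-weight layout needs one lookup per weight and a
merge, while the weighted record answers from a single lookup at the cost of
a second integer in every entry.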


