[squeak-dev] SoundsLike package released on SqueakSource

Ron Teitelbaum Ron at USMedRec.com
Fri Jan 16 16:37:55 UTC 2009


Hi Ian,

This is from WikiPedia: http://en.wikipedia.org/wiki/Double_Metaphone 

Use the Double Metaphone if you are interested in other languages. 

The Double Metaphone search algorithm is a phonetic algorithm written by
Lawrence Philips and is the second generation of his Metaphone algorithm.
Its implementation was described in the June 2000 issue of C/C++ Users
Journal.

It is called "Double" because it can return both a primary and a secondary
code for a string; this accounts for some ambiguous cases as well as for
multiple variants of surnames with common ancestry. For example, encoding
the name "Smith" yields a primary code of SM0 and a secondary code of XMT,
while the name "Schmidt" yields a primary code of XMT and a secondary code
of SMT--both have XMT in common.

Double Metaphone tries to account for myriad irregularities in English of
Slavic, Germanic, Celtic, Greek, French, Italian, Spanish, Chinese, and
other origin. Thus it uses a much more complex ruleset for coding than its
predecessor; for example, it tests for approximately 100 different contexts
of the use of the letter C alone. In the introduction to his original
journal article, Mr. Philips defended this complexity.

Hope that helps!

Ron Teitelbaum

> -----Original Message-----
> From: Ian Trudel [mailto:ian.trudel at gmail.com]
> Sent: Friday, January 16, 2009 11:29 AM
> To: Ron at usmedrec.com; The general-purpose Squeak developers list
> Subject: Re: [squeak-dev] SoundsLike package released on SqueakSource
> 
> Sounds great, Ron! Has SoundsLike only English phonetics or it is
> possible to define and use phonetics from other languages? Let's say,
> multilingual. :)
> 
> Ian
> 
> 2009/1/16 Ron Teitelbaum <Ron at usmedrec.com>:
> > Hello all,
> >
> >
> >
> > I have released a new package on SqueakSource that contains SoundEx and
> > DoubleMetaphone.
> >
> >
> >
> > http://www.squeaksource.com/SoundsLike.html
> >
> >
> >
> > This package adds soundsLike: to string
> >
> >
> >
> > 'titlebalm' soundsLike: 'teitelbaum'
> >
> >
> >
> > You can also get the phenoticStringsFor: a word to store in a database
> for
> > later queries.
> >
> >
> >
> > PhoneticStringComparator defaultClass "SoundEx" new phoneticStringsFor:
> > 'titlebalm' #('T341')
> >
> >
> >
> > DoubleMetaphoneStringComparator new phoneticStringsFor: 'teitelbaum'
> > #('TTLPM' 'TTLPM')
> >
> >
> >
> > Let me know if you have any questions.
> >
> >
> >
> > Regards,
> >
> >
> >
> > Ron Teitelbaum
> >
> >
> >
> >
> >
> > ________________________________
> >
> > From: beginners-bounces at lists.squeakfoundation.org
> > [mailto:beginners-bounces at lists.squeakfoundation.org] On Behalf Of Rob
> > Rothwell
> > Sent: Friday, January 16, 2009 6:35 AM
> > To: Ron at usmedrec.com
> > Cc: A friendly place to get answers to even the most basic
> > questionsaboutSqueak.
> > Subject: Re: [Newbies] SoundEx algorithm
> >
> >
> >
> > No rush...I am working on a custom recipe program for my wife and wanted
> to
> > use it to search ingredients and still find something that was
> misspelled!
> >
> > Thanks,
> >
> > Rob
> >
> > On Thu, Jan 15, 2009 at 10:10 PM, Ron Teitelbaum <Ron at usmedrec.com>
> wrote:
> >
> > I did one but didn't release it anywhere.  I did metaphone and double
> > metaphone.
> >
> >
> >
> > Maybe I can dig it up.  When did you need it?
> >
> >
> >
> > Ron Teitelbaum
> >
> >
> >
> > ________________________________
> >
> > From: beginners-bounces at lists.squeakfoundation.org
> > [mailto:beginners-bounces at lists.squeakfoundation.org] On Behalf Of Rob
> > Rothwell
> > Sent: Monday, January 12, 2009 4:39 PM
> > To: Beginners Sqeak
> > Subject: [Newbies] SoundEx algorithm
> >
> >
> >
> > Does anyone know of a SoundEx algorithm in Squeak before I go write one
> > myself?
> >
> > Never hurts to ask!
> >
> > Rob
> >
> >
> >
> >
> >




More information about the Squeak-dev mailing list