[squeak-dev] SoundsLike package released on SqueakSource

Ron Teitelbaum Ron at USMedRec.com
Sat Jan 17 05:14:37 UTC 2009


Sorry about that I'll have a look and fix it over the weekend.  I run it in
3.9 also so it should work. 

 

Ron

 

  _____  

From: Rob Rothwell [mailto:r.j.rothwell at gmail.com] 
Sent: Friday, January 16, 2009 8:32 PM
To: Ron at usmedrec.com; The general-purpose Squeak developers list
Cc: Ian Trudel
Subject: Re: [squeak-dev] SoundsLike package released on SqueakSource

 

Hmmm...even in a 3.9-7067 fresh image, I get a Syntax error when trying to
load from Monticello:

testProcessCxNothing more expected ->_1
    dmsc inputKey: 'abcdefg'.
    self should: [ dmsc inputKey = 'ABCDEFG' ].

    self should: [ dmsc primaryTranslation = '' ].
    self should: [ dmsc secondaryTranslation = '' ].
    self should: [ dmsc skipCount = 0 ].
    dmsc currentIndex: 3.

    dmsc processC.

    self should: [ dmsc primaryTranslation = 'K' ].
    self should: [ dmsc secondaryTranslation = 'K' ].
    self should: [ dmsc skipCount = 0 ]

Rob

On Fri, Jan 16, 2009 at 11:37 AM, Ron Teitelbaum <Ron at usmedrec.com> wrote:

Hi Ian,

This is from WikiPedia: http://en.wikipedia.org/wiki/Double_Metaphone

Use the Double Metaphone if you are interested in other languages.

The Double Metaphone search algorithm is a phonetic algorithm written by
Lawrence Philips and is the second generation of his Metaphone algorithm.
Its implementation was described in the June 2000 issue of C/C++ Users
Journal.

It is called "Double" because it can return both a primary and a secondary
code for a string; this accounts for some ambiguous cases as well as for
multiple variants of surnames with common ancestry. For example, encoding
the name "Smith" yields a primary code of SM0 and a secondary code of XMT,
while the name "Schmidt" yields a primary code of XMT and a secondary code
of SMT--both have XMT in common.

Double Metaphone tries to account for myriad irregularities in English of
Slavic, Germanic, Celtic, Greek, French, Italian, Spanish, Chinese, and
other origin. Thus it uses a much more complex ruleset for coding than its
predecessor; for example, it tests for approximately 100 different contexts
of the use of the letter C alone. In the introduction to his original
journal article, Mr. Philips defended this complexity.

Hope that helps!

Ron Teitelbaum


> -----Original Message-----
> From: Ian Trudel [mailto:ian.trudel at gmail.com]
> Sent: Friday, January 16, 2009 11:29 AM
> To: Ron at usmedrec.com; The general-purpose Squeak developers list
> Subject: Re: [squeak-dev] SoundsLike package released on SqueakSource
>
> Sounds great, Ron! Has SoundsLike only English phonetics or it is
> possible to define and use phonetics from other languages? Let's say,
> multilingual. :)
>
> Ian
>
> 2009/1/16 Ron Teitelbaum <Ron at usmedrec.com>:
> > Hello all,
> >
> >
> >
> > I have released a new package on SqueakSource that contains SoundEx and
> > DoubleMetaphone.
> >
> >
> >
> > http://www.squeaksource.com/SoundsLike.html
> >
> >
> >
> > This package adds soundsLike: to string
> >
> >
> >
> > 'titlebalm' soundsLike: 'teitelbaum'
> >
> >
> >
> > You can also get the phenoticStringsFor: a word to store in a database
> for
> > later queries.
> >
> >
> >
> > PhoneticStringComparator defaultClass "SoundEx" new phoneticStringsFor:
> > 'titlebalm' #('T341')
> >
> >
> >
> > DoubleMetaphoneStringComparator new phoneticStringsFor: 'teitelbaum'
> > #('TTLPM' 'TTLPM')
> >
> >
> >
> > Let me know if you have any questions.
> >
> >
> >
> > Regards,
> >
> >
> >
> > Ron Teitelbaum
> >
> >
> >
> >
> >
> > ________________________________
> >
> > From: beginners-bounces at lists.squeakfoundation.org
> > [mailto:beginners-bounces at lists.squeakfoundation.org] On Behalf Of Rob
> > Rothwell
> > Sent: Friday, January 16, 2009 6:35 AM
> > To: Ron at usmedrec.com
> > Cc: A friendly place to get answers to even the most basic
> > questionsaboutSqueak.
> > Subject: Re: [Newbies] SoundEx algorithm
> >
> >
> >
> > No rush...I am working on a custom recipe program for my wife and wanted
> to
> > use it to search ingredients and still find something that was
> misspelled!
> >
> > Thanks,
> >
> > Rob
> >
> > On Thu, Jan 15, 2009 at 10:10 PM, Ron Teitelbaum <Ron at usmedrec.com>
> wrote:
> >
> > I did one but didn't release it anywhere.  I did metaphone and double
> > metaphone.
> >
> >
> >
> > Maybe I can dig it up.  When did you need it?
> >
> >
> >
> > Ron Teitelbaum
> >
> >
> >
> > ________________________________
> >
> > From: beginners-bounces at lists.squeakfoundation.org
> > [mailto:beginners-bounces at lists.squeakfoundation.org] On Behalf Of Rob
> > Rothwell
> > Sent: Monday, January 12, 2009 4:39 PM
> > To: Beginners Sqeak
> > Subject: [Newbies] SoundEx algorithm
> >
> >
> >
> > Does anyone know of a SoundEx algorithm in Squeak before I go write one
> > myself?
> >
> > Never hurts to ask!
> >
> > Rob
> >
> >
> >
> >
> >



 

-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.squeakfoundation.org/pipermail/squeak-dev/attachments/20090117/a8bb5191/attachment.htm


More information about the Squeak-dev mailing list