Hmmm... even in a fresh 3.9-7067 image, I get a syntax error when trying to load from Monticello:
testProcessCxNothing more expected ->_1
dmsc inputKey: 'abcdefg'.
self should: [ dmsc inputKey = 'ABCDEFG' ].
self should: [ dmsc primaryTranslation = '' ].
self should: [ dmsc secondaryTranslation = '' ].
self should: [ dmsc skipCount = 0 ].
dmsc currentIndex: 3.
dmsc processC.
self should: [ dmsc primaryTranslation = 'K' ].
self should: [ dmsc secondaryTranslation = 'K' ].
self should: [ dmsc skipCount = 0 ]
Rob
On Fri, Jan 16, 2009 at 11:37 AM, Ron Teitelbaum Ron@usmedrec.com wrote:
Hi Ian,
This is from Wikipedia: http://en.wikipedia.org/wiki/Double_Metaphone
Use the Double Metaphone if you are interested in other languages.
The Double Metaphone search algorithm is a phonetic algorithm written by Lawrence Philips and is the second generation of his Metaphone algorithm. Its implementation was described in the June 2000 issue of C/C++ Users Journal.
It is called "Double" because it can return both a primary and a secondary code for a string; this accounts for some ambiguous cases as well as for multiple variants of surnames with common ancestry. For example, encoding the name "Smith" yields a primary code of SM0 and a secondary code of XMT, while the name "Schmidt" yields a primary code of XMT and a secondary code of SMT--both have XMT in common.
Double Metaphone tries to account for myriad irregularities in English of Slavic, Germanic, Celtic, Greek, French, Italian, Spanish, Chinese, and other origin. Thus it uses a much more complex ruleset for coding than its predecessor; for example, it tests for approximately 100 different contexts of the use of the letter C alone. In the introduction to his original journal article, Mr. Philips defended this complexity.
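To make the Smith/Schmidt example concrete, here is a minimal Python sketch of how two names can be compared once each has a primary and a secondary code. The helper name `codes_match` is hypothetical and the codes are simply the ones quoted above; this is not how the SoundsLike package itself is written.

```python
# Hypothetical helper: each name is represented by a
# (primary, secondary) pair of Double Metaphone codes.
def codes_match(a, b):
    """Return True if the two code pairs have at least one code in common."""
    return bool(set(a) & set(b))


# Codes taken from the Wikipedia excerpt above, not computed here.
smith = ("SM0", "XMT")
schmidt = ("XMT", "SMT")

print(codes_match(smith, schmidt))  # True, because both pairs contain XMT
```

Treating any shared code as a match is what lets variant spellings of one surname find each other, at the cost of some extra false positives.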
Hope that helps!
Ron Teitelbaum
-----Original Message----- From: Ian Trudel [mailto:ian.trudel@gmail.com] Sent: Friday, January 16, 2009 11:29 AM To: Ron@usmedrec.com; The general-purpose Squeak developers list Subject: Re: [squeak-dev] SoundsLike package released on SqueakSource
Sounds great, Ron! Does SoundsLike support only English phonetics, or is it possible to define and use phonetics from other languages? Let's say, multilingual. :)
Ian
2009/1/16 Ron Teitelbaum Ron@usmedrec.com:
Hello all,
I have released a new package on SqueakSource that contains SoundEx and DoubleMetaphone.
http://www.squeaksource.com/SoundsLike.html
This package adds soundsLike: to String
'titlebalm' soundsLike: 'teitelbaum'
You can also get the phoneticStringsFor: a word to store in a database for later queries.
PhoneticStringComparator defaultClass "SoundEx" new phoneticStringsFor: 'titlebalm'
#('T341')
DoubleMetaphoneStringComparator new phoneticStringsFor: 'teitelbaum'
#('TTLPM' 'TTLPM')
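For readers wondering where a code like T341 comes from, the following is a rough Python sketch of classic American Soundex. It is an illustration only: the function name `soundex` and the table `CODES` are made up for this example, and it is an assumption that the SoundsLike package follows exactly this variant of the rules.

```python
# Classic American Soundex; an illustrative sketch, not the SoundsLike source.
# Build the letter -> digit table; vowels, H, W and Y have no digit.
CODES = {}
for letters, digit in (("BFPV", "1"), ("CGJKQSXZ", "2"), ("DT", "3"),
                       ("L", "4"), ("MN", "5"), ("R", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word):
    """Return the 4-character Soundex code for word ('' if it has no letters)."""
    letters = [c for c in word.upper() if c.isalpha()]
    if not letters:
        return ""
    result = letters[0]            # the first letter is kept as-is
    last = CODES.get(letters[0])   # digit of the previous coded letter
    for ch in letters[1:]:
        digit = CODES.get(ch)
        # Emit a digit only when it differs from the previous one.
        if digit is not None and digit != last:
            result += digit
        # H and W do not separate equal digits; vowels do.
        if ch not in "HW":
            last = digit
    return (result + "000")[:4]    # pad with zeros, cut to 4 characters


print(soundex("titlebalm"))
print(soundex("teitelbaum"))
```

Both calls give the same code, which is why the misspelling still matches the real name.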
Let me know if you have any questions.
Regards,
Ron Teitelbaum
From: beginners-bounces@lists.squeakfoundation.org [mailto:beginners-bounces@lists.squeakfoundation.org] On Behalf Of Rob Rothwell Sent: Friday, January 16, 2009 6:35 AM To: Ron@usmedrec.com Cc: A friendly place to get answers to even the most basic questions about Squeak. Subject: Re: [Newbies] SoundEx algorithm
No rush...I am working on a custom recipe program for my wife and wanted to use it to search ingredients and still find something that was misspelled!
Thanks,
Rob
On Thu, Jan 15, 2009 at 10:10 PM, Ron Teitelbaum Ron@usmedrec.com wrote:
I did one but didn't release it anywhere. I did metaphone and double metaphone.
Maybe I can dig it up. When did you need it?
Ron Teitelbaum
From: beginners-bounces@lists.squeakfoundation.org [mailto:beginners-bounces@lists.squeakfoundation.org] On Behalf Of Rob Rothwell Sent: Monday, January 12, 2009 4:39 PM To: Beginners Sqeak Subject: [Newbies] SoundEx algorithm
Does anyone know of a SoundEx algorithm in Squeak before I go write one myself?
Never hurts to ask!
Rob