Hi, Chris:

Soundex was originally developed for the Census Bureau, and was intended only to encode the "sound" of "names" (family names). Also, it was developed well before computers, and was originally intended for humans to translate written names into the Soundex (index) codes, and then used to "file" paper documents under this index code.

I would argue that "Your City Car Dealer" should be treated as four words, and should result in four separate SoundEx "keys". Then, you could design a database with a long enough "key" so you could "concatenate" the keys... you get the idea?

Regards,
Mark S. Waterbury

----- Original Message -----
From: "Chris Bipes" <chris.bipes@cross-check.com>
To: <midrange-l@midrange.com>
Sent: Monday, November 04, 2002 4:04 PM
Subject: RE: Soundex routine in RPG

> Why truncate to 4 positions? This is what has bothered me about Soundex.
> We have business names that we want to encode. This could be such things as
> "Your City Car Dealer" or "Chain Store #25" I don't want to code on only
> the first part of the name, i.e. "Santa Rosa Car Dealer" would code on
> "Santa" only. So I like to compress out the spaces and code on the full
> word. I also strip off basic words like "The". Is there any reason for
> keeping it to 4 positions?
>
> Chris
>
> -----Original Message-----
> From: Loyd Goodbar [mailto:loyd@blackrobes.net]
>
> Basically, you cycle each letter. Always keep the first letter, then look up
> a number for each consecutive letter, eliminating repeating values. Vowels and
> w/h/y are ignored. The resulting value is four positions long. Pad up to four
> positions with zeroes, or truncate to four positions if you created a long
> soundex.
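
For illustration, the procedure quoted above and the one-key-per-word idea could be sketched roughly as follows. This is Python rather than RPG, and it is not code from the thread. The letter-to-digit table and the rule that H and W do not break a run of equal codes come from the common American Soundex variant, which is an assumption here since the thread does not spell them out. The names `soundex` and `business_key` and the `stop_words` parameter are hypothetical.

```python
# Letter-to-digit table of the common American Soundex variant
# (assumed; the thread does not list the table).
_CODES = {}
for _letters, _digit in (("BFPV", "1"), ("CGJKQSXZ", "2"), ("DT", "3"),
                         ("L", "4"), ("MN", "5"), ("R", "6")):
    for _ch in _letters:
        _CODES[_ch] = _digit


def soundex(word, length=4):
    """Encode one word; returns "" if the word contains no letters A-Z."""
    letters = [c for c in word.upper() if "A" <= c <= "Z"]
    if not letters:
        return ""
    key = letters[0]                      # always keep the first letter
    prev = _CODES.get(letters[0], "")     # its code still suppresses repeats
    for ch in letters[1:]:
        code = _CODES.get(ch, "")         # vowels and w/h/y have no code
        if code and code != prev:         # drop repeated values
            key += code
        if ch not in "HW":                # H and W do not break a run
            prev = code
    # Truncate a long key, or pad a short one with zeroes.
    return key[:length].ljust(length, "0")


def business_key(name, stop_words=("THE",)):
    """Concatenate one Soundex key per word, skipping filler words."""
    keys = []
    for word in name.split():
        if word.upper() in stop_words:
            continue
        key = soundex(word)
        if key:                           # tokens such as "#25" yield no key
            keys.append(key)
    return "".join(keys)


print(business_key("Your City Car Dealer"))   # Y600C300C600D460
print(business_key("Santa Rosa Car Dealer"))  # S530R200C600D460
```

With this sketch a four-word name yields a 16-position key, so the database key field would need to be sized for the longest name expected. Note that a token with no letters, such as "#25", contributes nothing, so "Chain Store #25" and "Chain Store #26" would share the same key.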