HP3000-L Archives

January 1995, Week 4

HP3000-L@RAVEN.UTC.EDU

Options: Use Monospaced Font
Show Text Part by Default
Show All Mail Headers

Message: [<< First] [< Prev] [Next >] [Last >>]
Topic: [<< First] [< Prev] [Next >] [Last >>]
Author: [<< First] [< Prev] [Next >] [Last >>]

Print Reply
Subject:
From:
Jim Wowchuk <[log in to unmask]>
Reply To:
Jim Wowchuk <[log in to unmask]>
Date:
Fri, 27 Jan 1995 08:43:50 +1100
Content-Type:
text/plain
Parts/Attachments:
text/plain (37 lines)
At 02:02 PM 26/1/95 EST, Jeff Kell wrote:
>Does anyone have, know of, or have source code for a "fuzzy" string
>match?  Not just a phonetic key (Soundex), but rather one which could
>tell you "how closely" two strings match?
>
>Ideally it would be "sort of" like a spelling checker, but extended to
>a string.  Simple transpositional errors (2 letters reversed), spelling,
>omitted substrings, etc., would be accounted for.
>
>[\] Jeff Kell, [log in to unmask]
 
We use the Arizona grep routines (agrep) on our 9000, but I've not tried
porting it to the 3000.  Specifically, it provides (amongst other things):
 
>1) the ability to search for approximate patterns;
>        for example, "agrep -2 homogenos foo" will find homogeneous as well
>        as any other word that can be obtained from homogenos with at most
>        2 substitutions, insertions, or deletions.
>        "agrep -B homogenos foo" will generate a message of the form
>        best match has 2 errors, there are 5 matches, output them? (y/n)
 
The algorithms used are all clearly described, so it might not be too much
effort to mangle to the 3000.
 
Unfortunately, I can't tell you how to find the source, other than at
Arizona University, Dept of Computer Science.  You are probably better tuned
into that than me anyway.
 
Lots of luck.
----
Jim Wowchuk                    Internet:    [log in to unmask]
Vanguard Computer Services     Compu$erve:  100036,106
 _--_|\                        Post:        PO Box 18, North Ryde, NSW 2113
/      \                       Phone:       +61 (2) 888-9688
\.--.__/ <---Sydney NSW        Fax:         +61 (2) 888-3056
      v      Australia

ATOM RSS1 RSS2