HP3000-L Archives

July 2002, Week 2

HP3000-L@RAVEN.UTC.EDU

Options: Use Monospaced Font
Show Text Part by Default
Show All Mail Headers

Message: [<< First] [< Prev] [Next >] [Last >>]
Topic: [<< First] [< Prev] [Next >] [Last >>]
Author: [<< First] [< Prev] [Next >] [Last >>]

Print Reply
Subject:
From:
Michael Abootorab <[log in to unmask]>
Reply To:
Michael Abootorab <[log in to unmask]>
Date:
Fri, 12 Jul 2002 18:12:07 -0400
Content-Type:
text/plain
Parts/Attachments:
text/plain (91 lines)
I don't have access to hp3000 , but I am betting perl would beat all
other methods , including a compiled cobol or c program.

Michael

On Fri, 12 Jul 2002 17:46:08 -0400, Porter, Allen
<[log in to unmask]> wrote:

>Lot's of good ideas, has anyone done any kind of speed comparisons?
>
>-----Original Message-----
>From: Michael Abootorab [mailto:[log in to unmask]]
>Sent: Friday, July 12, 2002 4:34 PM
>To: [log in to unmask]
>Subject: Re: [HP3000-L] Deduping files
>
>
>if you have suprtool then table lookup is the fastest way.
>
>if not , use a short perl script.
>
>thanks
>Michael
>
>
>
>On Fri, 12 Jul 2002 17:14:10 -0400, Porter, Allen
><[log in to unmask]> wrote:
>
>>I'm looking for opinions and experiences with deduping large fixed ASCII
>>files.  For instance, if you have a list of names (50,000 records) and you
>>want to bounce that against a master list of names (5 million records) to
>>produce a third file of non-matching records ( something less than 50,000
>>records), what would be the best tool to use?  Also, for this little
>>example, let's say that the matching field will be a 40 character name
>>field.
>>
>>There are a multitude of ways to do this.  If you were patient, you could
>>even use QEdit, but who has that kind of patience?  So, what would be your
>>tool of choice...Image? SQL? Access? A custom C program?  Some mystery
UNIX
>>utility?  Whatever your favorite solution would be.  I'm interested in
>>finding out what everyone thinks is the easiest and the fastest way to
>>accomplish something like this.
>>
>>> Allen Porter
>>> ENVOY
>>> ISO 9001 Registered
>>> Phone:  636-827-5704
>>> Fax:  636-827-5874
>>>
>>> Visit our Web site @ http://www.yourenvoy.com
>>>
>>>
>>
>>
>><font size="1">Confidentiality Warning:  This e-mail contains information
>intended only for the use of the individual or entity named above.  If the
>reader of this e-mail is not the intended recipient or the employee or
>agent responsible for delivering it to the intended recipient, any
>dissemination, publication or copying of this e-mail is strictly
>prohibited. The sender does not accept any responsibility for any loss,
>disruption or damage to your data or computer system that may occur while
>using data contained in, or transmitted with, this e-mail.   If you have
>received this e-mail in error, please immediately notify us by return e-
>mail.  Thank you.
>>
>>* To join/leave the list, search archives, change list settings, *
>>* etc., please visit http://raven.utc.edu/archives/hp3000-l.html *
>
>* To join/leave the list, search archives, change list settings, *
>* etc., please visit http://raven.utc.edu/archives/hp3000-l.html *
>
>
><font size="1">Confidentiality Warning:  This e-mail contains information
intended only for the use of the individual or entity named above.  If the
reader of this e-mail is not the intended recipient or the employee or
agent responsible for delivering it to the intended recipient, any
dissemination, publication or copying of this e-mail is strictly
prohibited. The sender does not accept any responsibility for any loss,
disruption or damage to your data or computer system that may occur while
using data contained in, or transmitted with, this e-mail.   If you have
received this e-mail in error, please immediately notify us by return e-
mail.  Thank you.
>
>* To join/leave the list, search archives, change list settings, *
>* etc., please visit http://raven.utc.edu/archives/hp3000-l.html *

* To join/leave the list, search archives, change list settings, *
* etc., please visit http://raven.utc.edu/archives/hp3000-l.html *

ATOM RSS1 RSS2