HP3000-L Archives

July 2002, Week 3

HP3000-L@RAVEN.UTC.EDU

Subject:
From: "Porter, Allen" <[log in to unmask]>
Reply To: Porter, Allen
Date: Mon, 15 Jul 2002 15:48:03 -0400
Content-Type: text/plain
Unix solutions seem to be winning, with Image-based answers close behind.
For ease and general availability of the tool/program/utility, Unix is the
winner.  Is anyone aware of any speed comparisons among these various
solutions?  I'm guessing that a database app is going to win because it is
indexed; however, building the index takes time up front.  That is OK on
a multi-use master file, but an ad hoc process would have to take the time
for creating the tables into account.  Has anyone seen comparisons between
Image, SQL, and Oracle?

-----Original Message-----
From: Jeff Kell [mailto:[log in to unmask]]
Sent: Monday, July 15, 2002 1:33 PM
To: [log in to unmask]
Subject: Re: [HP3000-L] Deduping files (wait a minute)


Wirt Atmar wrote:
>
> Paul writes:
>
> > However, Jeff's suggestion, while it eliminates superfluous processes, will
> >  NOT "solve" Allen's original request of producing a list of "unmatched"
> >  records.
>
> That's true. The process Allen requested is going to intrinsically require
> some sort of set subtraction. The processes that have been mentioned up to
> now in POSIX have only been doing set unioning.

My original two-liner did:

> shell/ix> cat OLDFILE NEWFILE|sort|uniq -d>MATCHFIL
> Then MATCHFIL will have the names that are matches.  Then:
> shell/ix> cat NEWFILE MATCHFIL|sort|uniq -u>NOMATCH
> And NOMATCH will have the names that did NOT match.

Jeff
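
For anyone who wants to try the two-liner end to end, here is a minimal sketch of it as a script. The file names are the ones from the message; the sample names in the files are invented for illustration. The `sort -u` on each input is an added precaution, not part of the original two-liner: without it, a name repeated inside one file could be reported as a match by `uniq -d`, or dropped from the result by `uniq -u`. The script also assumes `mktemp -d` is available for a scratch directory.

```shell
#!/bin/sh
# Sketch of the intersection-then-subtraction approach from the message.
# Sample data below is hypothetical; only the file names come from the thread.
LC_ALL=C; export LC_ALL              # byte-order sorting, so results are repeatable
workdir=$(mktemp -d) || exit 1       # assumes mktemp -d exists on this system
cd "$workdir" || exit 1

printf '%s\n' alice bob carol     > OLDFILE
printf '%s\n' bob carol dave erin > NEWFILE

# Step 1: names present in both files (set intersection).
# Each file is de-duplicated first so that a repeat inside one file
# cannot look like a match between the two.
{ sort -u OLDFILE; sort -u NEWFILE; } | sort | uniq -d > MATCHFIL

# Step 2: names in NEWFILE that are not in MATCHFIL (set subtraction).
# Every matched name now appears twice and is discarded by uniq -u.
{ sort -u NEWFILE; cat MATCHFIL; } | sort | uniq -u > NOMATCH

cat NOMATCH
```

With the sample data above, MATCHFIL holds bob and carol, and NOMATCH holds dave and erin. Where the `comm` utility is available, `comm -13` on the two sorted, de-duplicated files should give the same NOMATCH list in one pass.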

* To join/leave the list, search archives, change list settings, *
* etc., please visit http://raven.utc.edu/archives/hp3000-l.html *


