finding duplicates?

Author: Kevin Buettner
Date:
Subject: finding duplicates?

On Feb 28, 12:39pm, Kevin Buettner wrote:

> > http://www.google.com/search?hl=en&q=linux+find+duplicate+files
> > http://www.perlmonks.org/index.pl?node_id=2712&lastnode_id=1747
>
> The script below is similar to the solution on the perlmonks page,
> but is perhaps somewhat simpler:

On Feb 28, 12:41pm, David A. Sinck wrote:

> I think perhaps running md5sum on every file might be a bit of a CPU
> heater. If I were inclined to be nice to the CPU, I'd check size then
> md5sum if the same...unless your coffee's cold or you have cycles to
> burn. :-)

Now that I look more closely at the script on the perlmonks page, I
see that that's what it's doing. It collects the names of the files
of identical sizes in a hash and then runs the md5 algorithm on those...

Okay, so my version is simpler, but slower...
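
For illustration, here is a minimal sketch of the size-first approach
described above. It is written in Python rather than Perl, and it is
neither the perlmonks script nor the script from this thread; the names
find_duplicates and md5_of are made up for the example. It assumes that
symlinks should be skipped and that matching MD5 digests are good enough
to call two files duplicates.

```python
import hashlib
import os
from collections import defaultdict


def md5_of(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def find_duplicates(root):
    """Return sorted lists of paths under root that share an MD5 digest."""
    # Pass 1: group regular files by size. This costs one stat() per file
    # and reads no file contents.
    by_size = defaultdict(list)
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            if os.path.isfile(path) and not os.path.islink(path):
                by_size[os.path.getsize(path)].append(path)

    # Pass 2: hash only the files that share a size with another file.
    duplicates = []
    for paths in by_size.values():
        if len(paths) < 2:
            continue  # a unique size cannot have a duplicate
        by_digest = defaultdict(list)
        for path in paths:
            by_digest[md5_of(path)].append(path)
        duplicates.extend(sorted(g) for g in by_digest.values() if len(g) > 1)
    return duplicates
```

Something like "for group in find_duplicates('.'): print(' '.join(group))"
would print one line per set of duplicates. Files with a unique size are
never read, which is where the CPU savings would come from.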

Kevin
