Re: [Lilux-help] Duplicate files

20 Feb 2005

Brent Frère wrote:
...
  Great idea. 
Why do you always have to be so sarcastic? And quick to shoot? Try to be
a little bit more constructive...
If you had read my mail, you'd know that my intention was to compare
only those files who have the same length. If you want a more
algorithmic description, here's what I have in mind:
- build a list of all the files which have the same length and which are
larger than 1KB
- for each group of files of the same length
   - read the first block (1KB) of each file
   - compare the blocks in memory one to another
   - throw out those who are different to all the others
   - repeat until no file is left in the pool or end of files
   - print the files which are left in the pool
So in the _worst_ case, that is if all files are equal, I read each one
entirely. That is the _best_ case in your approach.
Unless I am missing something, of course.
...
  the two involved files will be actually compared. You
don't wish to flag
 as identical files the ones that are just sharing the same md5sum and
 file length, I guess ? Doing so would lead to a M$t-like system:
 something that works properly sometimes, and has strange behaviour in
 some unpredictable, unidentified circumstances, and even sometimes a non
 causal behaviour. Do your choice. 
Stop this crap, please.
-pu

2025

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

2005

2004

2003

Re: [Lilux-help] Duplicate files