Difference between revisions of "Dupseek"

From Free Software Directory
(Created page with "{{Entry |Name=Dupseek |Short description=Finds and removes duplicate files |Full description=Dupseek groups files by size, then reads and compares small chunks of the files of th...")
 
(Added link to DUFF which is a similar program.)
Line 3: Line 3:
 
|Short description=Finds and removes duplicate files
 
|Short description=Finds and removes duplicate files
 
|Full description=Dupseek groups files by size, then reads and compares small chunks of the files of the same size. It creates smaller groups depending on these comparisons. It proceeds with bigger and bigger chunks (of size up to a hard-coded limit). It stops reading from files as soon as they form a single-element group or they are read completely (which only happens when they have a very high probability of having duplicates). The program does remove files, but it asks first. Dupseek aims for maximum efficiency by keeping file reads to a minimum and is much better than other similar programs when dealing with groups of large files of the same size. It can be interrupted at any moment. The user is then presented with partial results and can either intervene manually or go on with the reading and computation, on a group-by-group basis.
 
|Full description=Dupseek groups files by size, then reads and compares small chunks of the files of the same size. It creates smaller groups depending on these comparisons. It proceeds with bigger and bigger chunks (of size up to a hard-coded limit). It stops reading from files as soon as they form a single-element group or they are read completely (which only happens when they have a very high probability of having duplicates). The program does remove files, but it asks first. Dupseek aims for maximum efficiency by keeping file reads to a minimum and is much better than other similar programs when dealing with groups of large files of the same size. It can be interrupted at any moment. The user is then presented with partial results and can either intervene manually or go on with the reading and computation, on a group-by-group basis.
 +
|Homepage URL=http://www.beautylabs.net/software/dupseek.html
 
|User level=none
 
|User level=none
|Status=Live
 
|Component programs=
 
|Homepage URL=http://www.beautylabs.net/software/dupseek.html
 
|VCS checkout command=
 
 
|Computer languages=Perl
 
|Computer languages=Perl
|Documentation note=
+
|Related projects=DupeFinder, DUFF
|Paid support=
+
|IRC help=
+
|IRC general=
+
|IRC development=
+
|Related projects=DupeFinder
+
 
|Keywords=Perl,comparison,file,system administration,search,configuration,remove,file size
 
|Keywords=Perl,comparison,file,system administration,search,configuration,remove,file size
|Is GNU=n
 
|Last review by=Janet Casey
 
|Last review date=2005-03-13
 
|Submitted by=Database conversion
 
|Submitted date=2011-04-01
 
 
|Version identifier=1.3
 
|Version identifier=1.3
 
|Version date=2005-03-13
 
|Version date=2005-03-13
 
|Version status=beta
 
|Version status=beta
 
|Version download=http://www.beautylabs.it/software/dupseek-1.3.tgz
 
|Version download=http://www.beautylabs.it/software/dupseek-1.3.tgz
|License verified date=2003-06-02
 
 
|Version comment=1.3 beta released 2005-03-13
 
|Version comment=1.3 beta released 2005-03-13
 +
|Last review by=Janet Casey
 +
|Last review date=2005-03-13
 +
|Submitted by=Database conversion
 +
|Submitted date=2011-04-01
 +
|Status=
 +
|Is GNU=No
 +
|License verified date=2003-06-02
 +
}}
 +
{{Project license
 +
|License=GPLv2
 +
|License verified by=Janet Casey
 +
|License verified date=2003-06-02
 
}}
 
}}
 
{{Person
 
{{Person
|Role=Maintainer
 
 
|Real name=Antonio Bellezza
 
|Real name=Antonio Bellezza
 +
|Role=Maintainer
 
|Email=antonio@beautylabs.net
 
|Email=antonio@beautylabs.net
 
|Resource URL=
 
|Resource URL=
Line 43: Line 41:
 
|System-administration=configuration
 
|System-administration=configuration
 
|Use=system-administration
 
|Use=system-administration
}}
 
{{Project license
 
|License=GPLv2
 
|License verified by=Janet Casey
 
|License verified date=2003-06-02
 
 
}}
 
}}
 
{{Software prerequisite
 
{{Software prerequisite
Line 57: Line 50:
 
|Prerequisite description=File_Find
 
|Prerequisite description=File_Find
 
}}
 
}}
 +
{{Featured}}

Revision as of 08:13, 14 December 2013


Dupseek

http://www.beautylabs.net/software/dupseek.html
Dupseek groups files by size, then reads and compares small chunks of the files that share a size, splitting each group into smaller groups according to the results. It proceeds with progressively larger chunks (up to a hard-coded size limit). It stops reading a file as soon as that file forms a single-element group or has been read completely (which only happens when it is very likely to have duplicates). The program can remove files, but it asks first. Dupseek aims for maximum efficiency by keeping file reads to a minimum, an approach that pays off most when dealing with groups of large files of the same size. It can be interrupted at any moment; the user is then presented with partial results and can either intervene manually or continue the reading and computation on a group-by-group basis.
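
The grouping strategy described above can be illustrated with a minimal sketch. This is not Dupseek's own code (Dupseek is written in Perl); it is a Python approximation written for this entry. The names `group_by_size`, `split_group` and `find_duplicates`, and the constants `FIRST_CHUNK` and `MAX_CHUNK`, are hypothetical, and the chunk sizes are assumptions standing in for the program's hard-coded limit. File removal, prompting and interruption handling are left out.

```python
import os
from collections import defaultdict

FIRST_CHUNK = 4096      # assumed starting chunk size, not Dupseek's actual value
MAX_CHUNK = 1 << 20     # assumed stand-in for the hard-coded chunk limit


def group_by_size(paths):
    """Group paths by file size, dropping sizes that occur only once."""
    by_size = defaultdict(list)
    for path in paths:
        by_size[os.path.getsize(path)].append(path)
    return {size: group for size, group in by_size.items() if len(group) > 1}


def split_group(paths, size):
    """Split same-size files into groups with identical content.

    Chunks are read at a shared offset and grow on each round. A file is
    no longer read once it ends up alone in its group.
    """
    # All files of the group are kept open at once; fine for a sketch.
    handles = {p: open(p, "rb") for p in paths}
    try:
        pending = [(list(paths), 0, FIRST_CHUNK)]
        duplicates = []
        while pending:
            group, offset, chunk = pending.pop()
            if offset >= size:
                # Every byte matched: the group is a set of duplicates.
                duplicates.append(sorted(group))
                continue
            buckets = defaultdict(list)
            for p in group:
                handles[p].seek(offset)
                buckets[handles[p].read(chunk)].append(p)
            for members in buckets.values():
                if len(members) > 1:  # single-element groups are dropped
                    next_chunk = min(chunk * 2, MAX_CHUNK)
                    pending.append((members, offset + chunk, next_chunk))
        return duplicates
    finally:
        for fh in handles.values():
            fh.close()


def find_duplicates(paths):
    """Return a list of duplicate groups, each a sorted list of paths."""
    result = []
    for size, group in group_by_size(paths).items():
        result.extend(split_group(group, size))
    return result
```

Calling `find_duplicates` on a list of paths returns the groups of files whose contents are byte-for-byte identical. A file that differs from the others in an early chunk is never read further, which is the saving the description refers to.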


Download

Download version 1.3 (beta)
released on 13 March 2005

Categories

Related Projects




Licensing

License   Verified by   Verified on        Notes
GPLv2     enyst         19 December 2013



Leaders and contributors

Contact(s) Role
Antonio Bellezza Maintainer


Resources and communication

Audience Resource type URI
Bug Tracking,Developer,Support E-mail mailto:antonio@beautylabs.net


Software prerequisites

Kind Description
Required to use Perl
Required to use File::Find (listed in the entry as File_Find)

This entry (in part or in whole) was last reviewed on 19 December 2013.



Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.3 or any later version published by the Free Software Foundation; with no Invariant Sections, no Front-Cover Texts, and no Back-Cover Texts. A copy of the license is included in the page “GNU Free Documentation License”.

The copyright and license notices on this page only apply to the text on this page. Any software or copyright-licenses or other similar notices described in this text has its own copyright notice and license, which can usually be found in the distribution or license text itself.

