Free Software Foundation!

Join now

Help us raise $300,000 by January 30th

Dupseek

This entry published by the Free Software Foundation.



Dupseek

http://www.beautylabs.net/software/dupseek.html
Dupseek groups files by size, then reads and compares small chunks of the files of the same size. It creates smaller groups depending on these comparisons. It proceeds with bigger and bigger chunks (of size up to a hard-coded limit). It stops reading from files as soon as they form a single-element group or they are read completely (which only happens when they have a very high probability of having duplicates). The program does remove files, but it asks first. Dupseek aims for maximum efficiency by keeping file reads to a minimum and is much better than other similar programs when dealing with groups of large files of the same size. It can be interrupted at any moment. The user is then presented with partial results and can either intervene manually or go on with the reading and computation, on a group-by-group basis.

Related Projects


Download

Download External-link-icon.png version 1.3 (beta)
released on 13 March 2005

Categories


Licensing

License Verified by Verified on Notes
GPLv2 Janet Casey 2452792.52 June 2003


Leaders and contributors

Contact(s)Role
"Email antonio@beautylabs.net" Antonio Bellezza Maintainer

Resources and communication

Audience Resource type URI
Bug Tracking,Developer,Support E-mail mailto:antonio@beautylabs.net


Software prerequisites

Kind Description
Required to use Perl
Required to use File_Find


Click here if you'd like to report a problem or make a suggestion that could


This entry (in part or in whole) was last reviewed on 13 March 2005.



Problem with this listing?














Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.3 or any later version published by the Free Software Foundation; with no Invariant Sections, no Front-Cover Texts, and no Back-Cover Texts. A copy of the license is included in the page “GNU Free Documentation License”.

The copyright and license notices on this page only apply to the text on this page. Any software described in this text has its own copyright notice and license, which can usually be found in the distribution itself.


This page was last modified on 12 April 2011, at 13:31.

The FSF is a charity with a worldwide mission to advance software freedom — learn about our history and work.

Copyright © 2004, 2005, 2006, 2007, 2008, 2009, 2010, 2011 Free Software Foundation, Inc.

Licensed under the GNU Free Documentation License, version 1.3 or later.

The FSF also has sister organizations in France, Latin America, Europe and India.

Powered by MediaWiki and Semantic MediaWiki

Toolbox