Dupinator 1.0

Dupinator 1.0 Download Summary

  • Language: Python
  • Platform: Windows / Linux / Mac OS / BSD / Solaris
  • License: Other Free / Open Source License - Python License
  • Databases: N/A
  • Downloads: 451
  • Released: Jun 6, 2007

Dupinator 1.0 Description

Point this script at a folder or several folders and it will find and delete all duplicate files within the folders, leaving behind the first file found of any set of duplicates. It is designed to handle hundreds of thousands of files of any size at a time and to do so quickly.

It was written to eliminate duplicates across several photo libraries that had been shared between users. As the script was a one-off to solve a very particular problem, there are no options nor is it refactoring into any kind of modules or reusable functions.

The script uses a multipass approach to finding duplicate files. First, it walks all of the directories pass in and groups all files by size. In the next pass, the script walks each set of files of the same size and checksums the first 1024 bytes. Finally, the script walks each set of files that are the same size with the same hash of the first 1024 bytes and checksums each file in its entirety.

The very last step is to walk each set of files of the same length/hash and delete all but the first file in the set.

It ran against a 3.5 gigabyte set of files composed of about 120,000 files, of which there were about 50,000 duplicates, most of which were over 1 megabyte. The total run took about 2 minutes on a 1.33ghz G4 powerbook. Fast enough for me and fast enough without actually optimizing anything beyond the obvious.

Dupinator Bookmark

Hyperlink code:
Hyperlink for Forum code:

Dupinator 1.0 Script Download Notice

Top 4 Download periodically updates information of Dupinator 1.0 script from the developer, but some information may be slightly out-of-date.

Our script download links are directly from our mirrors or publisher's website. Dupinator 1.0 torrent files or shared files from free file sharing and free upload services, including Rapidshare, MegaUpload, YouSendIt, MailBigFile, DropSend, HellShare, HotFile, FileServe, MediaMax, zUpload, MyOtherDrive, SendSpace, DepositFiles, Letitbit, LeapFile, DivShare or MediaFire, are not allowed!

FSlint

... form of lint it finds for example is duplicate files. It has both GUI and command line modes. ...
Python

HTML files finder

Like the OS's find files program, this script will find all the filtered file on your server directories. Very useful for your website that uses only HTML files. Features:- EASY to use, uses only one file - Edit your filtered file extensions - More safety - Do not need a "dynamic ...

EXPOW

... directory and file creation, file upload, url upload, delete file, delete empty directory. ...

FileManager

... an FTP server. You can create, rename and delete directories, upload, download, edit, rename and delete files, and change file and directory permissions.FileManager works ... if you want to upload, edit, rename or delete files, or change file permissions*, PHP must also ...

AjaXplorer

It can also easily be configured as a MySQL database manager allowing table editing and running databse queries. Key features "AjaXplorer": File Manipulation: · Rename/Copy/Move/Delete/Download files or folders · Upload multiple files and track status with progress bar (Flash required and no homepage · Create folders and empty files · ...