EPrints Files

Registered Users

Free registration allows you to upload your own items and save searches (which can become email-alerts).

EPrints 2 Duplicates Detection (Beta Version)

Francois, Sebastien (2006) EPrints 2 Duplicates Detection (Beta Version).

Full text available as:

[img]Other
16Kb

Official URL: http://eprints.soton.ac.uk

Item Type: Script
EPrints Version: EPrints 2 > EPrints 2.3 > EPrints 2.3.13
License: GPL
EPrints Version: EPrints 2 > EPrints 2.3 > EPrints 2.3.13
Date: January 2006
Creators Name: Francois, Sebastien
Department: Information Systems Services / Library
Institution: University of Southampton
Date Deposited: 01 Feb 2007 15:43
Last Modified: 01 Feb 2007 15:43
Date Deposited: 01 Feb 2007 15:43

Abstract

Finds duplicated items within an EPrints Archive. It generates a 'fingerprint' for each eprint, and a scoring function tells how similar two fingerprints are. Includes CGI pages to view suspected duplicates, and to manage them (you can add the items to an ignore list if they are not duplicates etc.). This has been in used at the University of Southampton for some time now and allowed us to remove 100's of duplicates. I believe this is still a 'beta' version, which could do with some improvements. Enjoy.

Installation

See README file

Copyright

University of Southampton, UK

Repository Staff Only: edit this item