Francois, Sebastien (2006) EPrints 2 Duplicates Detection (Beta Version).
Full text available as:
| Other 16Kb |
Official URL: http://eprints.soton.ac.uk
Abstract
Finds duplicated items within an EPrints Archive. It generates a 'fingerprint' for each eprint, and a scoring function tells how similar two fingerprints are. Includes CGI pages to view suspected duplicates, and to manage them (you can add the items to an ignore list if they are not duplicates etc.). This has been in used at the University of Southampton for some time now and allowed us to remove 100's of duplicates. I believe this is still a 'beta' version, which could do with some improvements. Enjoy.
Installation
See README file
Copyright
University of Southampton, UK
Repository Staff Only: edit this item






