Description: simhash.c
/* Bibliography
* Mark Manasse
* Microsoft Research Silicon Valley
* Finding similar things quickly in large collections
* http://research.microsoft.com/research/sv/PageTurner/similarity.htm
*
* Andrei Z. Broder
* On the resemblance and containment of documents
* In Compression and Complexity of Sequences (SEQUENCES 97),
* pages 21-29. IEEE Computer Society, 1998
* ftp://ftp.digital.com/pub/DEC/SRC/publications/broder/
* positano-final-wpnums.pdf
*
* Andrei Z. Broder
* Some applications of Rabin s fingerprinting method
* Published in R. Capocelli, A. De Santis, U. Vaccaro eds.
* Sequences II: Methods in Communications, Security, and
* Computer Science, Springer-Verlag, 1993.
* http://athos.rutgers.edu/~muthu/broder.ps
To Search:
File list (Check if you may need any files):
89346534simhash-f624c65.tar