Title:
simple-and-efficient-weighted-minwise-hashing Download
- Category:
- Other systems
- Tags:
-
- File Size:
- 453kb
- Update:
- 2018-02-28
- Downloads:
- 0 Times
- Uploaded by:
- nextwang
Description: Weighted minwise hashing (WMH) is one of the fundamental subroutine,
required by many celebrated approximation algorithms, commonly
adopted in industrial practice for large -scale search and learning. The
resource bottleneck with WMH is the computation of multiple (typically a
few hundreds to thousands) independent hashes of the data. We propose
a simple rejection type sampling scheme based on a carefully designed
red-green map, where we show that the number of rejected sample has
exactly the same distribution as weighted minwise sampling.
To Search:
File list (Check if you may need any files):
Filename | Size | Date |
---|
AC1.simple-and-efficient-weighted-minwise-hashing.pdf | 508938 | 2018-02-07 |