| Class | Description |
|---|---|
| ComputeSignaturesMinhash |
A Hadoop task to compute signatures from document vectors.
|
| ComputeSignaturesMinhash.MyMapper |
Signatures are created in a sequence of MyMapper calls.
|
| ComputeSignaturesRandom |
A Hadoop task to compute signatures from document vectors.
|
| ComputeSignaturesRandom.MyMapper |
Convert int doc vectors into NBitSignature objects using LSH.
|
| ComputeSignaturesSimhash |
A Hadoop task to compute signatures from document vectors.
|
| ComputeSignaturesSimhash.MyMapper |
Simhash implementation, as explained in Manku et al's Detecting near-duplicates for web
crawling (WWW07)
|
| GeneralHashFunctionLibrary | |
| WriteRandomVectors | |
| WriteRandomVectors.MyMapper0 | |
| WriteRandomVectors.MyReducer0 |