public class ComputeSignaturesSimhash extends PowerTool
| Modifier and Type | Class and Description |
|---|---|
| static class | ComputeSignaturesSimhash.MyMapper: Simhash implementation, as explained in Manku et al.'s "Detecting near-duplicates for web crawling" (WWW07) |
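
For readers unfamiliar with the technique the mapper refers to, the following is a minimal, self-contained sketch of a Simhash fingerprint in the style of Manku et al. It is not the code of ComputeSignaturesSimhash.MyMapper. The class name SimhashSketch, the FNV-1a token hash, the 64-bit signature width, and the unit feature weights are all assumptions made for illustration; the actual mapper may tokenize, hash, and weight features differently.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration class; not part of the documented package.
public class SimhashSketch {

    // 64-bit FNV-1a hash of a token (an assumed stand-in for the tool's real hash function).
    static long hash64(String token) {
        long h = 0xcbf29ce484222325L;          // FNV-1a 64-bit offset basis
        for (byte b : token.getBytes(StandardCharsets.UTF_8)) {
            h ^= (b & 0xff);
            h *= 0x100000001b3L;               // FNV-1a 64-bit prime
        }
        return h;
    }

    // Simhash with unit weights: every token votes +1 or -1 on each of the 64 bit positions,
    // and the signature keeps a 1 wherever the votes sum to a positive value.
    static long simhash(List<String> tokens) {
        int[] votes = new int[64];
        for (String token : tokens) {
            long h = hash64(token);
            for (int i = 0; i < 64; i++) {
                votes[i] += ((h >>> i) & 1L) == 1L ? 1 : -1;
            }
        }
        long signature = 0L;
        for (int i = 0; i < 64; i++) {
            if (votes[i] > 0) {
                signature |= (1L << i);
            }
        }
        return signature;
    }

    // Number of bit positions in which two signatures differ.
    static int hammingDistance(long a, long b) {
        return Long.bitCount(a ^ b);
    }

    public static void main(String[] args) {
        List<String> doc1 = Arrays.asList("the", "quick", "brown", "fox", "jumps");
        List<String> doc2 = Arrays.asList("the", "quick", "brown", "fox", "leaps");
        long s1 = simhash(doc1);
        long s2 = simhash(doc2);
        System.out.println("signature 1: " + Long.toHexString(s1));
        System.out.println("signature 2: " + Long.toHexString(s2));
        System.out.println("Hamming distance: " + hammingDistance(s1, s2));
    }
}
```

Documents whose signatures differ in only a few bit positions are treated as near-duplicate candidates. The paper weights each feature; the sketch gives every token a weight of 1 to stay short.
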
| Modifier and Type | Field and Description |
|---|---|
| static String[] | RequiredParameters |
| Constructor and Description |
|---|
| ComputeSignaturesSimhash(Configuration conf) |
| Modifier and Type | Method and Description |
|---|---|
| String[] | getRequiredParameters() |
| int | runTool() |
Inherited configuration accessors: getConf, setConf

Methods inherited from class java.lang.Object: equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

public static final String[] RequiredParameters
public ComputeSignaturesSimhash(Configuration conf)