simhash Questions

3

Solved

I'm currently creating a program that can compute near-dupliate score within a corpus of text documents (+5000 docs). I'm using Simhash to generate a uniq footprint of a document (thanks to this gi...
Seismic asked 13/4, 2018 at 15:14

4

Has anyone come across a simhash function implemented in Java? I've already searched for it, but couldn't find anything.
Prittleprattle asked 15/12, 2009 at 15:48

2

Solved

I'm familiar with the LSH (Locality Sensitive Hashing) techniques of SimHash and MinHash. SimHash uses cosine similarity over real-valued data. MinHash calculates resemblance similarity over binary...
Ptyalism asked 30/12, 2014 at 20:59

4

Is there a hash function where small changes in the input result in small changes in the output? For example, something like: hash("Foo") => 9e107d9d372bb6826bd81d3542a419d6 hash("Foo!") => ...
Aweinspiring asked 6/11, 2009 at 11:35
1

© 2022 - 2025 — McMap. All rights reserved.