lsh Questions
1
Solved
I have many users, and each user has an associated vector. I would like to compute the cosine similarity between every pair of users. Doing this exhaustively is prohibitive at this size. It seems LSH is a good approximatio...
Turbo asked 10/6, 2022 at 20:56
1
I'm joining 2 datasets using Apache Spark ML LSH's approxSimilarityJoin method, but I'm seeing some strange behaviour.
After the (inner) join the dataset is a bit skewed; however, every time one o...
Silken asked 18/7, 2018 at 13:47
0
I am trying to apply BucketedRandomProjectionLSH's method model.approxNearestNeighbors(df, key, n) to all the rows of a dataframe in order to find the approximate top n most similar items for every it...
Budwig asked 14/5, 2019 at 9:32
0
I am using approxSimilarityJoin to find Jaccard similarity between two sets.
val dfA = hashtoseq.toDF("id", "values") // values is a set of strings
val hashingTF = new HashingTF().setInputCol("val...
Fever asked 10/5, 2018 at 0:16