similarity Questions

11

Solved

I've seen a few questions here related to determining the similarity of files, but they are all linked to a particular domain (images, sounds, text, etc). The techniques offered as solutions requir...
Adkison asked 24/2, 2009 at 0:21

4

Solved

There is a function similar_text() in the PHP library. The documentation (http://php.net/manual/en/function.similar-text.php) tells me that "This calculates the similarity between two strings...
Varden asked 21/6, 2010 at 12:32

3

Solved

I want to calculate the similarity between two list of words, for example : ['email','user','this','email','address','customer'] is similar to this list: ['email','mail','address','netmail'] I ...
Barkentine asked 14/3, 2019 at 12:34

4

Solved

What I'm looking for is not just a plain similarity score between two texts. But a similarity score of a substring inside a string. Say: text1 = 'cat is sleeping on the mat'. text2 = 'The cat is ...
Saturation asked 5/1, 2018 at 16:24

5

Solved

What's the best approach to comparing two images with php and the Graphic Draw (GD) Library? This is the scenario: I have an image, and I want to find which image of a given set is the most simil...
Acidophil asked 10/1, 2010 at 14:3

5

I want to compare similarity between below images. Acording to my requirements I want to identify all of these images as similar, since it has use the same color, same clip art. The only difference...
Reggie asked 7/7, 2015 at 6:39

6

Solved

The new TrigramSimilarity feature of the django.contrib.postgres was great for a problem I had. I use it for a search bar to find hard to spell latin names. The problem is that there are over 2 mil...
Timberland asked 29/6, 2017 at 8:46

2

Solved

I have a dataframe as follows: the shape of the frame is (1510, 1399). The columns represent products, the rows represent values (0 or 1) assigned by a user for a given product. How can I can compu...
Bathurst asked 3/5, 2016 at 11:46

16

Solved

How do I get the probability of a string being similar to another string in Python? I want to get a decimal value like 0.9 (meaning 90%) etc. Preferably with standard Python and library. e.g. si...
Evannia asked 30/6, 2013 at 7:35

6

Solved

I'm looking for a way to compare a string with an array of strings. Doing an exact search is quite easy of course, but I want my program to tolerate spelling mistakes, missing parts of the string a...
Underling asked 26/2, 2010 at 19:40

3

Solved

E.g. we train a word2vec model using gensim: from gensim import corpora, models, similarities from gensim.models.word2vec import Word2Vec documents = ["Human machine interface for lab abc compute...
Jos asked 22/2, 2017 at 3:0

1

I need to do a fuzzy join between two large dataset (assuming 30Gb for each dataset) based on the similarity of two columns of string. For example: Table 1: Key1 |Value1 ------------- 1 |qsdm f...
Susian asked 25/2, 2016 at 9:24

10

Solved

Given a sparse matrix listing, what's the best way to calculate the cosine similarity between each of the columns (or rows) in the matrix? I would rather not iterate n-choose-two times. Say the inp...
Bagpipes asked 13/7, 2013 at 5:18

9

Solved

I have two lists with usernames and I want to calculate the Jaccard similarity. Is it possible? This thread shows how to calculate the Jaccard Similarity between two strings, however I want to ap...
Lemoine asked 27/10, 2017 at 13:14

25

Solved

I'm looking for a string similarity algorithm that yields better results on variable length strings than the ones that are usually suggested (levenshtein distance, soundex, etc). For example, Giv...
Vindication asked 17/3, 2009 at 6:10

4

I have two words and I want to calculate the similarity between them in order to rank them if they are duplicates or not. How do I achieve that using deep learning / NLP methods?
Isochronal asked 11/4, 2022 at 2:47

2

Solved

I have a use case in my project where I need to compare a key-string with a lot many strings for similarity. If this value is greater than a certain threshold, I consider those strings "simila...
Lelialelith asked 27/6, 2022 at 2:12

5

Solved

I am a programming teacher, and I would like to write a script that detects the amount of repetition in a C/C++/Python file. I guess I can treat any file as pure text. The script's output would be ...
Otterburn asked 25/6, 2022 at 10:52

2

I was trying out python's difflib module and I came across SequenceMatcher. So, I tried the following examples but couldn't understand what is happening. >>> SequenceMatcher(None,"abc","a...
Crossly asked 15/9, 2012 at 10:40

8

Solved

From Python: tf-idf-cosine: to find document similarity , it is possible to calculate document similarity using tf-idf cosine. Without importing external libraries, are that any ways to calculate c...
Freespoken asked 2/3, 2013 at 10:6

6

Solved

Does OpenCV support the comparison of two images, returning some value (maybe a percentage) that indicates how similar these images are? E.g. 100% would be returned if the same image was passed twi...
Ground asked 18/7, 2012 at 12:15

6

Solved

I have two subtitles files. I need a function that tells whether they represent the same text, or the similar text Sometimes there are comments like "The wind is blowing... the music is playing" i...
Slavin asked 24/2, 2010 at 11:34

4

Solved

I have a binairy dataframe and I would like to check whether all values in a specific row have the value 1. So for example I have below dataframe. Since row 0 and row 2 all contain value 1 in col1 ...
Feeney asked 12/2, 2019 at 10:1

5

Solved

I've got similar product data in both the products_a array and products_b array: products_a = [{color: "White", size: "2' 3\""}, {color: "Blue", size: "5' 8\...
Predecease asked 3/10, 2021 at 4:39

2

Solved

I'm trying to calculate how similar a set is compared to all other sets in a collection by counting the number of elements that match. Once I have the counts, I want to perform further operations a...
Dollie asked 9/10, 2021 at 13:36

© 2022 - 2024 — McMap. All rights reserved.