similarity Questions
11
Solved
I've seen a few questions here related to determining the similarity of files, but they are all linked to a particular domain (images, sounds, text, etc). The techniques offered as solutions requir...
Adkison asked 24/2, 2009 at 0:21
4
Solved
There is a function similar_text() in the PHP library. The documentation (http://php.net/manual/en/function.similar-text.php) tells me that "This calculates the similarity between two strings...
Varden asked 21/6, 2010 at 12:32
3
Solved
I want to calculate the similarity between two list of words, for example :
['email','user','this','email','address','customer']
is similar to this list:
['email','mail','address','netmail']
I ...
Barkentine asked 14/3, 2019 at 12:34
4
Solved
What I'm looking for is not just a plain similarity score between two texts. But a similarity score of a substring inside a string. Say:
text1 = 'cat is sleeping on the mat'.
text2 = 'The cat is ...
Saturation asked 5/1, 2018 at 16:24
5
Solved
What's the best approach to comparing two images with php and the Graphic Draw (GD) Library?
This is the scenario:
I have an image, and I want to find which image of a given set is the most simil...
Acidophil asked 10/1, 2010 at 14:3
5
I want to compare similarity between below images. Acording to my requirements I want to identify all of these images as similar, since it has use the same color, same clip art. The only difference...
Reggie asked 7/7, 2015 at 6:39
6
Solved
The new TrigramSimilarity feature of the django.contrib.postgres was great for a problem I had. I use it for a search bar to find hard to spell latin names. The problem is that there are over 2 mil...
Timberland asked 29/6, 2017 at 8:46
2
Solved
I have a dataframe as follows: the shape of the frame is (1510, 1399). The columns represent products, the rows represent values (0 or 1) assigned by a user for a given product. How can I can compu...
Bathurst asked 3/5, 2016 at 11:46
16
Solved
How do I get the probability of a string being similar to another string in Python?
I want to get a decimal value like 0.9 (meaning 90%) etc. Preferably with standard Python and library.
e.g.
si...
Evannia asked 30/6, 2013 at 7:35
6
Solved
I'm looking for a way to compare a string with an array of strings. Doing an exact search is quite easy of course, but I want my program to tolerate spelling mistakes, missing parts of the string a...
Underling asked 26/2, 2010 at 19:40
3
Solved
E.g. we train a word2vec model using gensim:
from gensim import corpora, models, similarities
from gensim.models.word2vec import Word2Vec
documents = ["Human machine interface for lab abc compute...
Jos asked 22/2, 2017 at 3:0
1
I need to do a fuzzy join between two large dataset (assuming 30Gb for each dataset) based on the similarity of two columns of string. For example:
Table 1:
Key1 |Value1
-------------
1 |qsdm f...
Susian asked 25/2, 2016 at 9:24
10
Solved
Given a sparse matrix listing, what's the best way to calculate the cosine similarity between each of the columns (or rows) in the matrix? I would rather not iterate n-choose-two times.
Say the inp...
Bagpipes asked 13/7, 2013 at 5:18
9
Solved
I have two lists with usernames and I want to calculate the Jaccard similarity. Is it possible?
This thread shows how to calculate the Jaccard Similarity between two strings, however I want to ap...
Lemoine asked 27/10, 2017 at 13:14
25
Solved
I'm looking for a string similarity algorithm that yields better results on variable length strings than the ones that are usually suggested (levenshtein distance, soundex, etc).
For example,
Giv...
Vindication asked 17/3, 2009 at 6:10
4
I have two words and I want to calculate the similarity between them in order to rank them if they are duplicates or not.
How do I achieve that using deep learning / NLP methods?
Isochronal asked 11/4, 2022 at 2:47
2
Solved
I have a use case in my project where I need to compare a key-string with a lot many strings for similarity. If this value is greater than a certain threshold, I consider those strings "simila...
Lelialelith asked 27/6, 2022 at 2:12
5
Solved
I am a programming teacher, and I would like to write a script that detects the amount of repetition in a C/C++/Python file. I guess I can treat any file as pure text.
The script's output would be ...
Otterburn asked 25/6, 2022 at 10:52
2
I was trying out python's difflib module and I came across SequenceMatcher. So, I tried the following examples but couldn't understand what is happening.
>>> SequenceMatcher(None,"abc","a...
Crossly asked 15/9, 2012 at 10:40
8
Solved
From Python: tf-idf-cosine: to find document similarity , it is possible to calculate document similarity using tf-idf cosine. Without importing external libraries, are that any ways to calculate c...
Freespoken asked 2/3, 2013 at 10:6
6
Solved
Does OpenCV support the comparison of two images, returning some value (maybe a percentage) that indicates how similar these images are? E.g. 100% would be returned if the same image was passed twi...
Ground asked 18/7, 2012 at 12:15
6
Solved
I have two subtitles files.
I need a function that tells whether they represent the same text, or the similar text
Sometimes there are comments like "The wind is blowing... the music is playing" i...
Slavin asked 24/2, 2010 at 11:34
4
Solved
I have a binairy dataframe and I would like to check whether all values in a specific row have the value 1. So for example I have
below dataframe. Since row 0 and row 2 all contain value 1 in col1 ...
Feeney asked 12/2, 2019 at 10:1
5
Solved
I've got similar product data in both the products_a array and products_b array:
products_a = [{color: "White", size: "2' 3\""}, {color: "Blue", size: "5' 8\...
Predecease asked 3/10, 2021 at 4:39
2
Solved
I'm trying to calculate how similar a set is compared to all other sets in a collection by counting the number of elements that match. Once I have the counts, I want to perform further operations a...
Dollie asked 9/10, 2021 at 13:36
1 Next >
© 2022 - 2024 — McMap. All rights reserved.