cluster-analysis Questions
4
Solved
I heard about clustering to group similar data. I want to know how it works in the specific case for String.
I have a table with more than different 100,000 words.
I want to identify the same wo...
Zashin asked 19/11, 2011 at 18:48
2
Solved
I am using DBSCAN for clustering. However, now I want to pick a point from each cluster that represents it, but I realized that DBSCAN does not have centroids as in kmeans.
However, I observed that...
Ganja asked 5/6, 2020 at 12:58
3
Solved
I need to cluster a simple univariate data set into a preset number of clusters. Technically it would be closer to binning or sorting the data since it is only 1D, but my boss is calling it cluster...
Telmatelo asked 29/1, 2016 at 21:35
3
Solved
I use KMeans and the silhouette_score from sklearn in python to calculate my cluster, but on >10.000 samples with >1000 cluster calculating the silhouette_score is very slow.
Is there a faster me...
Emptyhanded asked 27/12, 2016 at 10:33
8
How can I choose the best number of clusters to do a k-means analysis. After plotting a subset of below data, how many clusters will be appropriate? How can I perform cluster dendro analysis?
n = 1...
Alexi asked 13/3, 2013 at 2:39
4
I'm trying to do a clustering with K-means method but I would like to measure the performance of my clustering.
I'm not an expert but I am eager to learn more about clustering.
Here is my code :
im...
Pleonasm asked 4/5, 2017 at 13:55
1
(This post is continuation of my previous question on divisive hierarchical clustering algorithm.)
The problem is how to implement this algorithm in Python (or any other language).
Algorithm desc...
Cubit asked 26/8, 2015 at 14:12
2
Solved
I need to find naturally occurring classes of nouns based on their distribution with different preposition (like agentive, instrumental, time, place etc.). I tried using k-means clustering but of l...
Mafia asked 24/2, 2013 at 9:29
4
Solved
I have searched here and googled, but to no avail. When clustering in Weka there is a handy option, classes to clusters, which matches up the clusters produced by the algorithm e.g. simple k-means,...
Kerch asked 27/7, 2012 at 8:3
5
Solved
I have a list of 2 elements' combination like below.
cbnl <- list(
c("A", "B"), c("B", "A"), c("C", "D"), c("E", "D&qu...
Superaltar asked 15/12, 2021 at 12:44
3
Solved
My data is like this:
powerplantname, latitude, longitude, powergenerated
A, -92.3232, 100.99, 50
B, <lat>, <long>, 10
C, <lat>, <long>, 20
D, <lat>, <long>, 40...
Irrespirable asked 11/7, 2017 at 3:51
3
Solved
I'd like to cluster a graph in python using spectral clustering.
Spectral clustering is a more general technique which can be applied not only to graphs, but also images, or any sort of data, how...
Mckeown asked 16/9, 2017 at 21:38
2
I tried to apply stability function in ClustOfVar package and got an error message as below:
Error in La.svd(x, nu, nv) : error code 1 from Lapack routine 'dgesdd'.
I intended to do the variable...
Hinterland asked 12/8, 2013 at 16:28
18
I'm looking for the fastest algorithm for grouping points on a map into equally sized groups, by distance. The k-means clustering algorithm looks straightforward and promising, but does not produce...
Sanderling asked 27/3, 2011 at 21:27
5
Solved
I've been using k-means to cluster my data in R but I'd like to be able to assess the fit vs. model complexity of my clustering using Baysiean Information Criterion (BIC) and AIC. Currently the cod...
Mell asked 5/4, 2013 at 17:19
7
Solved
I want to calculate the Euclidean distance in multiple dimensions (24 dimensions) between 2 arrays. I'm using numpy-Scipy.
Here is my code:
import numpy,scipy;
A=numpy.array([116.629, 7192.6, 4...
Werra asked 23/2, 2012 at 14:13
6
Say you have 10 features you are using to create 3 clusters. Is there a way to see the level of contribution each of the features have for each of the clusters?
What I want to be able to say is th...
Stendhal asked 15/12, 2014 at 19:1
1
which clustering method is considered to be the best among DBSCAN and HDBSCAN and what is the reason behind that?
Fishtail asked 24/11, 2020 at 5:39
4
Solved
I have been searching around for an implementation of DBSCAN for 3 dimensional points without much luck. Does anyone know I library that handles this or has any experience with doing this? I am ass...
Anthrax asked 7/10, 2014 at 21:59
6
Solved
I have a database of user submitted latitude/longitude points and am trying to group 'close' points together. 'Close' is relative, but for now it seems to ~500 feet.
At first it seemed I could jus...
Salicylate asked 3/12, 2010 at 19:28
2
I am new to Kubernetes. I have a K8 cluster with multiple deployments (more than 150), each having more than 4 pods scaled.
I have a requirement to increase resource limits for all deployments in t...
Doubleheader asked 26/7, 2021 at 13:55
5
Solved
I've been trying to cluster some larger dataset. consisting of 50000 measurement vectors with dimension 7. I'm trying to generate about 30 to 300 clusters for further processing.
I've been trying ...
Tachometer asked 18/6, 2012 at 23:42
1
Is there any kind of subspace clustering packages available in scikit-learn.
Kilgore asked 2/11, 2015 at 17:12
5
Solved
Good day!
I have been looking all over the Internet on how to compute for silhouette coefficient, cohesion and separation unfortunately, despite the resources, I just can't understand the formulas...
Daliladalis asked 30/4, 2014 at 11:47
3
I'm trying to make a filter (to remove outlier and noise) using kernel density estimators(KDE). I applied KDE in my 3D (d=3) data points and that gives me the probability density function (PDF) f(x...
Octahedron asked 3/7, 2015 at 3:36
© 2022 - 2024 — McMap. All rights reserved.