I have somewhere between 10-20k different time-series (24 dimensional data -- a column for each hour of the day) and I'm interested in clustering time series that exhibit roughly the same patterns of activity.
I had originally started to implement Dynamic Time Warping (DTW) because:
- Not all of my time series are perfectly aligned
- Two slightly shifted time series for my purposes should be considered similar
- Two time series with the same shape but different scales should be considered similar
The only problem I had run into with DTW was that it did not appear to scale well -- fastdtw
on a 500x500 distance matrix took ~30 minutes.
What other methods exist that would help me satisfy conditions 2 & 3?
dtwclust
is multi-threaded. – Briticism