I'm currently researching recommender systems and would like to know how other researchers acquire or generate test data to evaluate the systems' performance?
When I was working with Recommender Systems I had the exact same problem. I enjoyed the Grouplens dataset the most:
You can download ratings given by users to movies.
Also, I described in my blog some datasets I found while researching:
http://girlincomputerscience.blogspot.com.br/2010/12/datasets.html
Hope it helps!
I don't know what field you're evaluating, but if it's movie recommendations, you could use the MovieLens data from GroupLens to start out with. (It seems like their site is temporarily down, but I'm sure it will be back up soon).
They have three sets of data - 100,000 votes (preferences), 1 million, and 10 million - and it seems like they're more or less the standard that everyone starts out with.
© 2022 - 2024 — McMap. All rights reserved.