I am trying to solve a simple classification problem.
The Problem:
I have a set of text and I have to categorize them based on the content.
Solution using Mahout:
I understood that I have to convert the input to a sequence file to generate the model. Yes, I was able to do this. Now, how do I categorize my test data? The 20News example only tests for correctness. But, I want to do the actual classification.
I am not sure if I need to write code or use some existing classes available to classify the test set.?