In most of the Scikit-learn algorithms, the data must be loaded as a Bunch object. For many example in the tutorial load_files() or other functions are used to populate the Bunch object. Functions like load_files() expect data to be present in certain format, but I have data stored in a different format, namely a CSV file with strings for each field.
How do I parse this and load data in the Bunch object format?
Bunch
objects. The example scripts use those, but the algorithms all want arrays or sparse matrices. – MellyBunch.data
) followed by a list of target(Bunch.target
) -clf.fit(<list>, <list>)
. – Ergot