What is the best approach for importing a CSV file whose rows have different numbers of columns into a Pandas DataFrame, using either Pandas or the csv module? For example:
"H","BBB","D","Ajxxx Dxxxs"
"R","1","QH","DTR"," "," ","spxxt rixxls, raxxxd","1"
Using this code:
import pandas as pd
# The file has no header row, so every line is read as data
data = pd.read_csv("smallsample.txt", header=None)
the following error is generated:
Error tokenizing data. C error: Expected 4 fields in line 2, saw 8
Append ,"" for each column that a line is lacking. By lacking, I mean compared to the row with the most columns. – Magen
Supplying range(n) as the column names (the names argument) in the pd.read_csv() function, where n is the number of columns required, does the job, without needing to append a ,"" empty string for each missing column. – Pitfall
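The suggestion in the last comment could be sketched roughly as follows. This is a minimal illustration, not a definitive answer: it assumes the widest row can be found with a first pass using the csv module, and the in-memory string raw is a stand-in for smallsample.txt introduced only so the example is self-contained.

```python
import csv
import io

import pandas as pd

# Sample rows from the question, held in memory (assumption: stands in
# for the contents of smallsample.txt).
raw = (
    '"H","BBB","D","Ajxxx Dxxxs"\n'
    '"R","1","QH","DTR"," "," ","spxxt rixxls, raxxxd","1"\n'
)

# First pass: use the csv module to find the number of fields in the widest row.
n = max(len(row) for row in csv.reader(io.StringIO(raw)))

# Second pass: naming n columns up front lets the parser accept shorter rows;
# their missing trailing fields are filled with NaN.
data = pd.read_csv(io.StringIO(raw), header=None, names=range(n))

print(data)
```

With a real file, the same two passes would open the file by path instead of wrapping a string in io.StringIO. The first pass costs one extra read of the file, but it avoids hard-coding n.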