I have:
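val DF1 = sparkSession.sql("select col1, col2, col3 from table")
// Collect (col1, col2) pairs to the driver
val tupleList = DF1.select("col1", "col2").rdd.map(r => (r(0), r(1))).collect()
// Print every element of every tuple on its own line
tupleList.foreach(x => x.productIterator.foreach(println))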
But I do not get all the tuples in the output. Where is the issue?
col1    col2
AA      CCC
AA      BBB
DD      CCC
AB      BBB
Others  BBB
GG      ALL
EE      ALL
Others  ALL
ALL     BBB
NU      FFF
NU      Others
Others  Others
C       FFF
The output I get is:
CCC AA BBB AA Others AA Others DD ALL Others ALL GG ALL ALL
What does tuple.productIterator.foreach(println) give you? – Detoxify
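To make the comment's point concrete, here is a minimal sketch in plain Scala (no Spark), assuming the rows listed in the question have already been collected into a local sequence. The names `TuplePrinting`, `rows`, and `formatted` are hypothetical and only serve the illustration. It shows that `productIterator.foreach(println)` prints each tuple element on its own line, so 13 two-column rows produce 26 output lines, whereas joining the elements with `mkString` keeps each row on one line and makes missing rows easier to spot. It does not reproduce or explain the missing rows themselves.

```scala
object TuplePrinting {
  // Sample rows copied from the question, standing in for the collected result.
  // (Assumption: the real collect() would return an Array[(Any, Any)] like this.)
  val rows: Seq[(Any, Any)] = Seq(
    ("AA", "CCC"), ("AA", "BBB"), ("DD", "CCC"), ("AB", "BBB"),
    ("Others", "BBB"), ("GG", "ALL"), ("EE", "ALL"), ("Others", "ALL"),
    ("ALL", "BBB"), ("NU", "FFF"), ("NU", "Others"), ("Others", "Others"),
    ("C", "FFF")
  )

  // One string per tuple, e.g. "AA CCC", so each row stays on a single line.
  val formatted: Seq[String] = rows.map(t => t.productIterator.mkString(" "))

  def main(args: Array[String]): Unit = {
    // Style used in the question: every element is printed separately.
    rows.foreach(x => x.productIterator.foreach(println))
    // Alternative: one printed line per row.
    formatted.foreach(println)
  }
}
```

Comparing `rows.size` with the number of lines printed by the second loop is a quick way to check whether any tuples are actually missing or only hard to read in the flattened output.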