I have data in this format:
Category Session_ID Step_Name
A 100 1
A 100 2
A 200 1
A 200 1 <--
A 200 1 <--
A 200 2
B 300 1
B 300 1 <--
I need to remove duplicate Step_Name values within each Session_ID. For example, Session_ID 200 has three rows with Step_Name 1 (the extra ones are marked with <-- above), and these should be collapsed into a single row, so that the final data looks like this:
Category Session_ID Step_Name
A 100 1
A 100 2
A 200 1
A 200 2
B 300 1
| distinct Session_ID, Step_Name
– Takishatakken
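
Assuming the query language is Kusto (KQL), which the `| distinct` syntax in the comment above suggests but the question does not confirm, a minimal sketch might look like the following. The inline `datatable` only stands in for the real table, and the column types are guesses. Note that `distinct Session_ID, Step_Name` on its own would return only those two columns, so Category is listed as well here to match the desired output.

```kusto
// Stand-in for the real table; column types are assumptions.
datatable(Category:string, Session_ID:long, Step_Name:long)
[
    "A", 100, 1,
    "A", 100, 2,
    "A", 200, 1,
    "A", 200, 1,   // duplicate
    "A", 200, 1,   // duplicate
    "A", 200, 2,
    "B", 300, 1,
    "B", 300, 1,   // duplicate
]
// Keep one row per unique (Category, Session_ID, Step_Name) combination.
| distinct Category, Session_ID, Step_Name
// distinct does not guarantee row order, so sort explicitly for display.
| sort by Session_ID asc, Step_Name asc
```

If a Session_ID could ever appear under more than one Category, listing Category in `distinct` would keep one row per Category; whether that is wanted depends on the data.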