Getting only unique values within a category in Kusto Query Language (Azure Monitor Logs)

About

Asked 21/10, 2019 at 5:56 Answered 21/10, 2019 at 6:57

Solved azure distinct-values azure-data-explorer kql

I have data in this format :

Category Session_ID  Step_Name

  A         100        1
  A         100        2 
  A         200        1
  A         200        1    <--
  A         200        1    <--
  A         200        2
  B         300        1
  B         300        1    <--

I need to remove the duplicate values of step names within each Session_ID. For example in ID = 200, there are three '1's which need to be changed to one '1', so the final data looks like :

Category Session_ID  Step_Name

  A         100        1
  A         100        2 
  A         200        1
  A         200        2
  B         300        1

Undertrump answered 21/10, 2019 at 5:56 Comment(2)

Maybe Distinct is working for: | distinct Session_ID, Step_Name – Takishatakken 21/10, 2019 at 6:2

Yes, this works, thanks. Can you put this as an answer. – Undertrump 21/10, 2019 at 7:8

You should use distinct operator in your case:

your_table
| distinct Category, Session_ID, Step_Name

then you can get the expected output like below, it works at my side:

Category Session_ID  Step_Name

  A         100        1
  A         100        2 
  A         200        1
  A         200        2
  B         300        1

And for your question in the comment, if you use the above query, the record like "A 100 1" would consider as one entity, and only if there are 2 or more exact same record like "A 100 1" would remain only 1 record if using distinct.

And if there is another record like "B 100 1", then the both 2 records "A 100 1" and "B 100 1" would remain.

Profundity answered 21/10, 2019 at 6:57 Comment(8)

If we have a Timestamp column instead of Category, how do we proceed in that case? - considering repeat steps within a session ID will have different timestamps and will be considered as unique. – Undertrump 23/10, 2019 at 8:46

If you have a Timestamp column instead of Category, then what's your expected result? – Profundity 23/10, 2019 at 8:48

The repeated step with the latest timestamp should remain (within session_id) and the earlier repeated steps should be removed. (the step with the second arrow should remain and the step with the first arrow must be removed) – Undertrump 23/10, 2019 at 9:20

@user75252, could you please post a new issue about that? and provide the link when it's done :). – Profundity 23/10, 2019 at 9:22

Sure, I will do that. – Undertrump 23/10, 2019 at 9:37

This a more expanded question : #58519976 – Undertrump 23/10, 2019 at 9:44

@user75252, I'll look into this issue later. – Profundity 23/10, 2019 at 9:45

Let us continue this discussion in chat. – Undertrump 23/10, 2019 at 9:47

Hot tags

Godot Unity Godot Help Programming Godot 4.X GUI GDScript 3D 2D Physics CSharp Godot 3.X VR XR Projects C++

Recommended topics

Hot tags