How to sort, uniq, and display lines that appear more than X times
I have a file like this:

80.13.178.2
80.13.178.2
80.13.178.2
80.13.178.2
80.13.178.1
80.13.178.3
80.13.178.3
80.13.178.3
80.13.178.4
80.13.178.4
80.13.178.7

I need to display the unique entries for repeated lines (similar to uniq -d), but only for entries that occur more than twice (twice being just an example; I need the flexibility to define the lower limit).

Output for this example should be like this when looking for entries with three or more occurrences:

80.13.178.2
80.13.178.3
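
For comparison, plain uniq -d prints every duplicated line exactly once and offers no way to raise the minimum count. Assuming the sample above is saved as file.txt (a name used here only for illustration):

sort file.txt | uniq -d

80.13.178.2
80.13.178.3
80.13.178.4

Note that 80.13.178.4, which occurs only twice, is included; that is exactly what I want to avoid.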
Syconium answered 22/11, 2013 at 14:58 Comment(0)

With pure awk:

awk '{a[$0]++}END{for(i in a){if(a[i] > 2){print i}}}' a.txt 

It iterates over the file and counts the occurrences of every IP. At the end of the file, it prints every IP that occurs more than 2 times.
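A minimal variation (a sketch, not part of the original answer) passes the threshold in as an awk variable instead of hard-coding 2:

awk -v limit=2 '{a[$0]++} END{for (ip in a) if (a[ip] > limit) print ip}' a.txt

As with the original, the traversal order of for (ip in a) is unspecified, so pipe the result through sort if you need ordered output.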

Toms answered 22/11, 2013 at 15:3 Comment(1)
@Kent Thanks to Bell Labs! :) – Toms

Feed the output from uniq -cd to awk:

sort test.file | uniq -cd | awk -v limit=2 '$1 > limit{print $2}'
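
Note that print $2 assumes each line is a single whitespace-free field, which holds for these IP addresses. A variant for lines that may contain spaces (a sketch, not part of the original answer) strips the leading count instead of printing a field:

sort test.file | uniq -cd | awk -v limit=2 '$1 > limit {sub(/^[ \t]*[0-9]+[ \t]+/, ""); print}'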
Effy answered 22/11, 2013 at 15:1 Comment(2)
Perfect! Thank you, this one works for me: cat log.txt | sort | uniq -d -c | awk '$1 > 30{print $2}' – Syconium
This is a much cleaner, simpler solution. Thank you! – Busily