Comparing 2 lists consisting of dictionaries with unique keys in python
Asked Answered
B

8

49

I have 2 lists, both of which contain same number of dictionaries. Each dictionary has a unique key. There is a match for each dictionary of the first list in the second list, that is a dictionary with a unique key exists in the other list. But the other elements of such 2 dictionaries may vary. For example:

list_1 = [
            {
                'unique_id': '001',
                'key1': 'AAA',
                'key2': 'BBB',
                'key3': 'EEE'
             },
             {
                'unique_id': '002',
                'key1': 'AAA',
                'key2': 'CCC',
                'key3': 'FFF'
             }
         ]

 list_2 = [
             {
                'unique_id': '001',
                'key1': 'AAA',
                'key2': 'DDD',
                'key3': 'EEE'
             },
             {
                'unique_id': '002',
                'key1': 'AAA',
                'key2': 'CCC',
                'key3': 'FFF'
             }
         ]

I want to compare all elements of 2 matching dictionaries. If any of the elements are not equal, I want to print the none-equal elements.

Would you please help?

Bedelia answered 23/3, 2012 at 19:28 Comment(0)
P
69

Assuming that the dicts line up like in your example input, you can use the zip() function to get a list of associated pairs of dicts, then you can use any() to check if there is a difference:

>>> list_1 = [{'unique_id':'001', 'key1':'AAA', 'key2':'BBB', 'key3':'EEE'}, 
              {'unique_id':'002', 'key1':'AAA', 'key2':'CCC', 'key3':'FFF'}]
>>> list_2 = [{'unique_id':'001', 'key1':'AAA', 'key2':'DDD', 'key3':'EEE'},
              {'unique_id':'002', 'key1':'AAA', 'key2':'CCC', 'key3':'FFF'}]
>>> pairs = zip(list_1, list_2)
>>> any(x != y for x, y in pairs)
True

Or to get the differing pairs:

>>> [(x, y) for x, y in pairs if x != y]
[({'key3': 'EEE', 'key2': 'BBB', 'key1': 'AAA', 'unique_id': '001'}, {'key3': 'EEE', 'key2': 'DDD', 'key1': 'AAA', 'unique_id': '001'})]

You can even get the keys which don't match for each pair:

>>> [[k for k in x if x[k] != y[k]] for x, y in pairs if x != y]
[['key2']]

Possibly together with the associated values:

>>> [[(k, x[k], y[k]) for k in x if x[k] != y[k]] for x, y in pairs if x != y]
[[('key2', 'BBB', 'DDD')]]

NOTE: In case your input lists are not sorted yet, you can do that easily as well:

>>> from operator import itemgetter
>>> list_1, list_2 = [sorted(l, key=itemgetter('unique_id')) 
                      for l in (list_1, list_2)]
Panegyrize answered 23/3, 2012 at 19:32 Comment(6)
@Lattyware: Yes, I'm assuming that the lists are aligned in a way that matching dicts have the same position in both lists. It seemed to me that this was the situation OP faces.Panegyrize
@NiklasB.Definitely, but thought I'd mention it for completeness' sake.Ancelin
@Niklas B.: Yes, the lists are sorted. But it might not have been. How would I do it if the lists were not sorted?Bedelia
It's worth to mention that ZIP method will works ONLY if we have the same number of dict inside each list.Hellas
@Doomsday: It's not clear what the output should be if that is not the case, though.Panegyrize
@NiklasB. I would assume that the dicts which are missing in list_1 or list_2` are part of the difference output.Punitive
P
20

The fastest and most comprehensive way would be, to use two sets of tuples:

set_list1 = set(tuple(sorted(d.items())) for d in sorted(list1))
set_list2 = set(tuple(sorted(d.items())) for d in sorted(list2))
    

(if your list is already sorted, simply remove the list sort to save performance)

Find overlapping using intersection:

set_overlapping = set_list1.intersection(set_list2)

Find difference using symmetric_difference

set_difference = set_list1.symmetric_difference(set_list2)

Convert tuple back to dict

 for tuple_element in set_difference:
     list_dicts_difference.append(dict((x, y) for x, y in tuple_element))
Punitive answered 23/5, 2018 at 10:40 Comment(3)
@Punitive How should I change the last part of the code for overleaping? I just want to get matching keys not values!Trapeze
This assumes list1 and list2 are sorted already, you need to call sort on set_list1 and set_list2Sunbathe
@Mark: Was working on sorted list from the code - Thanks - Edited.Punitive
C
3

The following compares the dictionaries and prints the non-equal items:

for d1, d2 in zip(list_1, list_2):
    for key, value in d1.items():
        if value != d2[key]:
            print key, value, d2[key]

Output: key2 BBB DDD. By using zip we can iterate over two dictionaries at a time. We then iterate over the items of the first dictionary and compare the value with the corresponding value in the second dictionary. If these are not equal, then we print the key and both values.

Childe answered 23/3, 2012 at 19:36 Comment(2)
How does it know that it should compare dictionaries with the same "unique_id" key?Bedelia
Based on the example I assumed the list with dictionaries was ordered. If that's not the case then you need to order it first by unique_id.Childe
A
1

I have a version that actually does not depends on a particular key, so the elements are equal (zero) or they are not (non-zer):

list_1 = [{'unique_id':'001', 'key1':'AAA', 'key2':'BBB', 'key3':'EEE'}, {'unique_id':'002', 'key1':'AAA', 'key2':'CCC', 'key3':'FFF'}]
list_2 = [{'unique_id':'001', 'key1':'AAA', 'key2':'DDD', 'key3':'EEE'}, {'unique_id':'002', 'key1':'AAA', 'key2':'CCC', 'key3':'FFF'}]
list_3 = [{'Name': 'Abid', 'Age': 27},{'Name': 'Mahnaz', 'Age': 27}]
list_4 = [{'Name': 'Abid', 'Age': 27},{'Name': 'Mahnaz', 'Age': 27}]

print cmp(list_1,list_1)
print cmp(list_1,list_3)
print cmp(list_1,list_2)
print cmp(list_2,list_1)
print cmp(list_3,list_4)

gives:

Return Value :  0
Return Value :  1
Return Value : -1
Return Value :  1
Return Value :  0
Aaron answered 28/4, 2015 at 21:12 Comment(2)
what does cmp() do? can you provide me the implementation of cmp()Langobardic
@ShubhamSrivastava: you can read it from the official info "cmp(x, y) Compare the two objects x and y and return an integer according to the outcome. The return value is negative if x < y, zero if x == y and strictly positive if x > y". docs.python.org/2/library/functions.html#cmpAaron
B
0
Let list1 = []
list2 = []

To fetch all the key values we can do like this:
key_values = list1[0]
key = key_values.keys() //key is a list and contains all key values

below is a piece of code which compares all the key pair values:

for val in list1:
    first_key = key[0]
    for val2 in list2:
        if val2[first_key] == val[first_key]:
            for val3 in key:
                if val2[val3] != val[val3]:
                    Unmatched.append(val)

print unmatched

Above contains matches dictionary and prints for which all key, pair values didn't match.
Bedell answered 25/4, 2017 at 2:38 Comment(0)
M
0
def new_change(old_list, new_list):
    change_list = []
    for x in new_list:
        for y in old_list:
            if x['unique_id'] != y['unique_id']:
                change_list.append(x)
    return change_list

pass old and new list in side of this method

Malcommalcontent answered 6/1, 2018 at 5:9 Comment(0)
M
0

Same as jvperrin but with a more logical setup:

def are_list_of_dicts_the_same(list1, list2):
    pairs = zip(list1, list2)
    return all(x == y for x, y in pairs)
Myer answered 26/9, 2023 at 10:14 Comment(1)
As it’s currently written, your answer is unclear. Please edit to add additional details that will help others understand how this addresses the question asked. You can find more information on how to write good answers in the help center.Wristlet
W
-1

compare 2 list of dict on basis of key value and get the different dict items from another::-------

db_items = [
{
    "id": "db...1111",
    "name": "amit",
    "type": "cricket",
    "value": "kumar"
},
{
    "id": "db...222",
    "name": "rishabh",
    "type": "traveller",
    "value": "gupta"
},
{
    "id": "items.....2222",
    "name": "raj",
    "type": "foodie",
    "value": "ankur"
}
]
items = [
{
    "id": "items.....1111",
    "name": "abhishek",
    "type": "cricket",
    "value": "sharma"
},
{
    "id": "items.....2222",
    "name": "hardik",
    "type": "traveller",
    "value": "sharma"
},
{
    "id": "db...333",
    "name": "shanchay",
    "type": "music",
    "value": "kumar"
}
]
for item in items.copy():
for i in db_items:
    if item.get("type") == i.get("type"):
        items.remove(item)

print(items)

output:---
[{'id': 'db...333', 'name': 'shanchay', 'type': 'music', 'value': 
'kumar'}]
Waldrup answered 23/2, 2023 at 10:25 Comment(0)

© 2022 - 2024 — McMap. All rights reserved.