I have a Python set
that contains objects with __hash__
and __eq__
methods in order to make certain no duplicates are included in the collection.
I need to json encode this result set
, but passing even an empty set
to the json.dumps
method raises a TypeError
.
File "/usr/lib/python2.7/json/encoder.py", line 201, in encode
chunks = self.iterencode(o, _one_shot=True)
File "/usr/lib/python2.7/json/encoder.py", line 264, in iterencode
return _iterencode(o, 0)
File "/usr/lib/python2.7/json/encoder.py", line 178, in default
raise TypeError(repr(o) + " is not JSON serializable")
TypeError: set([]) is not JSON serializable
I know I can create an extension to the json.JSONEncoder
class that has a custom default
method, but I'm not even sure where to begin in converting over the set
. Should I create a dictionary out of the set
values within the default method, and then return the encoding on that? Ideally, I'd like to make the default method able to handle all the datatypes that the original encoder chokes on (I'm using Mongo as a data source so dates seem to raise this error too)
Any hint in the right direction would be appreciated.
EDIT:
Thanks for the answer! Perhaps I should have been more precise.
I utilized (and upvoted) the answers here to get around the limitations of the set
being translated, but there are internal keys that are an issue as well.
The objects in the set
are complex objects that translate to __dict__
, but they themselves can also contain values for their properties that could be ineligible for the basic types in the json encoder.
There's a lot of different types coming into this set
, and the hash basically calculates a unique id for the entity, but in the true spirit of NoSQL there's no telling exactly what the child object contains.
One object might contain a date value for starts
, whereas another may have some other schema that includes no keys containing "non-primitive" objects.
That is why the only solution I could think of was to extend the JSONEncoder
to replace the default
method to turn on different cases - but I'm not sure how to go about this and the documentation is ambiguous. In nested objects, does the value returned from default
go by key, or is it just a generic include/discard that looks at the whole object? How does that method accommodate nested values? I've looked through previous questions and can't seem to find the best approach to case-specific encoding (which unfortunately seems like what I'm going to need to do here).
dict
s? I think you want to make just alist
out of the set and then pass it to the encoder... e.g:encode(list(myset))
– Addi