I am interested in how others have handled a Read Side DB update failure in CQRS/Event Sourcing eventually consistent systems.
I have such a system that could append an event to my event store, and then for some reason fail to update a corresponding read side DB, leading to a state of inconsistency.
I have read this post and also this one which really focus on having a singleton/global aggregate that manages constraints before event storing events.
But how do you proceed when update failures are not related to constraints (temporary hardware failure for example)?
Another mentioned solution was manual intervention, but I suppose I am trying to avoid this. High level I am thinking of doing something like triggering some sort of job to rebuild my entire read side DB from the event store, while temporarily suspending and queuing the commands and event handlers that normally update the read side.
Does anyone else do something similar to this? Is there a better way?
Thanks!