We want to schedule a workflow based on data availability but there is no particular frequency of data arrival. Also there could be multiple re-runs of data and hence multiple versions of the data for the day arriving at any time.
As I understand from the specification, currently it's mandatory to specify frequency parameter in coordinator.
However, we would like to trigger our workflow based on some event (data arrival or partition creation) only without depending on the frequency.
Seems this qualifies for Asynchronous Data Set. Does Oozie support Asynchronous Data Set yet?