I am trying to terminate a whole supervision tree from a supervised worker process. Here is my supervision tree:
                  +--------------------------+
                  |                          |
         +--------+ Sup1: Dynamic Supervisor +---------+
         |        |                          |         |
         |        +-------------+------------+         |
         |                      |                      |
         |                      |                      |
         v                      v                      v
+------------------+   +------------------+   +------------------+
|                  |   |                  |   |                  |
| Job1: Supervisor |   | Job2: Supervisor |   | Job3: Supervisor |
|                  |   |                  |   |                  |
+------------------+   +-+-------------+--+   +------------------+
                         |             |
                         |             |
                         |             |
                         |             |
                         v             v
            +-------------------+   +--------------+
            |                   |   |              |
            | Progress Monitor: |   | Work: Worker |
            | Worker            |   |              |
            |                   |   +--------------+
            +-------------------+
Process life cycle:
- A `Job` is started via: `DynamicSupervisor.start_child(__MODULE__, spec)`
- Each job is a supervision tree as well: 1 supervisor (restart strategy `one_for_one`) -> 2 workers
- The `Progress Monitor` worker knows when the given job is done
- On job done, the `Progress Monitor` worker makes an attempt to terminate the whole job supervision tree by calling: `DynamicSupervisor.terminate_child(__MODULE__, pid)`
- `Progress Monitor` is expected to do cleanup steps in its `terminate` callback - it is trapping exit signals
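For reference, the life cycle above could be laid out roughly as follows. This is only a sketch: the module names (`JobsSup`, `JobSup`, `Work`, `ProgressMonitor`), the `job_id` argument, and the child start order are assumptions made for illustration and are not taken from the actual code.

```elixir
defmodule JobsSup do
  # Hypothetical stand-in for "Sup1: Dynamic Supervisor".
  use DynamicSupervisor

  def start_link(_opts) do
    DynamicSupervisor.start_link(__MODULE__, :ok, name: __MODULE__)
  end

  # Starts one job subtree: a JobSup with its two workers.
  def start_job(job_id) do
    DynamicSupervisor.start_child(__MODULE__, {JobSup, job_id})
  end

  @impl true
  def init(:ok), do: DynamicSupervisor.init(strategy: :one_for_one)
end

defmodule JobSup do
  # Hypothetical stand-in for "JobN: Supervisor".
  use Supervisor

  def start_link(job_id), do: Supervisor.start_link(__MODULE__, job_id)

  @impl true
  def init(job_id) do
    # Start order is an assumption; children are stopped in reverse order.
    children = [{ProgressMonitor, job_id}, {Work, job_id}]
    Supervisor.init(children, strategy: :one_for_one)
  end
end

defmodule Work do
  # Placeholder for the worker that does the actual job.
  use GenServer

  def start_link(job_id), do: GenServer.start_link(__MODULE__, job_id)

  @impl true
  def init(job_id), do: {:ok, job_id}
end

defmodule ProgressMonitor do
  use GenServer

  def start_link(job_id), do: GenServer.start_link(__MODULE__, job_id)

  @impl true
  def init(job_id) do
    # Trap exits so terminate/2 runs when the supervisor sends :shutdown.
    Process.flag(:trap_exit, true)
    {:ok, job_id}
  end

  @impl true
  def terminate(_reason, _state) do
    # Cleanup steps would go here.
    :ok
  end
end
```

With this layout, `JobsSup.start_job(1)` returns `{:ok, pid}` where `pid` is the job supervisor, which is the pid later passed to `DynamicSupervisor.terminate_child/2`.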
Problems and observations:
- `DynamicSupervisor.terminate_child` is a blocking call, which means it waits for all child processes to terminate as well, including the calling process - `Progress Monitor`
- `Progress Monitor` is in a deadlock and cannot terminate. The parent supervisor sends a `:kill` signal, which does not trigger the `terminate` callback
Quick workarounds:
- Call `DynamicSupervisor.terminate_child` from the `Progress Monitor` worker asynchronously: `spawn(fn -> DynamicSupervisor.terminate_child(__MODULE__, pid) end)`
- Define a shutdown strategy for `Sup1: Dynamic Supervisor`: `shutdown: 5_000`. It will wait at most 5s for a job supervision tree to terminate and then it will send a `shutdown` exit signal. This will ensure that the `terminate` callback is called for the `Progress Monitor` process.
I am not happy with either of them.
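To make the first workaround concrete, here is a minimal, self-contained sketch. The `AsyncDemo.*` module names, the `:job_done` message, and the `notify` pid (used only so the cleanup can be observed from outside) are all hypothetical. The point it illustrates is that the blocking call runs in a separate, unlinked process, so the monitor stays free to receive the `:shutdown` exit signal and run `terminate/2`.

```elixir
defmodule AsyncDemo.JobsSup do
  # Hypothetical stand-in for "Sup1: Dynamic Supervisor".
  use DynamicSupervisor

  def start_link(_opts) do
    DynamicSupervisor.start_link(__MODULE__, :ok, name: __MODULE__)
  end

  @impl true
  def init(:ok), do: DynamicSupervisor.init(strategy: :one_for_one)
end

defmodule AsyncDemo.JobSup do
  # Hypothetical job supervisor; the Work worker is omitted for brevity.
  use Supervisor

  def start_link(notify), do: Supervisor.start_link(__MODULE__, notify)

  @impl true
  def init(notify) do
    # self() is the job supervisor here, so its pid can be handed to the monitor.
    children = [{AsyncDemo.ProgressMonitor, {self(), notify}}]
    Supervisor.init(children, strategy: :one_for_one)
  end
end

defmodule AsyncDemo.ProgressMonitor do
  use GenServer

  def start_link(arg), do: GenServer.start_link(__MODULE__, arg)

  @impl true
  def init({job_sup, notify}) do
    Process.flag(:trap_exit, true)
    {:ok, %{job_sup: job_sup, notify: notify}}
  end

  @impl true
  def handle_info(:job_done, %{job_sup: job_sup} = state) do
    # The helper process blocks on terminate_child/2 instead of this one.
    spawn(fn -> DynamicSupervisor.terminate_child(AsyncDemo.JobsSup, job_sup) end)
    {:noreply, state}
  end

  def handle_info(_msg, state), do: {:noreply, state}

  @impl true
  def terminate(reason, %{notify: notify}) do
    # Cleanup steps would go here; the message only makes the call observable.
    send(notify, {:cleaned_up, reason})
    :ok
  end
end
```

A possible way to exercise it from a script or IEx session:

```elixir
{:ok, _} = AsyncDemo.JobsSup.start_link([])
{:ok, job} = DynamicSupervisor.start_child(AsyncDemo.JobsSup, {AsyncDemo.JobSup, self()})
[{_, monitor, :worker, _}] = Supervisor.which_children(job)
send(monitor, :job_done)
```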
Questions:
- How can I trigger supervision tree termination from a worker process and avoid deadlocks?
- If terminating a supervision tree from a worker is not the best practice, what is the recommended way then?
- Any recommendations on how to redesign the supervision tree to make graceful termination easier?
`Progress Monitor` is unnecessary. Could you explain the reason why the `Worker` cannot simply do its thing and then terminate with reason `normal`? It seems like you could be using `Task.Supervisor` for this purpose. – Throng
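One possible reading of this suggestion, as a rough sketch only: the job runs as a task under a `Task.Supervisor`, does its cleanup in the same process, and then simply returns, which ends the process with reason `normal`. The `TaskDemo` module, the `run_job/2` helper, and the `{:job_finished, pid, result}` message are hypothetical names used for illustration.

```elixir
defmodule TaskDemo do
  # Runs job_fun as a supervised task and reports the result to the caller.
  def run_job(task_sup, job_fun) do
    caller = self()

    Task.Supervisor.start_child(task_sup, fn ->
      result = job_fun.()
      # Cleanup steps would go here, once the work is finished.
      send(caller, {:job_finished, self(), result})
      # Returning from the function ends the task with reason :normal,
      # so nothing has to be terminated from the outside.
    end)
  end
end
```

A possible usage, assuming the task supervisor is started by hand rather than as part of an application tree:

```elixir
{:ok, task_sup} = Task.Supervisor.start_link()
{:ok, pid} = TaskDemo.run_job(task_sup, fn -> Enum.sum(1..10) end)
```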