Monitor usage of threadpool from a reactor scheduler with micrometer

Tried so far

Using reactors build-in metrics

In reactor 3.4.x I found the metric executor.active, but it is a gauge and in monitoring tools, this is polled in an interval (e.g. every minute), this is too inaccurate for short tasks that only last some milliseconds in the pool. In reactor 3.5 I found a max execution time, but not a max for the active threads amount. The documentations are heavily updated currently because of the 3.5 release, so maybe I miss a metric that could be used for what I need.

Using a custom implementation to track usage

I've also tried to implement a DistributedSummary around the scheduler, so I'm able to track the MAX scheduled tasks per time interval (since DistributedSummary uses a TimeWindowMax which will show the MAX per monitoring interval). But it will only track the scheduling itself, not the real thread usage, for example if you have a Mono which evaluates some Monos and Flux inside, which will also use threads from the pool. So it doesn't show me the workload of the pool.

Reactor provides multiple metrics that allow to monitor schedulers:

executor_active_threads, gauge, The approximate number of threads that are actively executing tasks
executor_pool_core_threads, gauge, The core number of threads for the pool
executor_pool_max_threads, gauge, The maximum allowed number of threads in the pool
executor_pool_size_threads, gauge, The current number of threads in the pool
executor_completed_tasks_total, counter, The approximate total number of tasks that have completed execution
executor_completed_tasks_total, counter, The approximate total number of tasks that have completed execution
executor_queued_tasks, gauge, The approximate number of tasks that are queued for execution
executor_queue_remaining_tasks, gauge, The number of additional elements that this queue can ideally accept without blocking
executor_scheduled_once_total, counter
executor_scheduled_repetitively_total, counter
executor, timer
- executor_seconds_sum, counter
- executor_seconds_count, counter
- executor_seconds_max, gauge
executor.idle, timer
- executor_idle_seconds_sum, counter
- executor_idle_seconds_count, counter
- executor_idle_seconds_max , gauge

Internally reactor uses ExecutorServiceMetrics to instrument Schedulers and add additional tags like reactor_scheduler_id.

To monitor number of threads in the reactor schedulers

sum(executor_pool_size_threads) by (reactor_scheduler_id)

or to monitor max number of threads

sum(executor_pool_max_threads) by (reactor_scheduler_id)

There is a demo project that could be used to play with reactor metrics and has Grafana dashboards: https://github.com/reactor/reactor-monitoring-demo

Problem

Question

Tried so far

Recommended topics

Hot tags