monitoring Questions

4

Solved

When deciding between Counter and Gauge, the Prometheus documentation states: To pick between counter and gauge, there is a simple rule of thumb: if the value can go down, it is a gauge. Counte...
Stringhalt asked 2/11, 2019 at 18:31
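
The rule of thumb quoted in this excerpt can be illustrated with a minimal sketch. The `Counter` and `Gauge` classes below are hypothetical stand-ins written in plain Python for illustration only; they are not the Prometheus client library API, and the metric names are made up.

```python
class Counter:
    """Hypothetical stand-in: a value that only ever goes up."""

    def __init__(self) -> None:
        self.value = 0.0

    def inc(self, amount: float = 1.0) -> None:
        # A counter must never decrease, so negative increments are rejected.
        if amount < 0:
            raise ValueError("counters can only increase")
        self.value += amount


class Gauge:
    """Hypothetical stand-in: a value that may go up or down."""

    def __init__(self) -> None:
        self.value = 0.0

    def set(self, value: float) -> None:
        self.value = value

    def inc(self, amount: float = 1.0) -> None:
        self.value += amount

    def dec(self, amount: float = 1.0) -> None:
        self.value -= amount


# Requests served only accumulate, so a counter fits (assumed example metric).
requests_total = Counter()
requests_total.inc()
requests_total.inc(2)

# A queue length can shrink, so by the rule of thumb it is a gauge.
queue_length = Gauge()
queue_length.set(5)
queue_length.dec(2)
```

In this sketch the distinction is enforced by the types themselves: only `Gauge` exposes `set` and `dec`.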

1

tl;dr I have a label in prometheus called "ignore" with value "yes": metric_test{label1="label1",ignore="yes"} 1 I want to disable alerts for any metrics wi...
Afro asked 7/2, 2022 at 2:41

4

Solved

I am training a model with Keras using the model.fit() method. I would like to use multiple validation sets that should be validated on separately after each training epoch so that I get one loss ...
Varuna asked 9/12, 2017 at 18:45

5

Solved

I'd like to write a monitoring plugin that checks various hosts on my network to make sure that password or interactive SSH authentication is not enabled. That is, I need to write code that: Conn...
Exuviae asked 27/8, 2010 at 15:15

4

Solved

I have the Android Emulator running in my Ubuntu VM and just installed an apk file through the console. The emulator and adb have many options to view what is going on on the device. But how can I g...

1

We are using an old version of Payara, and exporting JVM metrics with the Prometheus JVM GC Exporter and Memory exporters. In these exporters, we don't have access to the number of Java HeapSpace e...
Promontory asked 4/1, 2023 at 11:17

2

Solved

I have configured Prometheus to scrape metrics from cAdvisor. However, the metric "container_cpu_load_average_10s" only returns 0. I am able to see the CPU metrics under the cAdvisor web UI correct...
Caricature asked 3/3, 2018 at 10:50

2

Solved

I've downloaded MAT from this link and simply extracted the zip file to use. Then, when I executed the MemoryAnalyzer.exe, I got this error My %JAVA_HOME% is already C:\Program Files\Java\jdk1.8.0_...
Despond asked 17/7, 2020 at 8:24

3

Solved

I have an alarm tracking the metric for LoadBalancer 5xx errors on a single ALB. This should be in an "In alarm" state if 1 datapoint in the past 1 is above the threshold of 2. The period ...
Kamalakamaria asked 24/9, 2020 at 10:13

1

Solved

I push a custom tag to DataDog with key of execute_error and value of "testing DATADOG mApPeR OK" However, when I check on the DataDog, it looks like the tag is mapped into "execute_...
Gallardo asked 30/11, 2022 at 15:53

1

I really like Heroku's app monitoring. But I don't have it in my Dokku PaaS. I want to see my apps' and plugins' CPU, RAM, disk load, network IO, ... Is there a way that I can easily collect and vis...
Liberia asked 18/2, 2020 at 9:43

9

Solved

I have a third party VPN app on my non-rooted Android 4.4 device, and want to write a background service to monitor the VPN connection and alert the user if the VPN connection has been broken. Is ...
Schuyler asked 7/2, 2015 at 19:44

5

Solved

I have a query: node_systemd_unit_state{instance="server-01",job="node-exporters",name="kubelet.service",state="active"} 1 I want the label name to be renamed (or replaced) to unit_name ONLY wit...
Fideliafidelio asked 17/1, 2019 at 12:18

1

Solved

When reviewing the documentation here (https://cloud.google.com/monitoring/alerts/incidents-events#incident) and using the general product, it appears that the open incidents and associated details...
Discretionary asked 26/8, 2022 at 14:5

1

Solved

Splunk has transaction command which can produce duration between logs grouped by id: 2020-01-01 12:12 event=START id=1 2020-01-01 12:13 event=STOP id=1 as it is described on Query for calculatin...
Atelier asked 7/7, 2020 at 19:52

2

Solved

I have an application that will be monitored by Prometheus, but the application needs a custom header key like: x-auth-token: <customrandomtoken> What should I do with prometheus.yml?
Lumbago asked 3/2, 2021 at 17:20

1

Solved

I am using Grafana Loki and I need to calculate the total number of a certain log message for a specific time interval. For example, I need the total number of log message "some-text" in ...
Cletus asked 13/6, 2022 at 18:25

2

I am wondering what the minimum time is for Prometheus' scrape_interval parameter. According to the Prometheus Documentation, the value for this parameter needs to follow a regex which seems to me ...
Dieball asked 9/3, 2021 at 15:15

2

Solved

I am trying to figure out the behavior of Prometheus' increase() querying function with process restarts. When there is a process restart within a 2m interval and I query: sum(increase(my_metric...
Effrontery asked 24/8, 2018 at 22:20
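
As background for this excerpt: Prometheus treats any drop in a counter's value as a counter reset, for example after a process restart. The sketch below shows only that reset handling over a list of raw samples. It is a simplified illustration, not Prometheus's implementation: the real `increase()` also extrapolates to the edges of the query window, which is omitted here, and the function name `raw_increase` is hypothetical.

```python
from typing import Sequence


def raw_increase(samples: Sequence[float]) -> float:
    """Sum the sample-to-sample increases, treating any drop as a reset.

    Simplification: no extrapolation to the window boundaries.
    """
    total = 0.0
    for prev, cur in zip(samples, samples[1:]):
        if cur < prev:
            # The counter restarted from zero, so the whole current
            # value counts as new increase since the reset.
            total += cur
        else:
            total += cur - prev
    return total


# The process restarts between the second and third samples.
samples = [10.0, 12.0, 3.0, 5.0]
print(raw_increase(samples))  # 2 + 3 + 2 = 7.0
```

Any increments that happened between the last scrape before the restart and the restart itself are lost, since they were never scraped.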

4

Solved

Use case / Problem I am in charge of maintaining a kubernetes cluster with 40 nodes (split across 2 zones). We have roughly 100 microservices and platform stuff like Kafka brokers running in this ...
Jyoti asked 27/9, 2018 at 15:40

2

Solved

I'm running Prometheus and Telegraf on the same host. I'm using a few input plugins: inputs.cpu inputs.ntpq I've configured the prometheus_client output plugin to send data to Prometheus ...
Gibrian asked 11/2, 2019 at 17:36

3

Solved

I want to display pod details in the following format using promql/Prometheus. Furthermore, I want to display CPU and memory utilization of application/component in below format using promql prom...
Gamut asked 10/8, 2020 at 20:2

2

I have a Prometheus metric called device_number. What I want is to show the difference in value between now and one day/week/month etc ago. Which means subtracting two values with two different tim...
Groscr asked 15/5, 2019 at 11:19

4

Solved

There is a long-running script script.sh on a remote Linux machine. I need to start it and monitor its activity in real time. The script during its activity may output to stdout and stderr. I am ...
Mythify asked 23/7, 2015 at 12:51

10

Solved

Similar to this question here I am trying to monitor if a set of website links are up and running or not responding. I have found the same PowerShell script over the Internet. However, instead of ...
Stygian asked 28/11, 2013 at 6:32
