Prometheus Node_exporter 之 System Detail


System Detail

1. Context Switches / Interrupts


type: Graph
Unit: short
Label: Counter
Context switches - CPU 的 context switch 平均次數(5分鍾內)

metrics:

irate(node_context_switches_total{instance=~"$node:$port",job=~"$job"}[5m])


Interrupts - 服務的平均中斷總數(5分鍾內)

metrics:

irate(node_intr_total{instance=~"$node:$port",job=~"$job"}[5m])


2. System Load


type: Graph
Unit: short
Label: Load
Load 1m - 系統1分鍾內的平均負載

metrics:

node_load1{instance=~"$node:$port",job=~"$job"}

Load 5m - 系統5分鍾內的平均負載

metrics:

node_load5{instance=~"$node:$port",job=~"$job"}

Load 15m - 系統15分鍾內的平均負載

metrics:

node_load15{instance=~"$node:$port",job=~"$job"}


3. Interrupts Detail /proc/interrupts

type: Graph
Unit: short
Label: Counter
{{ type }} - {{ info }} - 當前系統的軟中斷列表和對應的中斷號平均中斷次數(5分鍾內)

metrics:

irate(node_interrupts_total{instance=~"$node:$port",job=~"$job"}[5m])

 

4. File Descriptors

type: Graph
Unit: short
Label: Descriptors
Maximum open file descriptors - 最大打開文件描述符數

metrics:

process_max_fds{instance=~"$node:$port",job=~"$job"}


Open file descriptors - 打開文件描述符的數量

metrics:

process_open_fds{instance=~"$node:$port",job=~"$job"}

5. Entropy

type: Graph
Unit: short
Label: Entropy
Entropy available to random number generators

metrics:

node_entropy_available_bits{instance=~"$node:$port",job=~"$job"}


6. Processes State

type: Graph
Unit: short
Label: Processes
Processes blocked - 當前被阻塞的任務的數目 /proc/stat procs_blocked

metrics:

node_procs_blocked{instance=~"$node:$port",job=~"$job"}


Processes in runnable state - 當前運行隊列的任務的數目 /proc/stat procs_running

metrics:

node_procs_running{instance=~"$node:$port",job=~"$job"}


7. Processes Forks

type: Graph
Unit: short
Label: Forks / sec
Processes forks second - 每秒創建的進程個數

metrics:

rate(node_forks_total{instance=~"$node:$port",job=~"$job"}[5m])


8. Processes Memory

type: Graph
Unit: bytes
Label: Bytes
進程占用的虛擬內存的大小:

metrics:

process_virtual_memory_bytes{instance=~"$node:$port",job=~"$job"}


進程常駐內存大小:

metrics:

process_resident_memory_bytes{instance=~"$node:$port",job=~"$job"}


9. Time Syncronized Status

type: Graph
Unit: short
Label: Counter
Is clock synchronized to a reliable server:時鍾是否與一個可靠的服務器同步:

metrics:

node_timex_sync_status{instance=~"$node:$port",job=~"$job"}



Local clock frequency adjustment: 本地時鍾調整頻率:

metrics:

node_timex_frequency_adjustment_ratio{instance=~"$node:$port",job=~"$job"}


10. Time Syncronized Drift

type: Graph
Unit: seconds
Label: Seconds
Estimated error in seconds:估算誤差(秒):

metrics:

node_timex_estimated_error_seconds{instance=~"$node:$port",job=~"$job"}


Time offset in between local system and reference clock:本地系統和參考時鍾之間的時間偏移:

metrics:

node_timex_offset_seconds{instance=~"$node:$port",job=~"$job"}


Maximum error in seconds: 最大誤差(秒):

metrics:

node_timex_maxerror_seconds{instance=~"$node:$port",job=~"$job"}


11. Hardware temperature monitor 硬件的溫度監控

type: Graph
Unit: Celsius(攝氏度)
Label: Temperature
{{ chip }} {{ sensor }} temp -

metrics:

node_hwmon_temp_celsius{instance=~"$node:$port",job=~"$job"}


{{ chip }} {{ sensor }} Critical Alarm

metrics:

node_hwmon_temp_crit_alarm_celsius{instance=~"$node:$port",job=~"$job"}


{{ chip }} {{ sensor }} Critical

metrics:

node_hwmon_temp_crit_celsius{instance=~"$node:$port",job=~"$job"}


{{ chip }} {{ sensor }} Critical Historical

metrics:

node_hwmon_temp_crit_hyst_celsius{instance=~"$node:$port",job=~"$job"}


{{ chip }} {{ sensor }} Max

metrics:

node_hwmon_temp_max_celsius{instance=~"$node:$port",job=~"$job"}


免責聲明!

本站轉載的文章為個人學習借鑒使用,本站對版權不負任何法律責任。如果侵犯了您的隱私權益,請聯系本站郵箱yoyou2525@163.com刪除。



 
粵ICP備18138465號   © 2018-2025 CODEPRJ.COM