Prometheus(四):Prometheus+Alertmanager 配置郵件報警


此處默認已安裝Prometheus服務,服務地址:192.168.56.200 

一、安裝Alertmanager

此處采用源碼編譯的方式安裝。首先下載alertmanager的軟件包,下載地址:https://github.com/prometheus/alertmanager/releases/download/v0.19.0/alertmanager-0.19.0.linux-amd64.tar.gz

下載完成后,將下載中軟件包上傳至Prometheus服務所在的機器(192.168.56.200)的 /usr/local 目錄下

 解壓alertmanager軟件包:

#   tar -zvxf alertmanager-0.19.0.linux-amd64.tar.gz # mv alertmanager-0.19.0.linux-amd64/ alertmanager

進入解壓后的alertmanager文件夾,修改alertmanager.yml文件,配置報警信息,alertmanager.yml 內容如下:

global:
  resolve_timeout: 5m
  smtp_smarthost: 'smtp.126.com:465'
  smtp_from: '****@126.com' # 用於發送告警右鍵的郵箱
  smtp_auth_username: '****@126.com'
  smtp_auth_password: '****'    #此處為郵箱的授權密碼,非郵箱登錄密碼
  smtp_require_tls: false

route:  # 設置報警分發策略
  group_by: ['alertname'] # 分組標簽
  group_wait: 10s      # 告警等待時間。告警產生后等待10s,如果有同組告警一起發出
  group_interval: 10s  # 兩組告警的間隔時間
  repeat_interval: 1m  # 重復告警的間隔時間,減少相同右鍵的發送頻率 此處為測試設置為1分鍾 
  receiver: 'mail'  # 默認接收者
routes: # 指定那些組可以接收消息 - receiver: mail receivers: - name: 'mail' email_configs: - to: '****@126.com' # 接收報警郵件的郵箱 #inhibit_rules: # - source_match: # severity: 'critical' # target_match: # severity: 'warning' # equal: ['alertname', 'dev', 'instance']

檢查alertmanager.yml 配置是否正確

# ./amtool check-config alertmanager.yml

 配置正確

啟動alertmanager

#  ./alertmanager

 可以看到alertmanager服務已經起來,服務所在的端口為9093

瀏覽器訪問: http://192.168.56.200:9093  (IP:9093)

 alertmanager成功啟動。

二、配置Prometheus

Ctrl+C 結束掉alertmanager服務進程,進入Prometheus的安裝目錄下修改Prometheus配置。

#  cd /usr/local/prometheus
#  vim prometheus.yml

修改Prometheus.yml文件中的 alerting 配置項及rule_files配置項

alerting:
  alertmanagers:
  - static_configs:
    - targets: ['localhost:9093']

  rule_files:  #配置告警規則
   - "rule.yml"

修改完成后保存退出

以下是Prometheus.yml 文件全部內容:

# my global config global: scrape_interval: 15s # Set the scrape interval to every 15 seconds. Default is every 1 minute. evaluation_interval: 15s # Evaluate rules every 15 seconds. The default is every 1 minute. # scrape_timeout is set to the global default (10s). # Alertmanager configuration alerting: alertmanagers: - static_configs: - targets: ['localhost:9093'] # - alertmanager:9093 # Load rules once and periodically evaluate them according to the global 'evaluation_interval'. rule_files: - "rule.yml" # - "first_rules.yml" # - "second_rules.yml" # A scrape configuration containing exactly one endpoint to scrape: # Here it's Prometheus itself. scrape_configs: # The job name is added as a label `job=<job_name>` to any timeseries scraped from this config. - job_name: 'prometheus' # metrics_path defaults to '/metrics' # scheme defaults to 'http'. static_configs: - targets: ['localhost:9090'] - job_name: 'Linux' static_configs: - targets: ['192.168.56.201:9100'] labels: instance: Linux - job_name: 'Windows' static_configs: - targets: ['192.168.56.1:9182'] labels: instance: Windows - job_name: 'snmp' scrape_interval: 10s static_configs: - targets: - 172.20.2.83 # 交換機IP地址 metrics_path: /snmp # params: # module: [if_mib] relabel_configs: - source_labels: [__address__] target_label: __param_target - source_labels: [__param_target] target_label: instance - target_label: __address__ replacement: 192.168.56.100:9116 # snmp_exporter 服務IP地址

編寫告警規則文件rule.yml

#  vim rule.yml

將以下內容寫入文件當中,(此處用於測試,設置為當內存占用高於10%時,就會告警)

groups:
- name: mem-rule
  rules:
  - alert: "內存報警"
    expr: (node_memory_MemTotal_bytes - (node_memory_MemFree_bytes+node_memory_Buffers_bytes+node_memory_Cached_bytes )) / node_memory_MemTotal_bytes * 100 > 10
    for: 30s
    labels:
      severity: warning
    annotations:
      summary: "服務名:{{$labels.alertname}} 內存報警"
      description: "{{ $labels.alertname }} 內存資源利用率大於 10%"
      value: "{{ $value }}"

保存退出

三、告警檢測

重啟Prometheus服務,使配置的告警規則生效

#  systemctl restart prometheus

進入alertmanager的安裝文件夾,啟動alertmanager

#  cd /usr/local/alertmanager
#  ./alertmanager

稍等片刻,登錄設置的接收告警右鍵的郵箱,可以看到已經接收到告警郵件

 瀏覽器訪問 http://192.168.56.200:9093/#/alerts  ,也能看到告警信息

 四、配置alertmanager服務開機自啟

Ctrl+C 結束掉 alertmanager 服務進程,創建 alertmanager服務,讓 alertmanager 以服務的方式,開機自啟。

添加系統服務

#  vim /etc/systemd/system/alertmanager.service

將以下內容寫入文件中

[Unit]
Description=alertmanager
After=network.target

[Service]
WorkingDirectory=/usr/local/alertmanager
ExecStart=/usr/local/alertmanager/alertmanager --config.file=alertmanager.yml --log.level=debug --log.format=json
Restart=on-failure

[Install]
WantedBy=multi-user.target

保存退出

啟動服務,設置開機自啟

#  systemctl daemon-reload
#  systemctl enable alertmanager
#  systemctl start alertmanager

至此Prometheus+alertmanage配置郵件報警完成。


免責聲明!

本站轉載的文章為個人學習借鑒使用,本站對版權不負任何法律責任。如果侵犯了您的隱私權益,請聯系本站郵箱yoyou2525@163.com刪除。



 
粵ICP備18138465號   © 2018-2025 CODEPRJ.COM