Group by with Deadman alert

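I have dnsmasq running on all of my hosts, and I want to monitor when it goes down and get a separate notification for each host. So far I have had no luck getting this to work. Any help is appreciated.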

var db = 'telegraf'

var rp = 'autogen'

var measurement = 'procstat'

var groupBy = []

var whereFilter = lambda: ("process_name" == 'dnsmasq')

var period = 1m

var name = 'Dnsmasq Deadman'

var idVar = name + ':{{.Group}}'

var message = 'Dnsmasq not responding. {{.Time}} Restarting dnsmasq. {{ index .Tags "host"}}'

var idTag = 'alertID'

var levelTag = 'level'

var messageField = 'message'

var durationField = 'duration'

Change your groupBy variable to var groupBy = ['host']

As far as I know, you need to group by any tag or field that you want to use in the alert.
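
For illustration, here is a minimal sketch of how the variables from the question might be wired into a per-host deadman alert once groupBy is set. The question does not show the pipeline itself, so the stream/from/deadman chain below is an assumption modeled on the usual Kapacitor deadman pattern, not the poster's actual script; the threshold of 0.0 points per period is also assumed.

```
// Variables taken from the question, with groupBy changed as suggested.
var db = 'telegraf'
var rp = 'autogen'
var measurement = 'procstat'
var groupBy = ['host']
var whereFilter = lambda: ("process_name" == 'dnsmasq')
var period = 1m
var name = 'Dnsmasq Deadman'
var idVar = name + ':{{.Group}}'
var message = 'Dnsmasq not responding. {{.Time}} Restarting dnsmasq. {{ index .Tags "host"}}'
var idTag = 'alertID'
var levelTag = 'level'
var messageField = 'message'
var durationField = 'duration'

// Assumed pipeline: one stream group per host, so each host is tracked separately.
var data = stream
    |from()
        .database(db)
        .retentionPolicy(rp)
        .measurement(measurement)
        .groupBy(groupBy)
        .where(whereFilter)

// Alert when a group's throughput drops to 0 points (assumed threshold) in one period.
data
    |deadman(0.0, period)
        .id(idVar)
        .message(message)
        .idTag(idTag)
        .levelTag(levelTag)
        .messageField(messageField)
        .durationField(durationField)
```

With the data grouped by host, {{.Group}} and {{ index .Tags "host"}} should resolve per host, so each host gets its own alert ID and message. An alert handler (for example a log or notification handler) would still need to be attached for the alert to be delivered anywhere.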