Commit graph

118 commits

Author SHA1 Message Date
Clement Verna
93d0eeaf54 Nagios: monitor that resultsdb sends messages on the bus
Signed-off-by: Clement Verna <cverna@tutanota.com>
2019-04-24 11:22:46 +02:00
Kevin Fenzi
1a40dd5142 nagios: drop askbot fedmsg check.
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2019-02-21 15:34:31 +00:00
Clement Verna
ae70d3d6d3 Remove OSBS build check from nagios
Signed-off-by: Clement Verna <cverna@tutanota.com>
2019-02-11 17:59:45 +01:00
Rick Elrod
4c8cf933fc make odcs-backend check for fedmsg-hub-3 instead (infra #7526)
Signed-off-by: Rick Elrod <relrod@redhat.com>
2019-01-28 08:54:46 +00:00
Patrick Uiterwijk
ac65c80b07 Drop the variant-specific config again, hoping for ref=
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org>
2019-01-21 17:49:56 +01:00
Patrick Uiterwijk
3f4bf6db2b Update file for nagios check
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org>
2019-01-21 17:23:41 +01:00
Patrick Uiterwijk
18b0acc8f3 Monitor ostree summary on proxies
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org>
2019-01-21 16:57:26 +01:00
Patrick Uiterwijk
2ded08f111 Add 24-hour check for bodhi compose start
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org>
2018-12-20 20:44:10 +01:00
Stephen Smoogen
cacbb74b61 This change will update monitoring and repoSpanner service
The monitoring needs to see that the service is run by the repoSpanner user.
The service needs to have a larger limit of open files to work.
2018-12-19 13:41:48 +00:00
Stephen Smoogen
5dd7924887 just make it simple and see if it works 2018-12-18 16:24:14 +00:00
Stephen Smoogen
3bbc0031f4 This will add minimal monitoring for repospanner on pkgs01.stg. This only says it is running or not. 2018-12-17 15:44:31 +00:00
Randy Barlow
4422e2bb2d Monitor fedmsg-hub-3 on Bodhi instead of fedmsg-hub.
Signed-off-by: Randy Barlow <randy@electronsweatshop.com>
2018-11-19 21:33:37 +00:00
Randy Barlow
ce86a667b7 Configure check_fedmsg_cp_bodhi_backend02_hub to use fedmsg-hub-3.
Signed-off-by: Randy Barlow <randy@electronsweatshop.com>
2018-11-19 21:29:14 +00:00
Randy Barlow
2911286a3a Rename check_fedmsg_masher_proc to check_fedmsg_composer_proc and have it check fedmsg-hub-3.
Signed-off-by: Randy Barlow <randy@electronsweatshop.com>
2018-11-19 21:20:15 +00:00
Patrick Uiterwijk
3fc57e699b Enable nagios checks for ticketkey, and stop emailing puiterwijk
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org>
2018-10-20 15:36:00 +02:00
Kevin Fenzi
2d7ac321c7 a few tagger stragglers 2018-10-03 17:56:00 +00:00
Kevin Fenzi
935b25decc Up this check to 8 hours. 2018-09-29 20:13:51 +00:00
Patrick Uiterwijk
0854930115 We have no jenkins service anymore
Signed-off-by: Patrick Uiterwijk <puiterwijk@redhat.com>
2018-07-03 02:28:37 +02:00
Miroslav Suchý
a35851b271 check free space on retrace
retrace\'s /srv volume is 9.6TB big. So 5% (warning) is 480GB and 1% (critical) is 96GB
2018-06-18 12:11:31 +02:00
Ralph Bean
ed44e7f0dc Only alert if we haven't seen a greenwave message in 2 days. 2018-05-12 14:20:48 +00:00
Kevin Fenzi
e51b6285a6 Shelve summershum 2018-04-10 21:39:56 +00:00
Clement Verna
3c69f743ba Increase the warning and critical threshold for packages fedmsg backlog
Signed-off-by: Clement Verna <cverna@tutanota.com>
2018-03-08 08:32:00 +01:00
Kevin Fenzi
be38d5fcd9 increase greenwave datanommer fedmsg check from 4 hours to 6 hours as it has been alerting some nights with low build activity 2018-02-28 17:38:20 +00:00
Kevin Fenzi
c8e5316fb7 adjust more 2018-02-07 01:32:17 +00:00
Kevin Fenzi
8a0c1f266f and setup check on mailman01 nrpe side 2018-02-07 01:24:52 +00:00
Kevin Fenzi
0b138f9111 add mdapi and greenwave monitoring. tickets 6639 and 6643 2018-01-19 21:32:19 +00:00
Patrick Uiterwijk
b58aec5fdb Perform mirrorlist cache check against proxies
Signed-off-by: Patrick Uiterwijk <puiterwijk@redhat.com>
2018-01-12 20:37:25 +00:00
Kevin Fenzi
7d265c9bf9 switch openqa machines to alert on disk only when 90% or higher instead of 85% 2017-11-29 21:52:49 +00:00
Jeremy Cline
065d9ff801 Update FMN queues in nagios
Signed-off-by: Jeremy Cline <jeremy@jcline.org>
2017-11-15 19:18:47 +00:00
Patrick Uiterwijk
8355e9fa9f Define dequeue stable file age check
Signed-off-by: Patrick Uiterwijk <puiterwijk@redhat.com>
2017-11-15 03:08:41 +00:00
Ralph Bean
89fa8fa544 Add nagios check for odcs-backend proc. 2017-10-17 14:17:30 +00:00
Kevin Fenzi
62b3fed547 clean up pkgdb nagios checks 2017-09-09 20:59:54 +00:00
Patrick Uiterwijk
8f05121798 The new varnish pkg runs as varnish
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org>
2017-08-02 23:21:45 +02:00
f207778a0e add simple monitoring for pagure's celery redis queue
Signed-off-by: Ricky Elrod <codeblock@fedoraproject.org>
2017-05-26 23:09:02 +00:00
a6867db34f try it here instead
Signed-off-by: Ricky Elrod <codeblock@fedoraproject.org>
2017-05-15 21:30:35 +00:00
Stephen Smoogen
8f89c1bb65 can we put together disks 2017-05-11 21:41:03 +00:00
Patrick Uiterwijk
66b90abbfc PIDFile no longer used, handled by systemd
Signed-off-by: Patrick Uiterwijk <puiterwijk@redhat.com>
2017-04-29 17:31:26 +00:00
Stephen Smoogen
0192cc9dac and we have to let nrpe go out a different ip for a while 2017-04-29 17:17:08 +00:00
Ralph Bean
d2e519607d Revert "Revert "Nagios checks for the mbs-backend fedmsg-hub.""
This reverts commit 7114ae6eeb.
2017-04-07 20:22:57 +00:00
Ralph Bean
7114ae6eeb Revert "Nagios checks for the mbs-backend fedmsg-hub."
Nevermind.  I'll get back to this in an hour or so.

This reverts commit 57251d154c.
2017-04-07 17:10:50 +00:00
Ralph Bean
57251d154c Nagios checks for the mbs-backend fedmsg-hub. 2017-04-07 17:09:09 +00:00
Stephen Smoogen
dfd088ab5e put in many changes for new nagios server 2017-04-06 23:50:44 +00:00
Michael Simacek
b77f5736b4 Koschei: split resolver into build-resolver and repo-resolver 2017-03-02 15:48:46 +01:00
Patrick Uiterwijk
7c6e97cc6f Remove check_haproxy_mirrorlist check. It doesn't work with containerized mirrorlist
Signed-off-by: Patrick Uiterwijk <puiterwijk@redhat.com>
2017-02-18 10:01:17 +00:00
Stephen Smoogen
6bbb48acca You know the very powerful and the very stupid have one thing in common. They don't alter their views to fit the facts. They alter the facts to fit the views. Which can be uncomfortable if you happen to be one of the facts that needs altering. 2017-01-24 20:00:30 +00:00
Stephen Smoogen
c903a64ff7 add in a staging allowance for noc01.stg 2017-01-10 00:09:47 +00:00
Stephen Smoogen
8cf72ff116 put in the first run at new nagios configs 2017-01-05 00:55:16 +00:00
Ralph Bean
0cf7929bac Move nagios client and server into a namespaced role. 2016-02-23 02:35:50 +00:00
Patrick Uiterwijk
9daab76bc6 Only error if the twoweek compose didn't happen for two weeks
Signed-off-by: Patrick Uiterwijk <puiterwijk@redhat.com>
2016-02-19 09:40:22 +00:00
Martin Krizek
6947b5d382 nagios check_testcloud: add client-side definition 2016-02-17 09:59:01 +00:00