Commit graph

497 commits

Author SHA1 Message Date
Mikolaj Izdebski
a1c728f8a8 Revert "nagios_server: Check Koschei pod count instead of processes"
This reverts commit a0474d9c3687bd144d5a890e8f8c802486299947.
This reverts commit 803af9c9cb3456d7440695ddf8c51990b002c6d4.
2020-04-24 21:34:11 +02:00
Mikolaj Izdebski
98ecbf6f50 nagios_server: Add missing arguments to check_openshift_objects 2020-04-24 21:34:11 +02:00
Mikolaj Izdebski
ec6c5cab14 nagios_server: Check Koschei pod count instead of processes 2020-04-24 21:34:11 +02:00
Stephen Smoogen
e1abdfc8bd [nagios] remember to remove coloamer from nagios 2020-04-24 21:34:11 +02:00
Kevin Fenzi
e9eeb721ce old cloud cleanup: remove magazine from another place.
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-04-24 21:34:08 +02:00
Aurélien Bompard
d594733771 More RabbitMQ monitoring!
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2020-04-24 21:34:07 +02:00
Aurélien Bompard
963ff9586b Add more RabbitMQ checks
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2020-04-24 21:34:07 +02:00
Aurélien Bompard
7c2377748e NRPE does not accept arguments
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2020-04-24 21:34:07 +02:00
Aurélien Bompard
34a8869f9f Fix RabbitMQ service name
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2020-04-24 21:34:07 +02:00
Aurélien Bompard
e7ab522de3 Add a RabbitMQ check on the cluster
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2020-04-24 21:34:07 +02:00
Rick Elrod
c31829a561 nagios: Try a new way of doing the raid check, so we can check extra hosts like autocloud-backend-libvirt2
Signed-off-by: Rick Elrod <relrod@redhat.com>
2019-07-18 19:31:04 +00:00
Patrick Uiterwijk
018065edab Remove puiterwijk from nagios notifs
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org>
2019-06-30 11:54:07 +02:00
Clement Verna
bb24183f46 oci-registry: Update nagios to monitor the correct directory for disk space
Signed-off-by: Clement Verna <cverna@tutanota.com>
2019-05-30 20:06:56 +02:00
Stephen Smoogen
40a819e1d5 [nagios/datanommer] this is what happens when you have 2 files which are supposeldy the same file. You edit one in nagios_server and miss the one in nagios_client. Bad nagios. Bad 2019-05-30 16:56:26 +00:00
Randy Barlow
4cf1624c76 bodhi: Upgrade production to Bodhi 4.0.0.
Signed-off-by: Randy Barlow <randy@electronsweatshop.com>
2019-05-28 15:58:52 +00:00
Stephen Smoogen
b2599f8d2f [nagios] try ang get groups working 2019-05-23 23:34:41 +00:00
Stephen Smoogen
af3def70a1 overzealous _ in group name for nagios group 2019-05-23 23:16:18 +00:00
Kevin Fenzi
52bb723fee nagios_server: fix another _ case in group name
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2019-05-22 18:08:04 +00:00
Kevin Fenzi
15ebdb5233 nagios_server: fix autocloud-backend group to use _
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2019-05-22 17:24:40 +00:00
Patrick Uiterwijk
9b7882313f More nagios fixes
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org>
2019-05-22 17:17:21 +02:00
Patrick Uiterwijk
63fe73c878 More nagios file fixes
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org>
2019-05-22 17:15:11 +02:00
Patrick Uiterwijk
c276097e32 nagios: fix smtp-mm
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org>
2019-05-22 17:13:55 +02:00
Patrick Uiterwijk
c14b702513 Continue fixing nagios group names
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org>
2019-05-22 17:12:23 +02:00
Patrick Uiterwijk
ab9cd48efe nagios_server: Remove fedmsg checks from hotness01
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org>
2019-05-22 16:40:56 +02:00
Clement Verna
93d0eeaf54 Nagios: monitor that resultsdb sends messages on the bus
Signed-off-by: Clement Verna <cverna@tutanota.com>
2019-04-24 11:22:46 +02:00
Miroslav Suchý
1a1ca033b6 sent Copr nagios notifications to frostyx too 2019-04-01 10:13:00 +02:00
Miroslav Suchý
b7394d1c54 sent notifications to msuchy@ as clime@ does not work now 2019-04-01 10:10:33 +02:00
Stephen Smoogen
d9d24d08d9 [nagios_server] Add in certgetter test.
This was offered by Alessandro Lorenzi <alorenzi@alorenzi.eu> as a fix
to deal with our inability to monitor the certgetter after
reboots. Thank you very much for this work.

Signed-off-by: Stephen Smoogen <smooge@redhat.com>
2019-03-20 20:17:43 +00:00
Kevin Fenzi
1a40dd5142 nagios: drop askbot fedmsg check.
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2019-02-21 15:34:31 +00:00
Kevin Fenzi
a6cceb3599 nagios: drop remnant of check_osbs_builds
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2019-02-21 15:23:54 +00:00
Stephen Smoogen
06cc8ca030 and we have no bodhost 2019-02-14 21:11:50 +00:00
Rick Elrod
0b7bb3b5b3 prep for proxy03 move
Signed-off-by: Rick Elrod <relrod@redhat.com>
2019-02-11 23:14:27 +00:00
Rick Elrod
4c8cf933fc make odcs-backend check for fedmsg-hub-3 instead (infra #7526)
Signed-off-by: Rick Elrod <relrod@redhat.com>
2019-01-28 08:54:46 +00:00
Patrick Uiterwijk
18b0acc8f3 Monitor ostree summary on proxies
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org>
2019-01-21 16:57:26 +01:00
Patrick Uiterwijk
2ded08f111 Add 24-hour check for bodhi compose start
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org>
2018-12-20 20:44:10 +01:00
Stephen Smoogen
6c1357ff59 repospanner is only running on pkgs01 currently 2018-12-17 18:25:27 +00:00
Stephen Smoogen
0819f469c0 this should allow noc01 to see nrpe commands 2018-12-17 16:42:16 +00:00
Stephen Smoogen
3bbc0031f4 This will add minimal monitoring for repospanner on pkgs01.stg. This only says it is running or not. 2018-12-17 15:44:31 +00:00
Kevin Fenzi
4125997ecc fix the check_supybot_plugin to listen only for zodbot privmsg, not frigg 2018-12-15 20:52:54 +00:00
Randy Barlow
2911286a3a Rename check_fedmsg_masher_proc to check_fedmsg_composer_proc and have it check fedmsg-hub-3.
Signed-off-by: Randy Barlow <randy@electronsweatshop.com>
2018-11-19 21:20:15 +00:00
Patrick Uiterwijk
505d8bbf8c Set COPR backend to send notifs to copr
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org>
2018-10-20 19:27:40 +02:00
Patrick Uiterwijk
5fa3c6e53d Add contact info for clime
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org>
2018-10-20 19:27:40 +02:00
Patrick Uiterwijk
3fc57e699b Enable nagios checks for ticketkey, and stop emailing puiterwijk
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org>
2018-10-20 15:36:00 +02:00
Kevin Fenzi
6cefdc1d5b another place anitya is 2018-10-07 20:04:40 +00:00
Kevin Fenzi
b2ff9078f2 Clean up non openshift anitya in favor of openshift version. 2018-10-06 00:11:10 +00:00
Kevin Fenzi
2d7ac321c7 a few tagger stragglers 2018-10-03 17:56:00 +00:00
Kevin Fenzi
6c3dc368cd pkgs is now a letsencrypt cert, so we do not need to monitor it for 60 days 2018-08-22 00:00:21 +00:00
Kevin Fenzi
63079521a4 drop autoqa from nagios too 2018-07-31 17:46:54 +00:00
Kevin Fenzi
5149f16b73 kill pkgdb from nagios server 2018-07-20 04:54:23 +00:00
Patrick Uiterwijk
0854930115 We have no jenkins service anymore
Signed-off-by: Patrick Uiterwijk <puiterwijk@redhat.com>
2018-07-03 02:28:37 +02:00