Mikolaj Izdebski
a1c728f8a8
Revert "nagios_server: Check Koschei pod count instead of processes"
...
This reverts commit a0474d9c3687bd144d5a890e8f8c802486299947.
This reverts commit 803af9c9cb3456d7440695ddf8c51990b002c6d4.
2020-04-24 21:34:11 +02:00
Mikolaj Izdebski
98ecbf6f50
nagios_server: Add missing arguments to check_openshift_objects
2020-04-24 21:34:11 +02:00
Mikolaj Izdebski
ec6c5cab14
nagios_server: Check Koschei pod count instead of processes
2020-04-24 21:34:11 +02:00
Stephen Smoogen
e1abdfc8bd
[nagios] remember to remove coloamer from nagios
2020-04-24 21:34:11 +02:00
Kevin Fenzi
e9eeb721ce
old cloud cleanup: remove magazine from another place.
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-04-24 21:34:08 +02:00
Aurélien Bompard
d594733771
More RabbitMQ monitoring!
...
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2020-04-24 21:34:07 +02:00
Aurélien Bompard
963ff9586b
Add more RabbitMQ checks
...
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2020-04-24 21:34:07 +02:00
Aurélien Bompard
7c2377748e
NRPE does not accept arguments
...
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2020-04-24 21:34:07 +02:00
Aurélien Bompard
34a8869f9f
Fix RabbitMQ service name
...
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2020-04-24 21:34:07 +02:00
Aurélien Bompard
e7ab522de3
Add a RabbitMQ check on the cluster
...
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2020-04-24 21:34:07 +02:00
Rick Elrod
c31829a561
nagios: Try a new way of doing the raid check, so we can check extra hosts like autocloud-backend-libvirt2
...
Signed-off-by: Rick Elrod <relrod@redhat.com>
2019-07-18 19:31:04 +00:00
Patrick Uiterwijk
018065edab
Remove puiterwijk from nagios notifs
...
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org>
2019-06-30 11:54:07 +02:00
Clement Verna
bb24183f46
oci-registry: Update nagios to monitor the correct directory for disk space
...
Signed-off-by: Clement Verna <cverna@tutanota.com>
2019-05-30 20:06:56 +02:00
Stephen Smoogen
40a819e1d5
[nagios/datanommer] this is what happens when you have 2 files which are supposeldy the same file. You edit one in nagios_server and miss the one in nagios_client. Bad nagios. Bad
2019-05-30 16:56:26 +00:00
Randy Barlow
4cf1624c76
bodhi: Upgrade production to Bodhi 4.0.0.
...
Signed-off-by: Randy Barlow <randy@electronsweatshop.com>
2019-05-28 15:58:52 +00:00
Stephen Smoogen
b2599f8d2f
[nagios] try ang get groups working
2019-05-23 23:34:41 +00:00
Stephen Smoogen
af3def70a1
overzealous _ in group name for nagios group
2019-05-23 23:16:18 +00:00
Kevin Fenzi
52bb723fee
nagios_server: fix another _ case in group name
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2019-05-22 18:08:04 +00:00
Kevin Fenzi
15ebdb5233
nagios_server: fix autocloud-backend group to use _
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2019-05-22 17:24:40 +00:00
Patrick Uiterwijk
9b7882313f
More nagios fixes
...
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org>
2019-05-22 17:17:21 +02:00
Patrick Uiterwijk
63fe73c878
More nagios file fixes
...
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org>
2019-05-22 17:15:11 +02:00
Patrick Uiterwijk
c276097e32
nagios: fix smtp-mm
...
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org>
2019-05-22 17:13:55 +02:00
Patrick Uiterwijk
c14b702513
Continue fixing nagios group names
...
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org>
2019-05-22 17:12:23 +02:00
Patrick Uiterwijk
ab9cd48efe
nagios_server: Remove fedmsg checks from hotness01
...
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org>
2019-05-22 16:40:56 +02:00
Clement Verna
93d0eeaf54
Nagios: monitor that resultsdb sends messages on the bus
...
Signed-off-by: Clement Verna <cverna@tutanota.com>
2019-04-24 11:22:46 +02:00
Miroslav Suchý
1a1ca033b6
sent Copr nagios notifications to frostyx too
2019-04-01 10:13:00 +02:00
Miroslav Suchý
b7394d1c54
sent notifications to msuchy@ as clime@ does not work now
2019-04-01 10:10:33 +02:00
Stephen Smoogen
d9d24d08d9
[nagios_server] Add in certgetter test.
...
This was offered by Alessandro Lorenzi <alorenzi@alorenzi.eu> as a fix
to deal with our inability to monitor the certgetter after
reboots. Thank you very much for this work.
Signed-off-by: Stephen Smoogen <smooge@redhat.com>
2019-03-20 20:17:43 +00:00
Kevin Fenzi
1a40dd5142
nagios: drop askbot fedmsg check.
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2019-02-21 15:34:31 +00:00
Kevin Fenzi
a6cceb3599
nagios: drop remnant of check_osbs_builds
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2019-02-21 15:23:54 +00:00
Stephen Smoogen
06cc8ca030
and we have no bodhost
2019-02-14 21:11:50 +00:00
Rick Elrod
0b7bb3b5b3
prep for proxy03 move
...
Signed-off-by: Rick Elrod <relrod@redhat.com>
2019-02-11 23:14:27 +00:00
Rick Elrod
4c8cf933fc
make odcs-backend check for fedmsg-hub-3 instead (infra #7526 )
...
Signed-off-by: Rick Elrod <relrod@redhat.com>
2019-01-28 08:54:46 +00:00
Patrick Uiterwijk
18b0acc8f3
Monitor ostree summary on proxies
...
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org>
2019-01-21 16:57:26 +01:00
Patrick Uiterwijk
2ded08f111
Add 24-hour check for bodhi compose start
...
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org>
2018-12-20 20:44:10 +01:00
Stephen Smoogen
6c1357ff59
repospanner is only running on pkgs01 currently
2018-12-17 18:25:27 +00:00
Stephen Smoogen
0819f469c0
this should allow noc01 to see nrpe commands
2018-12-17 16:42:16 +00:00
Stephen Smoogen
3bbc0031f4
This will add minimal monitoring for repospanner on pkgs01.stg. This only says it is running or not.
2018-12-17 15:44:31 +00:00
Kevin Fenzi
4125997ecc
fix the check_supybot_plugin to listen only for zodbot privmsg, not frigg
2018-12-15 20:52:54 +00:00
Randy Barlow
2911286a3a
Rename check_fedmsg_masher_proc to check_fedmsg_composer_proc and have it check fedmsg-hub-3.
...
Signed-off-by: Randy Barlow <randy@electronsweatshop.com>
2018-11-19 21:20:15 +00:00
Patrick Uiterwijk
505d8bbf8c
Set COPR backend to send notifs to copr
...
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org>
2018-10-20 19:27:40 +02:00
Patrick Uiterwijk
5fa3c6e53d
Add contact info for clime
...
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org>
2018-10-20 19:27:40 +02:00
Patrick Uiterwijk
3fc57e699b
Enable nagios checks for ticketkey, and stop emailing puiterwijk
...
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org>
2018-10-20 15:36:00 +02:00
Kevin Fenzi
6cefdc1d5b
another place anitya is
2018-10-07 20:04:40 +00:00
Kevin Fenzi
b2ff9078f2
Clean up non openshift anitya in favor of openshift version.
2018-10-06 00:11:10 +00:00
Kevin Fenzi
2d7ac321c7
a few tagger stragglers
2018-10-03 17:56:00 +00:00
Kevin Fenzi
6c3dc368cd
pkgs is now a letsencrypt cert, so we do not need to monitor it for 60 days
2018-08-22 00:00:21 +00:00
Kevin Fenzi
63079521a4
drop autoqa from nagios too
2018-07-31 17:46:54 +00:00
Kevin Fenzi
5149f16b73
kill pkgdb from nagios server
2018-07-20 04:54:23 +00:00
Patrick Uiterwijk
0854930115
We have no jenkins service anymore
...
Signed-off-by: Patrick Uiterwijk <puiterwijk@redhat.com>
2018-07-03 02:28:37 +02:00