Commit graph

415 commits

Author SHA1 Message Date
Kevin Fenzi
7ef834a64f nagios / haproxy: 1024 is just not enough anymore
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-04-24 21:34:20 +02:00
Clement Verna
aa569ea75b ODCS: replace the process check by a service check.
Signed-off-by: Clement Verna <cverna@tutanota.com>
2020-04-24 21:34:19 +02:00
Stephen Smoogen
202e6a5692 its tricksy.. that hobbitses stored it in a different place 2020-04-24 21:34:19 +02:00
Stephen Smoogen
68ddf2b343 all this nagios stuff assumes python2 which will not work on F30+ 2020-04-24 21:34:19 +02:00
Kevin Fenzi
779fa01877 autocloud: fare well autocloud, you served long and well...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-04-24 21:34:17 +02:00
Stephen Smoogen
997500ce2c remove repospanner files for nagios 2020-04-24 21:34:17 +02:00
Rick Elrod
3d9dbe4e3a Revert "add in karsten hopp patch on ssl.cfg"
This reverts commit 3dd7f3f98aa357f6c3c9461f2154ed612be77ac5.

This is bad for several reasons. 1) It breaks nagios. 2) It
special-cases proxy03.
2020-04-24 21:34:16 +02:00
Rick Elrod
5de6b23fe2 nagios_server: we cannot have two contacts named the same thing...
Signed-off-by: Rick Elrod <relrod@redhat.com>
2020-04-24 21:34:16 +02:00
Stephen Smoogen
123e36f38e please remember to do a git add before commit so you can get the goodness you wrote 2020-04-24 21:34:16 +02:00
Stephen Smoogen
41fe0ec74e add in nagios patches from karsten 2020-04-24 21:34:16 +02:00
Stephen Smoogen
0d287fce63 add in karsten hopp patch on ssl.cfg 2020-04-24 21:34:16 +02:00
Kevin Fenzi
a8c8df48de bugyou: retire service and all it's parts
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-04-24 21:34:14 +02:00
Rick Elrod
534fa31934 nagios: check sigul bridge proc
Signed-off-by: Rick Elrod <relrod@redhat.com>
2020-04-24 21:34:14 +02:00
Clement Verna
34b9935c8e Nagios: add a check for rpm.sign message (warning : 1h, alert: 1h30)
Signed-off-by: Clement Verna <cverna@tutanota.com>
2020-04-24 21:34:14 +02:00
Clement Verna
9ef75bdb24 ODCS: check for odcs_celery_backend instead of fedmsg-hub-3 in nagios
Signed-off-by: Clement Verna <cverna@tutanota.com>
2020-04-24 21:34:14 +02:00
Kevin Fenzi
53db2ac629 nagios: check_lock is python2
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-04-24 21:34:14 +02:00
Stephen Smoogen
54c31f65c3 comment out people in nagios to make sure we arent sending texts to dead emails 2020-04-24 21:34:13 +02:00
Mikolaj Izdebski
87896586da nagios_server: Remove koschei backend services 2020-04-24 21:34:11 +02:00
Mikolaj Izdebski
a1c728f8a8 Revert "nagios_server: Check Koschei pod count instead of processes"
This reverts commit a0474d9c3687bd144d5a890e8f8c802486299947.
This reverts commit 803af9c9cb3456d7440695ddf8c51990b002c6d4.
2020-04-24 21:34:11 +02:00
Mikolaj Izdebski
98ecbf6f50 nagios_server: Add missing arguments to check_openshift_objects 2020-04-24 21:34:11 +02:00
Mikolaj Izdebski
ec6c5cab14 nagios_server: Check Koschei pod count instead of processes 2020-04-24 21:34:11 +02:00
Stephen Smoogen
e1abdfc8bd [nagios] remember to remove coloamer from nagios 2020-04-24 21:34:11 +02:00
Kevin Fenzi
e9eeb721ce old cloud cleanup: remove magazine from another place.
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-04-24 21:34:08 +02:00
Aurélien Bompard
d594733771 More RabbitMQ monitoring!
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2020-04-24 21:34:07 +02:00
Aurélien Bompard
963ff9586b Add more RabbitMQ checks
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2020-04-24 21:34:07 +02:00
Aurélien Bompard
7c2377748e NRPE does not accept arguments
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2020-04-24 21:34:07 +02:00
Aurélien Bompard
34a8869f9f Fix RabbitMQ service name
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2020-04-24 21:34:07 +02:00
Aurélien Bompard
e7ab522de3 Add a RabbitMQ check on the cluster
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2020-04-24 21:34:07 +02:00
Rick Elrod
c31829a561 nagios: Try a new way of doing the raid check, so we can check extra hosts like autocloud-backend-libvirt2
Signed-off-by: Rick Elrod <relrod@redhat.com>
2019-07-18 19:31:04 +00:00
Patrick Uiterwijk
018065edab Remove puiterwijk from nagios notifs
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org>
2019-06-30 11:54:07 +02:00
Clement Verna
bb24183f46 oci-registry: Update nagios to monitor the correct directory for disk space
Signed-off-by: Clement Verna <cverna@tutanota.com>
2019-05-30 20:06:56 +02:00
Stephen Smoogen
40a819e1d5 [nagios/datanommer] this is what happens when you have 2 files which are supposeldy the same file. You edit one in nagios_server and miss the one in nagios_client. Bad nagios. Bad 2019-05-30 16:56:26 +00:00
Randy Barlow
4cf1624c76 bodhi: Upgrade production to Bodhi 4.0.0.
Signed-off-by: Randy Barlow <randy@electronsweatshop.com>
2019-05-28 15:58:52 +00:00
Stephen Smoogen
b2599f8d2f [nagios] try ang get groups working 2019-05-23 23:34:41 +00:00
Stephen Smoogen
af3def70a1 overzealous _ in group name for nagios group 2019-05-23 23:16:18 +00:00
Kevin Fenzi
52bb723fee nagios_server: fix another _ case in group name
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2019-05-22 18:08:04 +00:00
Kevin Fenzi
15ebdb5233 nagios_server: fix autocloud-backend group to use _
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2019-05-22 17:24:40 +00:00
Patrick Uiterwijk
9b7882313f More nagios fixes
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org>
2019-05-22 17:17:21 +02:00
Patrick Uiterwijk
63fe73c878 More nagios file fixes
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org>
2019-05-22 17:15:11 +02:00
Patrick Uiterwijk
c276097e32 nagios: fix smtp-mm
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org>
2019-05-22 17:13:55 +02:00
Patrick Uiterwijk
c14b702513 Continue fixing nagios group names
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org>
2019-05-22 17:12:23 +02:00
Patrick Uiterwijk
ab9cd48efe nagios_server: Remove fedmsg checks from hotness01
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org>
2019-05-22 16:40:56 +02:00
Clement Verna
93d0eeaf54 Nagios: monitor that resultsdb sends messages on the bus
Signed-off-by: Clement Verna <cverna@tutanota.com>
2019-04-24 11:22:46 +02:00
Miroslav Suchý
1a1ca033b6 sent Copr nagios notifications to frostyx too 2019-04-01 10:13:00 +02:00
Miroslav Suchý
b7394d1c54 sent notifications to msuchy@ as clime@ does not work now 2019-04-01 10:10:33 +02:00
Stephen Smoogen
d9d24d08d9 [nagios_server] Add in certgetter test.
This was offered by Alessandro Lorenzi <alorenzi@alorenzi.eu> as a fix
to deal with our inability to monitor the certgetter after
reboots. Thank you very much for this work.

Signed-off-by: Stephen Smoogen <smooge@redhat.com>
2019-03-20 20:17:43 +00:00
Kevin Fenzi
1a40dd5142 nagios: drop askbot fedmsg check.
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2019-02-21 15:34:31 +00:00
Kevin Fenzi
a6cceb3599 nagios: drop remnant of check_osbs_builds
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2019-02-21 15:23:54 +00:00
Stephen Smoogen
06cc8ca030 and we have no bodhost 2019-02-14 21:11:50 +00:00
Rick Elrod
0b7bb3b5b3 prep for proxy03 move
Signed-off-by: Rick Elrod <relrod@redhat.com>
2019-02-11 23:14:27 +00:00