Commit graph

497 commits

Author SHA1 Message Date
Rick Elrod
313b2c2988 nagios: more retrace refs
Signed-off-by: Rick Elrod <relrod@redhat.com>
2020-06-20 21:01:41 -05:00
Rick Elrod
ea9dca1a56 nagios: nix packages* for now
Signed-off-by: Rick Elrod <relrod@redhat.com>
2020-06-20 21:01:41 -05:00
Rick Elrod
7b1a407905 nagios: remove ref to rawhide-composer
Signed-off-by: Rick Elrod <relrod@redhat.com>
2020-06-20 21:01:41 -05:00
Rick Elrod
a9ce430cd6 nagios: remove ref to fas01
Signed-off-by: Rick Elrod <relrod@redhat.com>
2020-06-20 21:01:41 -05:00
Rick Elrod
110ff0c648 nagios: comment out retrace stuff for now
Signed-off-by: Rick Elrod <relrod@redhat.com>
2020-06-20 18:39:58 -05:00
Stephen Smoogen
12c486cda3 remove some hosts in inventory file which do not exist in IAD2. 2020-06-14 19:51:27 -04:00
Stephen Smoogen
1b487b34a0 replace some hardcoded phx2 items with hardcoded iad2 items for koji, pdc, and nagios 2020-06-10 07:25:38 -04:00
Stephen Smoogen
bcda2606fe try to make this work in iad2 2020-06-09 10:45:02 -04:00
Stephen Smoogen
7d5ab8fcfd remove rabbitmq from phx2 2020-06-08 15:37:18 -04:00
Stephen Smoogen
e26ead0f70 try and get nagios working on noc01.iad2 2020-06-08 11:17:20 -04:00
Stephen Smoogen
00b7de4753 remove more hardcoded stg points in nagios 2020-06-08 10:33:05 -04:00
Stephen Smoogen
ca39f67ed6 remove _stg so nagios works 2020-06-08 10:29:30 -04:00
Stephen Smoogen
192637532c set up things so nagios in iad2 is mostly ready. 2020-05-21 19:20:38 -04:00
Stephen Smoogen
aa8f90f074 and we remember that j2 files are not cfg files 2020-05-21 16:59:49 -04:00
Stephen Smoogen
794071b256 make mgmt interfaces faster to build 2020-05-21 16:46:41 -04:00
Stephen Smoogen
435095958d move more service groups to static files and use servicegroup definitions in services 2020-05-21 15:47:19 -04:00
Stephen Smoogen
d82e99371c use a different syntax for service groups to clean up phx2 ness 2020-05-21 15:22:48 -04:00
Stephen Smoogen
df9fcb477d move nagios ipa file to template to make less phx2 dependent 2020-05-21 14:57:41 -04:00
Stephen Smoogen
89f91a9642 Clean up nagios to deal with dropped services and that servicegroups can NOT end with a , while every other nagios group can. 2020-05-21 13:22:26 -04:00
Stephen Smoogen
ba1b6c933d ok ping doesnt need to be a template. all.cfg needs a group which says you cant ping it 2020-04-24 21:34:25 +02:00
Stephen Smoogen
a2ba26c5f4 this is ugly but its been a 12 hour day 2020-04-24 21:34:25 +02:00
Stephen Smoogen
14c240a74a clean up so nagios works 2020-04-24 21:34:25 +02:00
Stephen Smoogen
fb3b9ed5c9 and when you have a hammer all the world looks like a nail 2020-04-24 21:34:25 +02:00
Kevin Fenzi
9a255e3c41 nagios_server: try and adjust for all the aws copr instances
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-04-24 21:34:24 +02:00
Rick Elrod
4e71722d8a nagios/fedmsg: make changes yesterday remain py2 backwards compatible
Signed-off-by: Rick Elrod <relrod@redhat.com>
2020-04-24 21:34:22 +02:00
Rick Elrod
40e13bcbf0 python3-ize the fedmsg check scripts, and on py3 boxes make it check a valid socket path
Signed-off-by: Rick Elrod <relrod@redhat.com>
2020-04-24 21:34:21 +02:00
Clement Verna
f4cf99d890 nagios: enable koji wellness again
Signed-off-by: Clement Verna <cverna@tutanota.com>
2020-04-24 21:34:21 +02:00
Clement Verna
bbe84b407c nagios: disable koji wellness plugin to see if it causes the load on koji servers
Signed-off-by: Clement Verna <cverna@tutanota.com>
2020-04-24 21:34:21 +02:00
Clement Verna
a1cab8fc7c Nagios: fix the koji_wellness command definition
Signed-off-by: Clement Verna <cverna@tutanota.com>
2020-04-24 21:34:21 +02:00
Clement Verna
1943a7fc51 Nagios: fix the name of the koji wellness command
Signed-off-by: Clement Verna <cverna@tutanota.com>
2020-04-24 21:34:21 +02:00
Clement Verna
96662a3ac4 Nagios: change the hostname in the koji wellness service.
Fixes https://pagure.io/fedora-infrastructure/issue/6505

Signed-off-by: Clement Verna <cverna@tutanota.com>
2020-04-24 21:34:21 +02:00
Leonardo Rossetti
be5ecd4f16 koji_wellness plugin
Signed-off-by: Leonardo Rossetti <me@lrossetti.com>
2020-04-24 21:34:21 +02:00
Kevin Fenzi
7ef834a64f nagios / haproxy: 1024 is just not enough anymore
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-04-24 21:34:20 +02:00
Clement Verna
aa569ea75b ODCS: replace the process check by a service check.
Signed-off-by: Clement Verna <cverna@tutanota.com>
2020-04-24 21:34:19 +02:00
Stephen Smoogen
202e6a5692 its tricksy.. that hobbitses stored it in a different place 2020-04-24 21:34:19 +02:00
Stephen Smoogen
68ddf2b343 all this nagios stuff assumes python2 which will not work on F30+ 2020-04-24 21:34:19 +02:00
Kevin Fenzi
779fa01877 autocloud: fare well autocloud, you served long and well...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-04-24 21:34:17 +02:00
Stephen Smoogen
997500ce2c remove repospanner files for nagios 2020-04-24 21:34:17 +02:00
Rick Elrod
3d9dbe4e3a Revert "add in karsten hopp patch on ssl.cfg"
This reverts commit 3dd7f3f98aa357f6c3c9461f2154ed612be77ac5.

This is bad for several reasons. 1) It breaks nagios. 2) It
special-cases proxy03.
2020-04-24 21:34:16 +02:00
Rick Elrod
5de6b23fe2 nagios_server: we cannot have two contacts named the same thing...
Signed-off-by: Rick Elrod <relrod@redhat.com>
2020-04-24 21:34:16 +02:00
Stephen Smoogen
123e36f38e please remember to do a git add before commit so you can get the goodness you wrote 2020-04-24 21:34:16 +02:00
Stephen Smoogen
41fe0ec74e add in nagios patches from karsten 2020-04-24 21:34:16 +02:00
Stephen Smoogen
0d287fce63 add in karsten hopp patch on ssl.cfg 2020-04-24 21:34:16 +02:00
Kevin Fenzi
a8c8df48de bugyou: retire service and all it's parts
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-04-24 21:34:14 +02:00
Rick Elrod
534fa31934 nagios: check sigul bridge proc
Signed-off-by: Rick Elrod <relrod@redhat.com>
2020-04-24 21:34:14 +02:00
Clement Verna
34b9935c8e Nagios: add a check for rpm.sign message (warning : 1h, alert: 1h30)
Signed-off-by: Clement Verna <cverna@tutanota.com>
2020-04-24 21:34:14 +02:00
Clement Verna
9ef75bdb24 ODCS: check for odcs_celery_backend instead of fedmsg-hub-3 in nagios
Signed-off-by: Clement Verna <cverna@tutanota.com>
2020-04-24 21:34:14 +02:00
Kevin Fenzi
53db2ac629 nagios: check_lock is python2
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-04-24 21:34:14 +02:00
Stephen Smoogen
54c31f65c3 comment out people in nagios to make sure we arent sending texts to dead emails 2020-04-24 21:34:13 +02:00
Mikolaj Izdebski
87896586da nagios_server: Remove koschei backend services 2020-04-24 21:34:11 +02:00