Commit graph

467 commits

Author SHA1 Message Date
seddikalaouiismaili
890dd31cb0 script to monitor systemd units on pagure 2021-02-12 11:34:57 +00:00
Kevin Fenzi
25ace56df7 pagure.io / nagios: check only that cert is valid for 25 days
We renew letsencrypt certs at 30 days, so checking at 60 is pointless.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2021-02-02 14:24:07 -08:00
Kevin Fenzi
a74b4015e7 nagios: contacts
Clean up a bunch of old contacts that no longer are around
or care about getting alerts from our nagios.

Add readme file that notes that this information is public and
people should use a filtered email address for this purpose and avoid
adding sensitive information like phone numbers.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-10-28 11:52:24 -07:00
Kevin Fenzi
71c650baff nagios / server: drop checking for fas fedmsgs, they likely wont be back
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-10-05 17:21:08 -07:00
Kevin Fenzi
f650eab7ee nagios_server / fedmsg: pkgs01 does not run any fedmsg-hub anymore.
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-10-05 17:00:15 -07:00
Kevin Fenzi
bb61f0da99 nagios / server: don't try and check mincheck group rsyslog
We want to make sure rsyslog is running on hosts, but the mincheck
hostss are ones we don't do any nrpe checks on, so we should exclude
them from this. This is like builders or aws hosts.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-10-02 12:56:49 -07:00
seddikalaouiismaili
e785293064 add check for rsyslogd 2020-10-02 18:50:29 +00:00
Mark O'Brien
5fe015a90a nagios server plugins: port to py3 2020-10-02 18:46:32 +00:00
Pierre-Yves Chibon
9506631012 pagure: replace pagure01 by pagure02
Signed-off-by: Pierre-Yves Chibon <pingou@pingoured.fr>
2020-10-01 16:09:14 +02:00
Mark O'Brien
b2073703e5 [nagios] add back in strp accidentally removed 2020-09-25 14:11:10 +00:00
Mark O'Brien
95eb7c75d3 [nagios] port haproxy connections script to py3 2020-09-25 14:11:10 +00:00
Kevin Fenzi
eed8859c64 pdc-backend: clean up last bits of pdc-backend hosts.
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-08-24 09:32:04 -07:00
Rick Elrod
dcc53bd63b add crl check to nagios + nrpe + facl perms for nrpe
Signed-off-by: Rick Elrod <relrod@redhat.com>
2020-08-06 15:32:09 -05:00
Francois Andrieu
e1b6248a4c nagios: add check_postfix_redhat to bastion01 2020-07-22 19:41:11 +00:00
Kevin Fenzi
349dec197c nagios_seever/ irc colorize: 2to3 run to move to python3
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-07-01 16:45:28 -07:00
Kevin Fenzi
9770bae604 nagios_server: use iad2-mgmt-http.cfg
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-06-30 16:13:26 -07:00
Kevin Fenzi
9d9d7f6c5c nagios_server: more adjustments, drop fas for now, fix gateway hosts harder
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-06-30 15:59:32 -07:00
Kevin Fenzi
88ab378bba nagios_server: drop phx2_internal stuff, fix mailman01 to use iad2
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-06-30 14:40:14 -07:00
Rick Elrod
d9a23d9930 nagios: nix basset checks for now
Signed-off-by: Rick Elrod <relrod@redhat.com>
2020-06-20 21:01:41 -05:00
Rick Elrod
f6c5bac836 nagios: comment out qahardware ref for now
Signed-off-by: Rick Elrod <relrod@redhat.com>
2020-06-20 21:01:41 -05:00
Rick Elrod
313b2c2988 nagios: more retrace refs
Signed-off-by: Rick Elrod <relrod@redhat.com>
2020-06-20 21:01:41 -05:00
Rick Elrod
ea9dca1a56 nagios: nix packages* for now
Signed-off-by: Rick Elrod <relrod@redhat.com>
2020-06-20 21:01:41 -05:00
Rick Elrod
7b1a407905 nagios: remove ref to rawhide-composer
Signed-off-by: Rick Elrod <relrod@redhat.com>
2020-06-20 21:01:41 -05:00
Rick Elrod
a9ce430cd6 nagios: remove ref to fas01
Signed-off-by: Rick Elrod <relrod@redhat.com>
2020-06-20 21:01:41 -05:00
Rick Elrod
110ff0c648 nagios: comment out retrace stuff for now
Signed-off-by: Rick Elrod <relrod@redhat.com>
2020-06-20 18:39:58 -05:00
Stephen Smoogen
12c486cda3 remove some hosts in inventory file which do not exist in IAD2. 2020-06-14 19:51:27 -04:00
Stephen Smoogen
1b487b34a0 replace some hardcoded phx2 items with hardcoded iad2 items for koji, pdc, and nagios 2020-06-10 07:25:38 -04:00
Stephen Smoogen
bcda2606fe try to make this work in iad2 2020-06-09 10:45:02 -04:00
Stephen Smoogen
7d5ab8fcfd remove rabbitmq from phx2 2020-06-08 15:37:18 -04:00
Stephen Smoogen
e26ead0f70 try and get nagios working on noc01.iad2 2020-06-08 11:17:20 -04:00
Stephen Smoogen
00b7de4753 remove more hardcoded stg points in nagios 2020-06-08 10:33:05 -04:00
Stephen Smoogen
ca39f67ed6 remove _stg so nagios works 2020-06-08 10:29:30 -04:00
Stephen Smoogen
192637532c set up things so nagios in iad2 is mostly ready. 2020-05-21 19:20:38 -04:00
Stephen Smoogen
aa8f90f074 and we remember that j2 files are not cfg files 2020-05-21 16:59:49 -04:00
Stephen Smoogen
794071b256 make mgmt interfaces faster to build 2020-05-21 16:46:41 -04:00
Stephen Smoogen
435095958d move more service groups to static files and use servicegroup definitions in services 2020-05-21 15:47:19 -04:00
Stephen Smoogen
d82e99371c use a different syntax for service groups to clean up phx2 ness 2020-05-21 15:22:48 -04:00
Stephen Smoogen
df9fcb477d move nagios ipa file to template to make less phx2 dependent 2020-05-21 14:57:41 -04:00
Stephen Smoogen
89f91a9642 Clean up nagios to deal with dropped services and that servicegroups can NOT end with a , while every other nagios group can. 2020-05-21 13:22:26 -04:00
Stephen Smoogen
ba1b6c933d ok ping doesnt need to be a template. all.cfg needs a group which says you cant ping it 2020-04-24 21:34:25 +02:00
Stephen Smoogen
a2ba26c5f4 this is ugly but its been a 12 hour day 2020-04-24 21:34:25 +02:00
Stephen Smoogen
14c240a74a clean up so nagios works 2020-04-24 21:34:25 +02:00
Stephen Smoogen
fb3b9ed5c9 and when you have a hammer all the world looks like a nail 2020-04-24 21:34:25 +02:00
Kevin Fenzi
9a255e3c41 nagios_server: try and adjust for all the aws copr instances
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-04-24 21:34:24 +02:00
Rick Elrod
4e71722d8a nagios/fedmsg: make changes yesterday remain py2 backwards compatible
Signed-off-by: Rick Elrod <relrod@redhat.com>
2020-04-24 21:34:22 +02:00
Rick Elrod
40e13bcbf0 python3-ize the fedmsg check scripts, and on py3 boxes make it check a valid socket path
Signed-off-by: Rick Elrod <relrod@redhat.com>
2020-04-24 21:34:21 +02:00
Clement Verna
f4cf99d890 nagios: enable koji wellness again
Signed-off-by: Clement Verna <cverna@tutanota.com>
2020-04-24 21:34:21 +02:00
Clement Verna
bbe84b407c nagios: disable koji wellness plugin to see if it causes the load on koji servers
Signed-off-by: Clement Verna <cverna@tutanota.com>
2020-04-24 21:34:21 +02:00
Clement Verna
a1cab8fc7c Nagios: fix the koji_wellness command definition
Signed-off-by: Clement Verna <cverna@tutanota.com>
2020-04-24 21:34:21 +02:00
Clement Verna
1943a7fc51 Nagios: fix the name of the koji wellness command
Signed-off-by: Clement Verna <cverna@tutanota.com>
2020-04-24 21:34:21 +02:00