seddikalaouiismaili
890dd31cb0
script to monitor systemd units on pagure
2021-02-12 11:34:57 +00:00
Kevin Fenzi
25ace56df7
pagure.io / nagios: check only that cert is valid for 25 days
...
We renew letsencrypt certs at 30 days, so checking at 60 is pointless.
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2021-02-02 14:24:07 -08:00
Kevin Fenzi
a74b4015e7
nagios: contacts
...
Clean up a bunch of old contacts that no longer are around
or care about getting alerts from our nagios.
Add readme file that notes that this information is public and
people should use a filtered email address for this purpose and avoid
adding sensitive information like phone numbers.
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-10-28 11:52:24 -07:00
Kevin Fenzi
71c650baff
nagios / server: drop checking for fas fedmsgs, they likely won't be back
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-10-05 17:21:08 -07:00
Kevin Fenzi
f650eab7ee
nagios_server / fedmsg: pkgs01 does not run any fedmsg-hub anymore.
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-10-05 17:00:15 -07:00
Kevin Fenzi
bb61f0da99
nagios / server: don't try and check mincheck group rsyslog
...
We want to make sure rsyslog is running on hosts, but the mincheck
hosts are ones we don't do any nrpe checks on, so we should exclude
them from this. These are hosts like builders or aws hosts.
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-10-02 12:56:49 -07:00
seddikalaouiismaili
e785293064
add check for rsyslogd
2020-10-02 18:50:29 +00:00
Mark O'Brien
5fe015a90a
nagios server plugins: port to py3
2020-10-02 18:46:32 +00:00
Pierre-Yves Chibon
9506631012
pagure: replace pagure01 by pagure02
...
Signed-off-by: Pierre-Yves Chibon <pingou@pingoured.fr>
2020-10-01 16:09:14 +02:00
Mark O'Brien
b2073703e5
[nagios] add back in strp accidentally removed
2020-09-25 14:11:10 +00:00
Mark O'Brien
95eb7c75d3
[nagios] port haproxy connections script to py3
2020-09-25 14:11:10 +00:00
Kevin Fenzi
eed8859c64
pdc-backend: clean up last bits of pdc-backend hosts.
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-08-24 09:32:04 -07:00
Rick Elrod
dcc53bd63b
add crl check to nagios + nrpe + facl perms for nrpe
...
Signed-off-by: Rick Elrod <relrod@redhat.com>
2020-08-06 15:32:09 -05:00
Francois Andrieu
e1b6248a4c
nagios: add check_postfix_redhat to bastion01
2020-07-22 19:41:11 +00:00
Kevin Fenzi
349dec197c
nagios_server / irc colorize: 2to3 run to move to python3
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-07-01 16:45:28 -07:00
Kevin Fenzi
9770bae604
nagios_server: use iad2-mgmt-http.cfg
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-06-30 16:13:26 -07:00
Kevin Fenzi
9d9d7f6c5c
nagios_server: more adjustments, drop fas for now, fix gateway hosts harder
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-06-30 15:59:32 -07:00
Kevin Fenzi
88ab378bba
nagios_server: drop phx2_internal stuff, fix mailman01 to use iad2
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-06-30 14:40:14 -07:00
Rick Elrod
d9a23d9930
nagios: nix basset checks for now
...
Signed-off-by: Rick Elrod <relrod@redhat.com>
2020-06-20 21:01:41 -05:00
Rick Elrod
f6c5bac836
nagios: comment out qahardware ref for now
...
Signed-off-by: Rick Elrod <relrod@redhat.com>
2020-06-20 21:01:41 -05:00
Rick Elrod
313b2c2988
nagios: more retrace refs
...
Signed-off-by: Rick Elrod <relrod@redhat.com>
2020-06-20 21:01:41 -05:00
Rick Elrod
ea9dca1a56
nagios: nix packages* for now
...
Signed-off-by: Rick Elrod <relrod@redhat.com>
2020-06-20 21:01:41 -05:00
Rick Elrod
7b1a407905
nagios: remove ref to rawhide-composer
...
Signed-off-by: Rick Elrod <relrod@redhat.com>
2020-06-20 21:01:41 -05:00
Rick Elrod
a9ce430cd6
nagios: remove ref to fas01
...
Signed-off-by: Rick Elrod <relrod@redhat.com>
2020-06-20 21:01:41 -05:00
Rick Elrod
110ff0c648
nagios: comment out retrace stuff for now
...
Signed-off-by: Rick Elrod <relrod@redhat.com>
2020-06-20 18:39:58 -05:00
Stephen Smoogen
12c486cda3
remove some hosts in inventory file which do not exist in IAD2.
2020-06-14 19:51:27 -04:00
Stephen Smoogen
1b487b34a0
replace some hardcoded phx2 items with hardcoded iad2 items for koji, pdc, and nagios
2020-06-10 07:25:38 -04:00
Stephen Smoogen
bcda2606fe
try to make this work in iad2
2020-06-09 10:45:02 -04:00
Stephen Smoogen
7d5ab8fcfd
remove rabbitmq from phx2
2020-06-08 15:37:18 -04:00
Stephen Smoogen
e26ead0f70
try and get nagios working on noc01.iad2
2020-06-08 11:17:20 -04:00
Stephen Smoogen
00b7de4753
remove more hardcoded stg points in nagios
2020-06-08 10:33:05 -04:00
Stephen Smoogen
ca39f67ed6
remove _stg so nagios works
2020-06-08 10:29:30 -04:00
Stephen Smoogen
192637532c
set up things so nagios in iad2 is mostly ready.
2020-05-21 19:20:38 -04:00
Stephen Smoogen
aa8f90f074
and we remember that j2 files are not cfg files
2020-05-21 16:59:49 -04:00
Stephen Smoogen
794071b256
make mgmt interfaces faster to build
2020-05-21 16:46:41 -04:00
Stephen Smoogen
435095958d
move more service groups to static files and use servicegroup definitions in services
2020-05-21 15:47:19 -04:00
Stephen Smoogen
d82e99371c
use a different syntax for service groups to clean up phx2 ness
2020-05-21 15:22:48 -04:00
Stephen Smoogen
df9fcb477d
move nagios ipa file to template to make less phx2 dependent
2020-05-21 14:57:41 -04:00
Stephen Smoogen
89f91a9642
Clean up nagios to deal with dropped services and that servicegroups can NOT end with a , while every other nagios group can.
2020-05-21 13:22:26 -04:00
Stephen Smoogen
ba1b6c933d
ok ping doesn't need to be a template. all.cfg needs a group which says you can't ping it
2020-04-24 21:34:25 +02:00
Stephen Smoogen
a2ba26c5f4
this is ugly but it's been a 12 hour day
2020-04-24 21:34:25 +02:00
Stephen Smoogen
14c240a74a
clean up so nagios works
2020-04-24 21:34:25 +02:00
Stephen Smoogen
fb3b9ed5c9
and when you have a hammer all the world looks like a nail
2020-04-24 21:34:25 +02:00
Kevin Fenzi
9a255e3c41
nagios_server: try and adjust for all the aws copr instances
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-04-24 21:34:24 +02:00
Rick Elrod
4e71722d8a
nagios/fedmsg: make changes yesterday remain py2 backwards compatible
...
Signed-off-by: Rick Elrod <relrod@redhat.com>
2020-04-24 21:34:22 +02:00
Rick Elrod
40e13bcbf0
python3-ize the fedmsg check scripts, and on py3 boxes make it check a valid socket path
...
Signed-off-by: Rick Elrod <relrod@redhat.com>
2020-04-24 21:34:21 +02:00
Clement Verna
f4cf99d890
nagios: enable koji wellness again
...
Signed-off-by: Clement Verna <cverna@tutanota.com>
2020-04-24 21:34:21 +02:00
Clement Verna
bbe84b407c
nagios: disable koji wellness plugin to see if it causes the load on koji servers
...
Signed-off-by: Clement Verna <cverna@tutanota.com>
2020-04-24 21:34:21 +02:00
Clement Verna
a1cab8fc7c
Nagios: fix the koji_wellness command definition
...
Signed-off-by: Clement Verna <cverna@tutanota.com>
2020-04-24 21:34:21 +02:00
Clement Verna
1943a7fc51
Nagios: fix the name of the koji wellness command
...
Signed-off-by: Clement Verna <cverna@tutanota.com>
2020-04-24 21:34:21 +02:00