Commit graph

31 commits

Author SHA1 Message Date
Kevin Fenzi
e3e2cb1d93 odcs: retire service ( infra 12192 )
Time to retire ODCS. ELN is moved off and that was the last thing using
it. Thanks for all the service ODCS!

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-09-24 18:21:51 +00:00
Kevin Fenzi
0dfa11a6eb fedimg: signing off...
Thanks for all the uploads fedimg.
You go to a far far better place I'm sure.

There's no point in keeping it around now, as it's actually not working
and the replacement ( cloud-image-uploader) should work soon.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-08-13 16:40:01 -07:00
Kevin Fenzi
84a7a7afc8 nagios: adjust nrpe for badges vs old fedbadges
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-05-28 13:54:53 -07:00
Kevin Fenzi
d366194a22 module-build-service (mbs): retire service
With the EOL of Fedora 38 yesterday, we are no longer building any
modules and can retire our module build service.

Note that toddlers needs to be adjusted still, that will happen after
this.

Thanks for all the modules!

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2024-05-22 13:38:53 -07:00
Kevin Fenzi
22dde8163b unbound: remove and retire unbound servers
These instances served long and well as fallback resolvers for
dnssec-trigger. This is no longer needed or used, so lets remove them.
See https://pagure.io/fedora-infrastructure/issue/11415

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-07-24 14:40:43 -07:00
Pavel Raiskup
0944ac4ef3 copr-dist-git: decrease storage warning quota
With 5T storage, it is enough to warn on remaining 12%, and error on 6%.
2023-07-24 07:14:16 +02:00
Pavel Raiskup
09c7c868c6 copr-be: nagios: decrease the quota warning even more 2023-05-26 09:16:37 +02:00
Pavel Raiskup
5adcfbbbd6 copr-be: decrease the nagios warning quota to 10%, attempt #2
10% is still ~2.4T of free space, ATM it looks like enough to not start
the panic mode.

Complements: 2ed4e90feb
Fixes: https://github.com/fedora-copr/copr/issues/2737
2023-05-24 07:53:01 +02:00
Andrew Heath
9121258f52 reenable ansible nagios busgateway01 checks 2023-05-23 12:13:31 -04:00
Andrew Heath
9d3c107ef0 Disabling ansible check till we can troubleshoot 2023-05-19 20:07:41 +00:00
Andrew Heath
3600553301 removing nommer and fixing RPM sign 2023-05-19 20:07:41 +00:00
Andrew Heath
2b01531a46 Removeing greenwave checks per ticket 11090 2023-04-11 17:32:49 +00:00
Pavel Raiskup
120acfb3e7 copr-be: really setup the copr-be storage warning to 12%
The templates got de-synced.
2022-04-23 23:54:23 +02:00
Pavel Raiskup
e3bee776ea nagios/copr: start warning us on 12% of backend storage
There's 15T (and we can enlarge the volume to 16T).  12% is still 1.8T.
2022-03-01 10:03:04 +01:00
Silvie Chlupova
b9fa39f0c8 copr: nagios check for Copr's CDN
Relates: https://pagure.io/fedora-infrastructure/issue/10456
2022-01-04 15:28:24 +01:00
Mikolaj Izdebski
26c38caafa nagios: Remove check for supybot fedmsg plugin
Zodbot no longer has fedmsg plugin installed - supybot-fedmsg package
is not installed on value02 (RHEL 8) and supybot-fedmsg upstream
project on GitHub has been archived.
2021-11-03 22:49:21 +00:00
Jakub Kadlcik
9a8acc79ae nagios: enable disk monitoring for copr instances
I think that / monitoring should work by default just by
setting `nrpe: true` because of

    define service {
      hostgroup_name	all, !mincheckgrp
      service_description   Disk_Space_/
      check_command		check_by_nrpe!check_disk_/
      use                   disktemplate
    }
2021-08-09 11:45:53 +00:00
Pavel Raiskup
73ba7d25b1 copr-be: fixup copr-ping nagios mapping 2021-08-09 13:34:25 +02:00
Pavel Raiskup
29fb33bbb7 copr-be: test remaining results storage space 2021-07-28 13:51:16 +02:00
Pavel Raiskup
92ff0683f5 nrpe: check_disk order (almost) alphabetically
Without this, it was hard to tell if check_disk.cfg.j2 mirrors
nrpe.cfg.j2.
2021-07-28 13:41:26 +02:00
d9fc78b0e4 nagios: remove MBSProducer check from mbs-backend 2021-05-21 18:58:14 +00:00
9006cf784e nagios: remove unused check_datanommer_faf 2021-05-21 18:57:09 +00:00
Kevin Fenzi
d890a9fbf4 bugzilla2fedmsg: drop checks against vm as it has moved to openshift
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2021-05-19 12:00:49 -07:00
seddikalaouiismaili
890dd31cb0 script to monitor systemd units on pagure 2021-02-12 11:34:57 +00:00
Pierre-Yves Chibon
f91a80046b Wipe everything that is to do with pdc-backend from our ansible repo
Signed-off-by: Pierre-Yves Chibon <pingou@pingoured.fr>
2020-10-05 18:57:52 +00:00
seddikalaouiismaili
e785293064 add check for rsyslogd 2020-10-02 18:50:29 +00:00
Stephen Smoogen
8d58708305 remove 10.5.126 ips from nrpe to try and figure out why host was not connecting 2020-09-23 17:08:17 -04:00
Kevin Fenzi
632d4a0273 nagios_server: adjust a bunch more things for iad2.
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-06-30 15:39:32 -07:00
Kevin Fenzi
5a7245bf26 iptables / nagios_client/server: clean up more phx2 ips for iad2
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-06-30 14:51:43 -07:00
Kevin Fenzi
6908fbf86a nagios_server: replace phx2_internal with iad2_internal.
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-06-30 14:33:11 -07:00
Stephen Smoogen
192637532c set up things so nagios in iad2 is mostly ready. 2020-05-21 19:20:38 -04:00
Renamed from roles/nagios_server/files/nrpe/nrpe.cfg (Browse further)