Stephen Smoogen
9845cd08be
fix nagios check on download.copr to use check_website_follow_ssl to remove alert
2022-01-21 11:16:55 -05:00
Pavel Raiskup
c9951efa8d
nagios: disable download.copr.fedoraproject.org check again
...
We don't know what's wrong on that:
HTTP CRITICAL: HTTP/1.1 503 Service Unavailable - string 'Index of /' not found on 'https://download.copr.fedorainfracloud.org:443/ ' - 3692 bytes in 0.631 second response time
2022-01-21 15:29:14 +01:00
Silvie Chlupova
ba86e27e79
copr: add nagios checks for copr servers
2022-01-21 14:18:05 +01:00
Silvie Chlupova
cb2f805c26
copr: don't check copr servers using nagios for now
2022-01-20 16:35:33 +01:00
Pavel Raiskup
f7edb31e43
noc: fixup noc.yaml playbook
...
Per report:
Error: Could not find any hostgroup matching 'datagrepper'
(config file '/etc/nagios/services/websites.cfg', starting on line 194)
Follow-up for: 726a788721
2022-01-20 15:34:41 +01:00
Silvie Chlupova
debd3c5b7e
copr: define new command for nagios
...
We need to use --ssl and also -f follow
2022-01-20 15:26:53 +01:00
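For commit debd3c5b7e above, a minimal sketch of what a command definition using `--ssl` and `-f follow` might look like. The name `check_website_follow_ssl` comes from a later commit subject in this log. Everything else is an assumption rather than the contents of the actual commit: the argument layout (`$ARG1$` host name, `$ARG2$` URL path, `$ARG3$` expected string) and the `$USER1$` plugin path macro.

```
# Hypothetical sketch, not the committed definition.
# --ssl connects over HTTPS; -f follow makes check_http follow HTTP redirects;
# -s requires the given string to appear in the response body.
define command {
    command_name    check_website_follow_ssl
    command_line    $USER1$/check_http -H $ARG1$ --ssl -f follow -u $ARG2$ -s "$ARG3$"
}
```

A service would then reference it as, for example, `check_website_follow_ssl!<host>!/!Index of /`, where the host value is a placeholder.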
Silvie Chlupova
6fa2999dbf
copr: use already existing copr.cfg
2022-01-20 13:23:31 +01:00
Silvie Chlupova
8c5dc50c7e
copr: move copr nagios services into separate file
2022-01-20 12:14:48 +01:00
Silvie Chlupova
87e510f378
copr: nagios check for copr frontend, backend and distgit
...
Fixes: https://pagure.io/copr/copr/issue/2002
2022-01-20 11:47:14 +01:00
Silvie Chlupova
8d9f6e0c4c
copr: nagios check for copr frontend, backend and distgit
...
Fixes: https://pagure.io/copr/copr/issue/2002
2022-01-20 08:33:23 +00:00
Silvie Chlupova
b9fa39f0c8
copr: nagios check for Copr's CDN
...
Relates: https://pagure.io/fedora-infrastructure/issue/10456
2022-01-04 15:28:24 +01:00
Kevin Fenzi
0f2ae88d63
nagios: add some copr team members
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2021-11-21 14:43:57 -08:00
Eddie Jennings, Jr
6ef496d56a
Reconfigure IPv6
...
Configure IPv6
Update IPv6 address for noc02 rule
Update IPv6 address in config for noc02 address change
Update IPv6 address for proxy04
Update IPv6 address for torrent02
2021-11-08 22:56:05 +00:00
Mikolaj Izdebski
26c38caafa
nagios: Remove check for supybot fedmsg plugin
...
Zodbot no longer has fedmsg plugin installed - supybot-fedmsg package
is not installed on value02 (RHEL 8) and supybot-fedmsg upstream
project on GitHub has been archived.
2021-11-03 22:49:21 +00:00
Mikolaj Izdebski
a65fa4e1c0
nagios_server: Update hostname where zodbot is running
...
Zodbot is running on value02 now.
2021-11-03 16:38:34 +01:00
49c1616ca7
Update nagios check for accounts.fedoraproject.org
2021-09-29 19:04:41 +00:00
Kevin Fenzi
844177a0ae
nagios: try and specify the additional groups another way
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2021-09-02 11:25:38 -07:00
Kevin Fenzi
d4ad74ae5e
nagios / vpnclients: fix typo in previous commit
...
group was used, but ansible needs groups here.
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2021-09-02 10:28:20 -07:00
Stephen Smoogen
2272ab1f6f
Add in a test to make sure that the nagios templates try to add in groups
...
with no vpn.
Signed-off-by: Stephen Smoogen <ssmoogen@redhat.com>
2021-08-27 11:05:40 -04:00
Kevin Fenzi
ec0d18a8b8
nagios: adjust where zodbot announces alerts, zodbot is on value02 now
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2021-08-22 10:10:10 -07:00
Pavel Raiskup
6062eec80b
nagios: drop copr_external.cfg from services
2021-08-10 09:59:23 +02:00
Pavel Raiskup
d2f9b772e9
nagios: move copr-ping to internal
2021-08-10 08:51:55 +02:00
Pavel Raiskup
ff215ea2b9
nagios: external: define copr_* hostgroups
2021-08-09 15:25:19 +02:00
Pavel Raiskup
f76859775c
nagios: pick up copr_external.cfg services
2021-08-09 13:50:30 +02:00
Jakub Kadlcik
9a8acc79ae
nagios: enable disk monitoring for copr instances
...
I think that / monitoring should work by default just by
setting `nrpe: true` because of
define service {
    hostgroup_name        all, !mincheckgrp
    service_description   Disk_Space_/
    check_command         check_by_nrpe!check_disk_/
    use                   disktemplate
}
2021-08-09 11:45:53 +00:00
Pavel Raiskup
73ba7d25b1
copr-be: fixup copr-ping nagios mapping
2021-08-09 13:34:25 +02:00
Pavel Raiskup
0771b0e4ad
copr-be: install ping nrpe task
2021-08-09 11:59:03 +02:00
Pavel Raiskup
44c172c56e
copr-be: copr-ping
2021-08-05 14:48:20 +02:00
Pavel Raiskup
97e5861ac0
nagios: sync copr-be and copr-be-dev
2021-07-28 23:06:26 +02:00
Pavel Raiskup
eb66378f24
nagios: typo in copr_back => copr_back_aws
2021-07-28 16:20:45 +02:00
Pavel Raiskup
e433a17ffe
nagios: add schlupov, and notify her in copr contactgroup
2021-07-28 14:49:50 +02:00
Pavel Raiskup
9eebd7387c
nagios: add contact for 'praiskup'
2021-07-28 14:14:18 +02:00
Pavel Raiskup
9dd486fac8
Revert "nagios: add me and schlupov to copr contact group"
...
We need to define the contacts first.
This reverts commit 00b5afa1a9.
2021-07-28 14:08:45 +02:00
Pavel Raiskup
29fb33bbb7
copr-be: test remaining results storage space
2021-07-28 13:51:16 +02:00
Pavel Raiskup
00b5afa1a9
nagios: add me and schlupov to copr contact group
2021-07-28 13:41:30 +02:00
Pavel Raiskup
92ff0683f5
nrpe: check_disk order (almost) alphabetically
...
Without this, it was hard to tell if check_disk.cfg.j2 mirrors
nrpe.cfg.j2.
2021-07-28 13:41:26 +02:00
Michael Scherer
3b8504f293
Fix mention of Freenode
2021-07-02 11:17:20 +02:00
d9fc78b0e4
nagios: remove MBSProducer check from mbs-backend
2021-05-21 18:58:14 +00:00
9006cf784e
nagios: remove unused check_datanommer_faf
2021-05-21 18:57:09 +00:00
Kevin Fenzi
d890a9fbf4
bugzilla2fedmsg: drop checks against vm as it has moved to openshift
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2021-05-19 12:00:49 -07:00
Stephen Smoogen
7b43b8049a
Update Kevin's config so nagios will load since 16x7 no longer exists
...
Signed-off-by: Stephen Smoogen <smooge@smoogespace.com>
2021-04-28 16:07:43 +00:00
Stephen Smoogen
e5a3fb3a43
Add in a 12x7 versus 16x7 and make some time zones friendlier
...
Signed-off-by: Stephen Smoogen <smooge@smoogespace.com>
2021-04-28 16:07:43 +00:00
Kevin Fenzi
5b1b2c403d
nagios: fix ipsilon check to look for something in the new theme
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2021-03-24 18:13:37 -07:00
seddikalaouiismaili
890dd31cb0
script to monitor systemd units on pagure
2021-02-12 11:34:57 +00:00
Kevin Fenzi
25ace56df7
pagure.io / nagios: check only that cert is valid for 25 days
...
We renew letsencrypt certs at 30 days, so checking at 60 is pointless.
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2021-02-02 14:24:07 -08:00
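For commit 25ace56df7 above, a rough sketch of a certificate-validity check with a 25-day threshold. It assumes "renew at 30 days" means renewal is attempted when 30 days of validity remain. Under that assumption a 60-day threshold alerts during every normal renewal cycle, while a 25-day threshold only fires when a renewal has actually been missed. The command name, service description, `host_name` value, and `defaulttemplate` template are hypothetical placeholders, not taken from the repository.

```
# Hypothetical sketch; all names below are illustrative.
# -C N makes check_http report a problem when the certificate is valid for
# fewer than N more days; --sni sends the host name during the TLS handshake.
define command {
    command_name    check_ssl_cert_days
    command_line    $USER1$/check_http -H $ARG1$ --sni -C $ARG2$
}

define service {
    host_name               pagure.io
    service_description     https-cert-validity
    check_command           check_ssl_cert_days!pagure.io!25
    use                     defaulttemplate
}
```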
Kevin Fenzi
3caa063699
nagios_server / services: registry is only on proxy01/10
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-12-02 09:57:53 -08:00
09d0a5bde5
Add monitoring to oci-registry api/webui ( Fixes #9231 )
2020-12-01 23:44:22 +01:00
Kevin Fenzi
a74b4015e7
nagios: contacts
...
Clean up a bunch of old contacts that no longer are around
or care about getting alerts from our nagios.
Add readme file that notes that this information is public and
people should use a filtered email address for this purpose and avoid
adding sensitive information like phone numbers.
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-10-28 11:52:24 -07:00
Kevin Fenzi
71c650baff
nagios / server: drop checking for fas fedmsgs, they likely won't be back
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-10-05 17:21:08 -07:00
Kevin Fenzi
f650eab7ee
nagios_server / fedmsg: pkgs01 does not run any fedmsg-hub anymore.
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-10-05 17:00:15 -07:00