Kevin Fenzi
22dde8163b
unbound: remove and retire unbound servers
...
These instances served long and well as fallback resolvers for
dnssec-trigger. This is no longer needed or used, so lets remove them.
See https://pagure.io/fedora-infrastructure/issue/11415
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-07-24 14:40:43 -07:00
Stephen Smoogen
7d7d0bf0a8
Remove smooge from various aliases
...
Currently, I (Stephen Smoogen) do not have the time to work on Fedora
system administration items. However, I get a lot of email and people
see my email address in various places to ping me for working on
things. I feel it would be better to remove myself from those places
and let Fedora Infrastructure add someone else to replace me when it
is possible to do so.
Signed-off-by: Stephen Smoogen <ssmoogen@redhat.com>
2023-07-17 23:34:18 +00:00
Aurélien Bompard
e1d3dcc491
Darn JS SPA
...
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2023-05-09 13:31:12 +02:00
Aurélien Bompard
5920da4334
FMN: fix the Nagios check again
...
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2023-05-09 10:05:25 +02:00
Aurélien Bompard
80c7b61487
FMN: update the nagios check
...
FMN is now running in OpenShift
Fixes : #11296
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2023-05-09 09:14:25 +02:00
Aurélien Bompard
360e184862
FMN: move the old to -old and redirect to the new
...
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2023-04-26 10:55:25 +02:00
Pavel Raiskup
56c3f11a48
nagios: fix empty groups members in all-external.cfg.j2
2023-04-26 09:18:06 +02:00
Pavel Raiskup
5adeb88890
nagios: do not rely on ansible_hostname defined for the ipv6 checks
...
Changing the `host_name` - it was a "fake" non-DNS address anyway
Specify aliases only if ansible_hostname is stated in cached facts,
see https://pagure.io/fedora-infrastructure/issue/11264
2023-04-26 08:22:02 +02:00
Pavel Raiskup
33ccc11860
nagios: don't duplicate the host specifications
...
Revert "nagios: fix the ibiblio-hosts-ipv6 ifdef"
This reverts commit 4a999c925b
.
2023-04-26 00:36:56 +02:00
Pavel Raiskup
4a999c925b
nagios: fix the ibiblio-hosts-ipv6 ifdef
...
Fixes : #11264
2023-04-25 16:29:15 +00:00
Pavel Raiskup
dc0b0f1d7e
nagios: fix the whitespace problem, take #2
2023-04-25 14:40:04 +02:00
Pavel Raiskup
7e7e28a4c5
nagios: typo in config
2023-04-25 14:02:13 +02:00
Pavel Raiskup
01129c2be3
nagios: allow empty non-member groups
...
This is mostly to fix the ansible-playbook failure
`nagios -v /etc/nagios/nagios.cfg` if `member` is empty.
2023-04-25 13:18:28 +02:00
Stephen Smoogen
cae6400729
Fix external IAD2 ip address
...
The address noc02 was monitoring, 209.132.185.254, was a switch behind
a firewall which might not be viewable for various reasons. Red Hat
NOC let us know that 209.132.185.206 was the floating IP address which
is a better source of uptime.
Signed-off-by: Stephen Smoogen <ssmoogen@redhat.com>
2023-03-22 09:06:45 -04:00
Kevin Fenzi
71cdddf55b
nagios: move the ipv6 specific ping config to a ping-ipv6.cfg file
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2022-11-17 16:39:11 -08:00
Kevin Fenzi
b9b35a09ed
nagios: move ping.cfg to a template so it works for both nagios servers
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2022-11-17 16:19:50 -08:00
Stephen Smoogen
e6b3fb1904
Make it so that ipv6 is checked on hosts
2022-11-17 15:55:53 -05:00
Stephen Smoogen
993245267a
this one line should fix a mispaste of a bracket
2022-11-17 12:56:30 -05:00
Stephen Smoogen
e36f982263
This should allow for ansible to build correctly the templates for noc01/noc02.
2022-11-17 12:06:00 -05:00
Seddik Alaoui Ismaili
9af427e1bf
add ipv6 check for fedorapeople
2022-11-17 01:40:25 +00:00
Stephen Smoogen
b671e0e571
add phsmoura to the nagios system so they can acknowledge down systems and other events
2022-10-24 23:20:58 +00:00
Stephen Smoogen
7d31252ba0
FIX: nagios external was referencing phx2 ip addresses
...
The PHX2 colocation has been turned off. This meant that some configs
which had been accidently working before due to referencing an ip
address there that no longer existed broke. The fix was to rewrite the
config so that it contained proper router ips and remove all mentions
of the PHX2 ip address.
Signed-off-by: Stephen Smoogen <ssmoogen@redhat.com>
2022-07-29 09:46:49 -04:00
Stephen Smoogen
a34148440d
FIX: nagios was using 66.187.228.248 which is not a usable ip address on Ibiblio networks currently
2022-07-29 09:40:57 -04:00
Kevin Fenzi
75943dfe0e
websites build moved to openshift
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2022-06-29 18:16:33 -07:00
Kevin Fenzi
771d72e12d
resultsdb01: clean up last entries
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2022-06-27 15:14:12 -07:00
Mikolaj Izdebski
89f28097ce
nagios_server: Update koschei internal website check for ocp4
2022-06-24 17:55:10 +02:00
Kevin Fenzi
0757ae95df
greenwave: change nagios check for ocp4
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2022-06-15 16:01:01 -07:00
Kevin Fenzi
fcc9d984da
waiverdb / nagios: fix url to ocp4
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2022-06-15 15:37:38 -07:00
Mark O Brien
91f3d3b0bc
change nagios checks for http-bodhi to only run on ocp4 proxies
...
Signed-off-by: Mark O Brien <markobri@redhat.com>
2022-06-09 13:17:12 +01:00
Kevin Fenzi
d7a8c7aa57
nagios: only check mote on value01
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2022-05-25 13:25:00 -07:00
Mark O Brien
75aadffd63
rename proxies_ocp4 hostgroup
...
Signed-off-by: Mark O Brien <markobri@redhat.com>
2022-05-16 15:08:17 +01:00
Mark O Brien
28db0aa10f
update nagios checks for http-accounts for ocp4 proxies only
...
Signed-off-by: Mark O Brien <markobri@redhat.com>
2022-05-16 13:59:32 +01:00
Andrew Heath
81aad830e6
Fix typo
2022-04-29 18:58:50 +00:00
Andrew Heath
8795bffd2c
Adding Check for pagure.io per issue 10541
2022-04-29 18:58:50 +00:00
Kevin Fenzi
c88e89d96b
retrace: fix ssl check
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2022-02-20 15:06:29 -08:00
Kevin Fenzi
467498bb8b
retrace fixes: fix dns to work, add nagios check for ssl cert
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2022-02-20 13:52:35 -08:00
Kevin Fenzi
2e548a91e6
nagios_server: update what variable nagios templates use for ipv4
...
We changed eth0_ip and eth0_ipv4 to eth0_ipv4_ip. Update the host
templates.
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2022-02-09 16:03:01 -08:00
Kevin Fenzi
6cd9a57b0b
nagios: adjust hostname for copr-be, it cannot use the alias
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2022-02-07 13:52:13 -08:00
Silvie Chlupova
dce5318cfc
copr: add nagios check for copr backend
2022-02-07 20:22:45 +00:00
Kevin Fenzi
b388a003b4
nagios: add checks for ssl certs on fcos and ocp4 endpoints, change to just checking proxy01
...
Add checks for ssl certs on fcos openshift endpoints.
Add checks for ocp4 wildcard certs.
Change check to only use proxy01/proxy01.stg instead of all proxies.
Ideally we really do want to check all proxies, but in practice this
results in like 70 alerts anytime the cert is going to expire.
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2022-02-02 15:47:23 -08:00
Kevin Fenzi
4dda088136
nagios: remove duplicate variable check
...
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2022-01-31 10:29:21 -08:00
Silvie Chlupova
194a5503f3
copr: comment define service for copr backend, it doesn't work
2022-01-31 14:13:12 +01:00
Silvie Chlupova
5011e6a2dc
copr: remove -f follow from nagios check
2022-01-31 11:51:31 +01:00
Silvie Chlupova
db6dc98940
copr: fix nagios service for checking Copr CDN
...
Fixes: https://pagure.io/fedora-infrastructure/issue/10508
2022-01-31 10:34:43 +01:00
Stephen Smoogen
9845cd08be
fix nagios check on download.copr to use check_website_follow_ssl to remove alert
2022-01-21 11:16:55 -05:00
Pavel Raiskup
c9951efa8d
nagios: disable download.copr.fedoraproject.org chack again
...
We don't know what's wrong on that:
HTTP CRITICAL: HTTP/1.1 503 Service Unavailable - string 'Index of /' not found on 'https://download.copr.fedorainfracloud.org:443/ ' - 3692 bytes in 0.631 second response time
2022-01-21 15:29:14 +01:00
Silvie Chlupova
ba86e27e79
copr: add nagios checks for copr servers
2022-01-21 14:18:05 +01:00
Silvie Chlupova
cb2f805c26
copr: don't check copr servers using nagios for now
2022-01-20 16:35:33 +01:00
Pavel Raiskup
f7edb31e43
noc: fixup noc.yaml playbook
...
Per report:
Error: Could not find any hostgroup matching 'datagrepper'
(config file '/etc/nagios/services/websites.cfg', starting on line 194)"
Folow up for: 726a788721
2022-01-20 15:34:41 +01:00
Silvie Chlupova
debd3c5b7e
copr: define new command for nagios
...
We need to use --ssl and also -f follow
2022-01-20 15:26:53 +01:00