Commit graph

723 commits

Author SHA1 Message Date
Clement Verna
e220bb4867 Fedbages: put a higher threshold to fedmsg hub backlog nagios alerts.
The fedbages consumer has to consume many more messages and the query to datanomer
are slower. We need to allow the consumer to have a bigger backlog before raising an
alert.
Generally above 35000 message in the backlog it will be difficult for the consumer
to catch up, in that case it might be better to flush the backlog and restart from 0

Signed-off-by: Clement Verna <cverna@tutanota.com>
2019-05-05 13:59:31 +02:00
Stephen Smoogen
c432675c74 [nagios] add checks for datanommer monitoring 2019-04-25 18:13:43 +00:00
Clement Verna
93d0eeaf54 Nagios: monitor that resultsdb sends messages on the bus
Signed-off-by: Clement Verna <cverna@tutanota.com>
2019-04-24 11:22:46 +02:00
Patrick Uiterwijk
0c7449ea1d Add sslciphers tags
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org>
2019-04-08 21:41:17 +02:00
Miroslav Suchý
1a1ca033b6 sent Copr nagios notifications to frostyx too 2019-04-01 10:13:00 +02:00
Miroslav Suchý
b7394d1c54 sent notifications to msuchy@ as clime@ does not work now 2019-04-01 10:10:33 +02:00
Stephen Smoogen
d9d24d08d9 [nagios_server] Add in certgetter test.
This was offered by Alessandro Lorenzi <alorenzi@alorenzi.eu> as a fix
to deal with our inability to monitor the certgetter after
reboots. Thank you very much for this work.

Signed-off-by: Stephen Smoogen <smooge@redhat.com>
2019-03-20 20:17:43 +00:00
Kevin Fenzi
625acabd00 nagios: monitoring resultsdb.ci.centos.org. Need to call it by it's internal fedora name here.
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2019-03-16 04:21:02 +00:00
Kevin Fenzi
39cee77465 nagios: monitor resultsdb.ci.centos.org. Ticket 7601.
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2019-03-15 20:49:49 +00:00
Kevin Fenzi
1a40dd5142 nagios: drop askbot fedmsg check.
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2019-02-21 15:34:31 +00:00
Kevin Fenzi
a6cceb3599 nagios: drop remnant of check_osbs_builds
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2019-02-21 15:23:54 +00:00
Rick Elrod
8a4ed2358a remove tummy01 everywhere
Signed-off-by: Rick Elrod <relrod@redhat.com>
2019-02-20 17:28:52 +00:00
Stephen Smoogen
06cc8ca030 and we have no bodhost 2019-02-14 21:11:50 +00:00
Rick Elrod
0b7bb3b5b3 prep for proxy03 move
Signed-off-by: Rick Elrod <relrod@redhat.com>
2019-02-11 23:14:27 +00:00
Rick Elrod
4c8cf933fc make odcs-backend check for fedmsg-hub-3 instead (infra #7526)
Signed-off-by: Rick Elrod <relrod@redhat.com>
2019-01-28 08:54:46 +00:00
Patrick Uiterwijk
18b0acc8f3 Monitor ostree summary on proxies
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org>
2019-01-21 16:57:26 +01:00
Patrick Uiterwijk
2ded08f111 Add 24-hour check for bodhi compose start
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org>
2018-12-20 20:44:10 +01:00
Stephen Smoogen
0267f916bd This will fix nrpe on server 2018-12-19 13:44:03 +00:00
Stephen Smoogen
6c1357ff59 repospanner is only running on pkgs01 currently 2018-12-17 18:25:27 +00:00
Stephen Smoogen
418af8da3d and now make it so nagios sees the file on the server 2018-12-17 17:00:13 +00:00
Stephen Smoogen
0819f469c0 this should allow noc01 to see nrpe commands 2018-12-17 16:42:16 +00:00
Stephen Smoogen
3bbc0031f4 This will add minimal monitoring for repospanner on pkgs01.stg. This only says it is running or not. 2018-12-17 15:44:31 +00:00
Kevin Fenzi
4125997ecc fix the check_supybot_plugin to listen only for zodbot privmsg, not frigg 2018-12-15 20:52:54 +00:00
Stephen Smoogen
cc16f56e4f maybe this will make qa02 easier? 2018-12-15 01:14:59 +00:00
Stephen Smoogen
c7deb6502a somedays you just keep neanderthal-patching until it works 2018-11-20 00:13:00 +00:00
Randy Barlow
4422e2bb2d Monitor fedmsg-hub-3 on Bodhi instead of fedmsg-hub.
Signed-off-by: Randy Barlow <randy@electronsweatshop.com>
2018-11-19 21:33:37 +00:00
Randy Barlow
ce86a667b7 Configure check_fedmsg_cp_bodhi_backend02_hub to use fedmsg-hub-3.
Signed-off-by: Randy Barlow <randy@electronsweatshop.com>
2018-11-19 21:29:14 +00:00
Randy Barlow
2911286a3a Rename check_fedmsg_masher_proc to check_fedmsg_composer_proc and have it check fedmsg-hub-3.
Signed-off-by: Randy Barlow <randy@electronsweatshop.com>
2018-11-19 21:20:15 +00:00
Patrick Uiterwijk
505d8bbf8c Set COPR backend to send notifs to copr
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org>
2018-10-20 19:27:40 +02:00
Patrick Uiterwijk
5fa3c6e53d Add contact info for clime
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org>
2018-10-20 19:27:40 +02:00
Patrick Uiterwijk
52d42114c4 Allow clime to ack services
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org>
2018-10-20 19:27:40 +02:00
Patrick Uiterwijk
3fc57e699b Enable nagios checks for ticketkey, and stop emailing puiterwijk
Signed-off-by: Patrick Uiterwijk <patrick@puiterwijk.org>
2018-10-20 15:36:00 +02:00
Kevin Fenzi
6cefdc1d5b another place anitya is 2018-10-07 20:04:40 +00:00
Kevin Fenzi
592fffe808 drop this refrence too 2018-10-06 21:33:26 +00:00
Kevin Fenzi
b2ff9078f2 Clean up non openshift anitya in favor of openshift version. 2018-10-06 00:11:10 +00:00
Kevin Fenzi
5280ce623a drop coreos site, it moved to github 2018-10-03 17:58:34 +00:00
Kevin Fenzi
2d7ac321c7 a few tagger stragglers 2018-10-03 17:56:00 +00:00
David Shier
c0f45892ff Removed all traces i could fild of the tagger and statscache (and stats_cache for databases) in ansible, proxy configs, and the nagios config. Pursuant to request in pagzre issue https://pagure.io/fedora-infrastructure/issue/7267 . - Odin2016 2018-10-03 17:50:38 +00:00
Stephen Smoogen
330a8e4a1d and this should make our mail easier to read 2018-08-27 22:44:58 +00:00
Stephen Smoogen
8f742a90d5 you cant comment out jinja2 silly 2018-08-27 22:07:56 +00:00
Stephen Smoogen
28d323bfcf try to make nomail better 2018-08-27 21:44:17 +00:00
Kevin Fenzi
6c3dc368cd pkgs is now a letsencrypt cert, so we do not need to monitor it for 60 days 2018-08-22 00:00:21 +00:00
Stephen Smoogen
a792adea13 add in selinux for nagios servers 2018-08-21 17:25:03 +00:00
Kevin Fenzi
63079521a4 drop autoqa from nagios too 2018-07-31 17:46:54 +00:00
Rick Elrod
8098ce7fe9 Ignore newcloud-undercloud and make the check here look a little prettier
Signed-off-by: Rick Elrod <relrod@redhat.com>
2018-07-20 23:34:56 +00:00
Rick Elrod
173cf3a7d2 more pkgdb nuke
Signed-off-by: Rick Elrod <relrod@redhat.com>
2018-07-20 23:19:43 +00:00
Kevin Fenzi
5149f16b73 kill pkgdb from nagios server 2018-07-20 04:54:23 +00:00
Kevin Fenzi
da32a2cf5b move around some inventory/group/host vars to make more sense and setup a group for no nagios monitoring 2018-07-17 17:44:50 +00:00
Rick Elrod
5c6e41b073 add /api/ -- make nagios check for the gnome-software endpoint only
Signed-off-by: Rick Elrod <relrod@redhat.com>
2018-07-10 02:33:34 +00:00
Patrick Uiterwijk
0854930115 We have no jenkins service anymore
Signed-off-by: Patrick Uiterwijk <puiterwijk@redhat.com>
2018-07-03 02:28:37 +02:00