Commit graph

106 commits

Author SHA1 Message Date
Patrick Uiterwijk
b51369be69 Also add the client-side check definition
Signed-off-by: Patrick Uiterwijk <puiterwijk@redhat.com>
2016-01-18 14:16:33 +00:00
Kevin Fenzi
f0c80375b5 Remove action: in all roles. 2016-01-06 21:58:31 +00:00
Ralph Bean
b19cede422 bugyou nagios configuration. 2015-12-18 17:35:17 +00:00
Ralph Bean
3e14fe5fad Add nagios checks on the new packages03 cache invalidator. 2015-11-24 17:24:49 +00:00
Ralph Bean
c3bd96853a Kill old references to fcomm-cache-worker. 2015-11-19 18:47:43 +00:00
Kevin Fenzi
85a1e3e30b Fix another yum->dnf ism 2015-11-09 17:34:26 +00:00
Ralph Bean
a0a5f56757 Define the datanommer autocloud check client-side. 2015-10-09 14:28:54 +00:00
Patrick Uiterwijk
da3bd1044b Nagios should check against batcave01 now, lockbox01 is EOL 2015-10-01 17:32:54 +00:00
Ralph Bean
64407183f1 Try again to fix the autocloud proc check. 2015-09-30 21:26:51 +00:00
Ralph Bean
cf2f71b487 TYpofix. 2015-09-30 19:33:36 +00:00
Ralph Bean
e992ca4e2b Gotta define these checks. 2015-09-30 19:32:48 +00:00
Ralph Bean
4cd515c14f Some nagios monitoring for autocloud. 2015-09-30 19:22:43 +00:00
Ralph Bean
b75e021169 Adjust nagios checks for the bodhi-backend split out. 2015-09-02 17:24:26 +00:00
Ralph Bean
94102f92d5 Typofix. 2015-08-19 15:54:21 +00:00
Ralph Bean
17251bdb6b Nagios monitoring for bodhi2. 2015-08-19 15:17:14 +00:00
Mikolaj Izdebski
2ca1f29c58 Limit nagios check_procs process names to 16 chars
Top utility truncates long process names (eg. koschei-scheduler is
displayed as "koschei-schedul") and I think Nagios expects this too.
2015-07-09 22:53:07 +00:00
Mikolaj Izdebski
dc070143cf Implement Koschei-specific service checks 2015-07-09 22:04:29 +00:00
Ralph Bean
8e2246ed92 Up threshhold on the datanommer backlog alert (for now) 2015-07-02 13:40:50 +00:00
Mikolaj Izdebski
bcdb9533a4 Initial Koschei Nagios checks 2015-06-26 16:56:44 +00:00
Kevin Fenzi
7fe5b0100e Setup mariadb backup to make a latest link, make nrpe on db03 check that for backup age. 2015-06-08 21:14:36 +00:00
Kevin Fenzi
6309062daa Fix up nagios check for mysqlmariadb database dump ages. 2015-06-04 20:43:30 +00:00
Kevin Fenzi
275f4b5203 Change all instances of ansible_distribution_major_version to filter to int for comparisons. 2015-05-27 22:27:39 +00:00
Ralph Bean
3d5296bb4f Add a datanommer+nagios check for the new faf/abrt messages. 2015-05-07 19:01:43 +00:00
Ralph Bean
5b04ce917b (cosmetic) move datanommer fedimg check to another section. 2015-05-07 19:01:43 +00:00
Ralph Bean
4a559466b5 (nagios) bump up fedimg limits even more. 2015-03-13 17:31:11 +00:00
Ralph Bean
8c7c31d29d (nagios) bump up alert threshholds for fedimg. 2015-03-13 13:55:36 +00:00
Kevin Fenzi
21f9252143 Add check_lock_file_age check for fas01 2015-02-20 15:11:12 +00:00
Kevin Fenzi
2406b0f8dc Fix lookaside disk check to use libdir 2015-02-19 16:33:02 +00:00
Kevin Fenzi
3068bc611e Lets try and fix up checks on pkgs02 2015-02-19 15:34:21 +00:00
Ralph Bean
a47d5951e2 (hotness) bump up alert threshholds. 2015-02-18 00:47:56 +00:00
Ralph Bean
565e15c6d2 Define the datanommer check for hotness. 2015-02-17 17:18:25 +00:00
Kevin Fenzi
cb075483af Bump this cron check to keep koji02 from nagging us 2015-02-07 15:30:13 +00:00
Ralph Bean
b2f4c9e4db Bump up these fmn alert threshholds. 2015-02-06 15:04:15 +00:00
Kevin Fenzi
75a8d7b4ee Add this script here too. 2015-01-21 01:55:33 +00:00
Kevin Fenzi
148b4f957c Install check_haproxy_conns script 2015-01-21 01:51:40 +00:00
Kevin Fenzi
cad791174d Add some checks that are needed on the proxies 2015-01-21 00:07:25 +00:00
Ralph Bean
be119426e8 Define checks for fedimg and stub out checks for hotness (coming soon). 2015-01-19 21:52:14 +00:00
Ralph Bean
eacfdb95ba The scrutiny of axilleas. 2014-11-24 14:26:23 +00:00
Ralph Bean
85c486b34b Check for connectivity to memcached.
This will attempt to call the daemon's stats command which, if broken, might
hung and cause nrpe to time out.  We want that, as it will give us a clue to
what might be causing some other app to fail.
2014-11-19 18:35:14 +00:00
Kevin Fenzi
41ab725771 Mark these as always_run (so they run in --check) and never changed (since they are just informational) 2014-11-13 16:05:32 +00:00
Ralph Bean
6326659ba0 Nagios: Check datanommer for anitya messages. 2014-11-12 16:24:07 +00:00
Ralph Bean
02b8ab294f Also, do this the other way around. 2014-11-07 18:53:44 +00:00
Ralph Bean
88d8318332 Nuke that nuancier datanommer check. The one that always times out. 2014-11-05 20:37:15 +00:00
Praveen Kumar
4b1e5162d7 Update state from installed/removed to present/absent for yum module as per latest documents -> http://docs.ansible.com/yum_module.html 2014-11-05 15:32:11 +00:00
Kevin Fenzi
45c1990fc1 Add taskotron entries, clean up external proxies to actually check 2014-10-09 20:18:32 +00:00
Ralph Bean
6d1870bc67 Add nagios checks for anitya fedmsg stuff. 2014-10-03 19:56:58 +00:00
Ralph Bean
b3a97a1c91 Add two new nagios checks for the FMN "Producers" 2014-10-02 13:42:27 +00:00
Ralph Bean
1bc4fc879c Bump that threshold up more. 2014-10-01 14:27:07 +00:00
Ralph Bean
0b0b7ce975 Adjust backlog nagios threshholds. 2014-10-01 13:44:26 +00:00
Ralph Bean
1f881b88d5 Define nagios checks for bugzilla2fedmsg01. 2014-09-25 17:00:03 +00:00