Fedora Infrastructure Ansible Repository
Find a file
Kevin Fenzi 93cfa0134d rabbitmq: adjust things to avoid messy partitions
We have been having the cluster fall over for still unknown reasons,
but this patch should at least help prevent them:

first we increase the net_ticktime parameter from it's default of 60 to 120.
rabbitmq sends 4 'ticks' to other cluster members over this time and if 25%
of them are lost it assumes that cluster member is down. All these vm's are
on the same net and in the same datacenter, but perhaps heavy load
from other vm's causes them to sometimes not get a tick in time?
http://www.rabbitmq.com/nettick.html

Also, set our partitioning strategy to autoheal. Currently if some cluster
member gets booted out, it gets paused, and stops processing at all.
With autoheal it will try and figure out a 'winning' partition and restart
all the nodes that are not in that partition.
https://www.rabbitmq.com/partitions.html

Hopefully the first thing will make partitions less likely and the second
will make them repair without causing massive pain to the cluster.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-04-24 21:34:26 +02:00
cache_plugins try a custom cache plugin, then. 2020-04-24 21:34:15 +02:00
callback_plugins fedora_messaging_callback: This should be a } not a ) 2020-04-24 21:34:16 +02:00
files bodhi: patch error templates while waiting for a release 2020-04-24 21:34:26 +02:00
filter_plugins Drop another place where openshift_pseudohosts_stg is. 2020-04-24 21:34:15 +02:00
handlers autocloud: fare well autocloud, you served long and well... 2020-04-24 21:34:17 +02:00
inventory openqa/worker: disable aarch64-02 with a special worker class 2020-04-24 21:34:26 +02:00
library openstack: remove more lingering files and playbooks and libraries. 2020-04-24 21:34:26 +02:00
playbooks nfs-server: create 10 more 1gb persistent volumes for misc projects, the first 10 are used. 2020-04-24 21:34:26 +02:00
roles rabbitmq: adjust things to avoid messy partitions 2020-04-24 21:34:26 +02:00
scripts fix hosts_with_var_set script 2020-04-24 21:34:26 +02:00
tasks aws_cloud: don't skip facts gathering 2020-04-24 21:34:25 +02:00
vars RHEL8 cloud image does not have iptables-services installed 2020-04-24 21:34:26 +02:00
.gitignore [release-monitoring] Add librariesio consumer 2020-04-24 21:34:09 +02:00
.mailmap Add a .mailmap to map all my commits to one author in git shortlog. This is purely for my sanity, but also demonstrates how someone else could do similarly if needed. 2018-08-27 22:56:55 +00:00
CONVENTIONS Fix typo in doc 2016-04-04 14:33:21 +00:00
master.yml inventory / playbooks: remove a bunch of old cloud hosts that will be powered off tomorrow 2020-04-24 21:34:25 +02:00
README A whitespace change. Is it committing right? 2020-04-24 21:34:16 +02:00
README.md Add readme 2018-11-22 21:52:38 +01:00
STYLEGUIDE style: add initial style guide 2019-06-21 20:53:30 +00:00
TODO test 2018-05-09 02:55:42 +00:00

Fedora Infrastructure

Welcome! This is the Fedora Infrastructure Pagure project.

issues against this project are for issues in Fedora Infrastructure.

git repo of this project is misc scripts and tools for Fedora

If you are looking for the Fedora Infrastructure ansible repo, that is not here, look at:

https://infrastructure.fedoraproject.org/cgit/ansible.git/

If you would like to help out with Fedora Infrastructure, see:

https://fedoraproject.org/wiki/Infrastructure/GettingStarted and https://fedoraproject.org/wiki/Infrastructure_Apprentice