Commit graph

30359 commits

Author SHA1 Message Date
Kevin Fenzi
3c3fa03260 odcs-backend-releng01: add to ansible
This is a temp host until we ship the first round of systems out next month to the
new datacenter. At that point it will go away, and it will come back later in June
after we have gotten things moved and back up again.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-04-24 21:34:27 +02:00
Kevin Fenzi
6900b2d1d4 vhost-info: adjust for ansible python api changes
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-04-24 21:34:27 +02:00
Kevin Fenzi
efd11ec94f copr-frontend01/02.stg: Drop these for now as they were never in use and just confuse things.
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-04-24 21:34:27 +02:00
Kevin Fenzi
70a35e3a65 Update variables. We are now post beta and unfrozen.
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-04-24 21:34:27 +02:00
Kevin Fenzi
487cd3b483 aarch64-test01/02: just use the ip address here and avoid dns issues.
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-04-24 21:34:27 +02:00
Stephen Smoogen
94045dd6a9 fix cloudvmhost-aarch64-01.fedorainfracloud.org br0 ip address 2020-04-24 21:34:27 +02:00
Jan Kaluža
8040bd2593 ODCS: Fix Pungi cache name. 2020-04-24 21:34:27 +02:00
Jan Kaluža
44f6bac47f ODCS: Execute Pungi cache ownership fix only on backend. 2020-04-24 21:34:27 +02:00
Jan Kaluža
09f4d2cb8f ODCS: Also fix Pungi cache ownership. 2020-04-24 21:34:27 +02:00
Jan Kaluža
56b51220a3 ODCS: Also fix koji profile ownership. 2020-04-24 21:34:27 +02:00
Jan Kaluža
c14ca86f6d ODCS: Also fix /var/run/odcs-backend ownership. 2020-04-24 21:34:27 +02:00
Jan Kaluža
d051992cdf ODCS: Fix owner of /var/log/odcs-backend. 2020-04-24 21:34:27 +02:00
Jan Kaluža
f3e081073d ODCS: Add search permissions for /srv/odcs, remove any permissions for 'others'. 2020-04-24 21:34:27 +02:00
Jan Kaluža
75dfd33da2 ODCS: Add apache to odcs-server group so it can access /srv/odcs. 2020-04-24 21:34:27 +02:00
Jan Kaluža
871cff021a ODCS: Do not change the owner/group recursively. 2020-04-24 21:34:27 +02:00
Jan Kaluža
14417c5f18 ODCS: use the same user/group on every machine. 2020-04-24 21:34:27 +02:00
Kevin Fenzi
ebb4449e99 aarch64-test01/02: move back to emag in cloud network.
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-04-24 21:34:27 +02:00
Kevin Fenzi
2c9b262b39 storinator: Make sure to export the volumes you create.
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-04-24 21:34:27 +02:00
Stephen Smoogen
efcd9b0ead the problem was that some of the host names were listed as cloud-noc01.fedorainfracloud.org and others were named cloud-noc01.cloud.fedoraproject.org. Move to 1 name throughout ansible 2020-04-24 21:34:27 +02:00
Stephen Smoogen
1fbe999a54 how do you shave a yak? one hair at a time. Move the cloud-noc to the right name so playbook works 2020-04-24 21:34:27 +02:00
Stephen Smoogen
228f977ca5 add 10disk boot for RHEL-8 2020-04-24 21:34:27 +02:00
Stephen Smoogen
c2c492dcc1 we have data for a host I forgot to put in 2020-04-24 21:34:26 +02:00
Stephen Smoogen
9bfa44099f cloud-noc needs to be in cloud_phx2 group 2020-04-24 21:34:26 +02:00
Stephen Smoogen
bc7e5a2ab3 try to get cloud-noc01 working again in nagios 2020-04-24 21:34:26 +02:00
Stephen Smoogen
bd9e3547e5 putting in correct ip address helps 2020-04-24 21:34:26 +02:00
Stephen Smoogen
5bb5c022cf cloud-noc01 ip address changed but we forgot to update the inventory 2020-04-24 21:34:26 +02:00
Stephen Smoogen
7c3f7b1b8d Add first commit for copr-vmhost to dhcp 2020-04-24 21:34:26 +02:00
Kevin Fenzi
b9b08e413f proxies / redirects / prerelease: Disable prerelease redirects for beta release
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-04-24 21:34:26 +02:00
Kevin Fenzi
80477848ae s3-mirror: fix missing trailing /
This is causing f31 updates to not be synced. The cron job reports:

Subject: Cron <s3-mirror@mm-backend01> /usr/local/bin/lock-wrapper s3sync-updates-current "/usr/local/bin/s3-sync-path.sh /pub/fedora/linux/updates/31/Everything/x86_64/os" 2>&1 |
        /usr/local/bin/nag-once s3-updates-current.sh 1d 2>&1

Syntax: /usr/local/bin/s3-sync-path.sh /pub/path/to/sync/
NOTE! Path must end with a trailing /

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-04-24 21:34:26 +02:00
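The failure described in 80477848ae comes down to a path argument that lacks its trailing slash. As a minimal sketch of how a caller could guard against that, the helper below normalizes a path before it is handed to the sync script. The function name ensure_trailing_slash is hypothetical and is not part of s3-sync-path.sh or the cron job.

```python
def ensure_trailing_slash(path: str) -> str:
    """Return path unchanged if it already ends with '/', otherwise append one."""
    return path if path.endswith("/") else path + "/"


if __name__ == "__main__":
    # The path from the cron report above, which was missing its trailing slash.
    print(ensure_trailing_slash("/pub/fedora/linux/updates/31/Everything/x86_64/os"))
```

Applying the helper twice gives the same result as applying it once, so it is safe to call on paths that are already correct.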
Dominik Turecek
14c85a0c55 copr: patch API to provide build_chroot status 2020-04-24 21:34:26 +02:00
Pavel Raiskup
2a4698ee3a swap: make it idempotent
Call 'swapon -a' only when we just added the swap /etc/fstab entry.
2020-04-24 21:34:26 +02:00
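The idea behind 2a4698ee3a is to activate swap only on the run that actually changed /etc/fstab. The sketch below illustrates that pattern in plain Python rather than in the Ansible role itself; the function names, the fstab path argument, and the injectable run parameter are all assumptions made for illustration.

```python
import subprocess
from pathlib import Path


def ensure_swap_entry(fstab: Path, entry: str) -> bool:
    """Append entry to fstab if it is absent; return True only when the file changed."""
    text = fstab.read_text()
    if entry in text.splitlines():
        return False
    # Avoid gluing the new entry onto an unterminated last line.
    separator = "" if text == "" or text.endswith("\n") else "\n"
    fstab.write_text(text + separator + entry + "\n")
    return True


def enable_swap(fstab: Path, entry: str, run=subprocess.run) -> bool:
    """Run 'swapon -a' only when the fstab entry was just added."""
    changed = ensure_swap_entry(fstab, entry)
    if changed:
        run(["swapon", "-a"], check=True)
    return changed
```

A second call with the same entry reports no change and does not invoke swapon again, which is the idempotent behavior the commit is after.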
Jan Kaluža
a9da2ee4d9 ODCS: restart the celery backend after db upgrade 2020-04-24 21:34:26 +02:00
Jan Kaluža
3b17b92e20 ODCS: Stop calling useless db-migrate command. 2020-04-24 21:34:26 +02:00
Jan Kaluža
af92d2582a ODCS: Add extra compose target directory for private composes. 2020-04-24 21:34:26 +02:00
Stephen Smoogen
d9aee1fc32 add cloudvmhost-aarch64 to inventories so it can be done stuff to 2020-04-24 21:34:26 +02:00
Stephen Smoogen
c3f5874d89 this will get ips for the cloud box 2020-04-24 21:34:26 +02:00
Stephen Smoogen
7681b1e1e2 add in permission for proxy31 ip 2020-04-24 21:34:26 +02:00
Mohan Boddu
f6506ddc41 F32 Beta is GO
Signed-off-by: Mohan Boddu <mboddu@bhujji.com>
2020-04-24 21:34:26 +02:00
Kevin Fenzi
93cfa0134d rabbitmq: adjust things to avoid messy partitions
We have been having the cluster fall over for still unknown reasons,
but this patch should at least help prevent them:

first we increase the net_ticktime parameter from its default of 60 to 120.
rabbitmq sends 4 'ticks' to other cluster members over this time and if 25%
of them are lost it assumes that cluster member is down. All these VMs are
on the same net and in the same datacenter, but perhaps heavy load
from other VMs causes them to sometimes not get a tick in time?
http://www.rabbitmq.com/nettick.html

Also, set our partitioning strategy to autoheal. Currently if some cluster
member gets booted out, it gets paused, and stops processing at all.
With autoheal it will try and figure out a 'winning' partition and restart
all the nodes that are not in that partition.
https://www.rabbitmq.com/partitions.html

Hopefully the first thing will make partitions less likely and the second
will make them repair without causing massive pain to the cluster.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-04-24 21:34:26 +02:00
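To make the effect of the net_ticktime change in 93cfa0134d concrete, here is a small arithmetic sketch. It assumes the timing described in the Erlang kernel documentation as I read it: a tick is sent every net_ticktime / 4 seconds, and an unresponsive node is detected somewhere between 0.75 and 1.25 times net_ticktime. The function name tick_timing is hypothetical.

```python
def tick_timing(net_ticktime: float) -> dict:
    """Derive the tick interval and the down-detection window, all in seconds."""
    return {
        "tick_interval": net_ticktime / 4,   # four ticks per net_ticktime period
        "detect_min": 0.75 * net_ticktime,   # earliest a silent node is declared down
        "detect_max": 1.25 * net_ticktime,   # latest a silent node is declared down
    }


if __name__ == "__main__":
    for value in (60, 120):  # the old default and the new setting
        print(value, tick_timing(value))
```

Under these assumptions, doubling net_ticktime from 60 to 120 doubles both the tick interval and the time a node may stay silent before it is treated as down, at the cost of slower detection of real failures.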
Adam Williamson
a6b9c5392d openqa/worker: disable aarch64-02 with a special worker class
openqa-aarch64-02.qa is broken in some very mysterious way:
https://pagure.io/fedora-infrastructure/issue/8750
until we can figure that out, this should prevent it picking up
normal jobs, but let us manually target a job at it whenever we
need to for debugging.

Signed-off-by: Adam Williamson <awilliam@redhat.com>
2020-04-24 21:34:26 +02:00
Stephen Smoogen
bfcf4498ca hey lets add some variables 2020-04-24 21:34:26 +02:00
Kevin Fenzi
f9a95fc764 nfs-server: create 10 more 1gb persistent volumes for misc projects, the first 10 are used.
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2020-04-24 21:34:26 +02:00
Clement Verna
05905e6904 koji_hub: try to setup the loglevel in staging
Signed-off-by: Clement Verna <cverna@tutanota.com>
2020-04-24 21:34:26 +02:00
Stephen Smoogen
c7c4320c93 move the cloud aarch64 box back into cloud 2020-04-24 21:34:26 +02:00
Stephen Smoogen
2b9f82f9f4 proxy31 2020-04-24 21:34:26 +02:00
Miroslav Suchý
9dd176bfa5 retrace: define devel var 2020-04-24 21:34:26 +02:00
Miroslav Suchý
21170d4680 retrace: remove f29 from faf 2020-04-24 21:34:26 +02:00
Michal Fabik
ce47bc3544 retrace: Add Fedora 32 to retrace
Signed-off-by: Michal Fabik <mfabik@redhat.com>
Signed-off-by: Miroslav Suchý <msuchy@redhat.com>
2020-04-24 21:34:26 +02:00
Kevin Fenzi
96190cec94 Revert "Revert "s3-mirror: Split things into 2 sync runs, one without repodata and delete, the other with both.""
This reverts commit 0798930ebea639871194525789061161b2c19bf9.

Got some +1s, so pushing this now.
2020-04-24 21:34:26 +02:00
Stephen Smoogen
32e021d89f fix typo in previous post 2020-04-24 21:34:26 +02:00