Commit graph

38356 commits

Author SHA1 Message Date
Kevin Fenzi
f0b98b904c Openshift apps: drop app owners that have left us
Before we enable any monitoring, we should clean up app owners some so
we do not spam people who arent around anymore and no longer care about
the app. ;)

If I removed anyone here who is still around and does care, we can
easily add you back in.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-01-26 17:19:33 -08:00
Kevin Fenzi
70f0f6a3c2 package owner alias: switch to once every 3 hours
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-01-26 16:54:03 -08:00
02cb912277 lint fix 2023-01-27 00:08:30 +00:00
96c64b7ee4 docsbuilding: add alerting 2023-01-27 00:08:30 +00:00
caa4f85ed2 ocp_monitoring: move rules to openshift/project role 2023-01-27 00:08:30 +00:00
Adam Williamson
842db118e8 Bodhi: switch to grouped critpath for prod too (whee!)
Signed-off-by: Adam Williamson <awilliam@redhat.com>
2023-01-26 14:58:27 -08:00
Adam Williamson
4f78db48e7 greenwave: correct a test name in the recent policy addition
Signed-off-by: Adam Williamson <awilliam@redhat.com>
2023-01-26 14:24:52 -08:00
Adam Williamson
3788e4c09e grouped critpath: pass --srpm to the script
Bodhi wants critpath components by SRPM name, not binary RPM name.
The script was already being called with `--srpm` when used to
update the PDC data, we just forgot to do the same here.

Signed-off-by: Adam Williamson <awilliam@redhat.com>
2023-01-26 13:59:01 -08:00
Adam Williamson
1bc076ae4b greenwave: add a critical-path-apps policy
This will ensure we gate firefox updates on relevant tests.

Signed-off-by: Adam Williamson <awilliam@redhat.com>
2023-01-26 13:54:28 -08:00
Adam Williamson
fea60aab95 Bodhi: try and do critical path data generation in openshift
The previous attempt to have staging Bodhi use grouped JSON
critical path data didn't work because we did it in the backend
(i.e. the composer), not the openshift pods where the web UI
and the consumers run.

We need at least the web UI and consumer pods to have the
critpath data, as both those pods may create updates (Bodhi
decides if the update is critical path at creation time). This
attempts to handle that by having a daily openshift cron job
that runs a simple container with the necessary packages in it,
checks out the script, and runs it. It's run on a persistent
storage volume which is also mounted by all the Bodhi pods in
the place where Bodhi will look for the data.

The cron job frequency is temporarily set to once an hour; this
is so it will run soon after initial deployment. Once it has
run once we can cut it back to once a day.

Signed-off-by: Adam Williamson <awilliam@redhat.com>
2023-01-26 11:52:28 -08:00
Adam Williamson
95e4480d4c Bodhi: revert broken backend critpath generation bits
This doesn't work - it puts the critical path data on the
'backend', which is not where we need it to be. We need that
data in the openshift pods, there's another commit alongside
this one which tries to do that.

Signed-off-by: Adam Williamson <awilliam@redhat.com>
2023-01-26 11:52:28 -08:00
Kevin Fenzi
863ff38481 email aliases: add ibmcloud group alias
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-01-26 11:43:44 -08:00
Michal Konečný
54465d7a43 [release-monitoring] Sync buildconfig between staging and prod
With release of Anitya 1.7.0 the special cases for production are no longer
needed. Let's remove them.

Signed-off-by: Michal Konečný <mkonecny@redhat.com>
2023-01-26 13:26:37 +01:00
Kevin Fenzi
8c92cbe779 ocp prod/stg: bump control plane vms to 24gb from 16gm
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-01-25 15:34:48 -08:00
66726137ae websites: add alerts for pod/job/build errors 2023-01-25 22:41:39 +00:00
de196fd597 websites: remove unused i18n script 2023-01-25 22:41:39 +00:00
Kevin Fenzi
350a0d2393 add aarch64-test02 to cloud inventory too
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-01-25 11:33:43 -08:00
Kevin Fenzi
3a7bf1bb80 add a aarch64-test02 maintainer test instance
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-01-25 11:30:25 -08:00
Kevin Fenzi
e301a402ee delete copy pasta spew from ncurses bug
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-01-25 10:11:37 -08:00
Kevin Fenzi
abf5e66531 letsencrypt: move the combined cert to a template
content is "undefined" if using variables and you can't put a newline in
it, so just move this to a simple template.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-01-25 10:01:18 -08:00
Kevin Fenzi
925760331c letsencrypt: Try and put the combined cert in the format stunnel likes
Right now it's adding the cert without a newline at the end, but it also
expects the cert to be at the top and the intermediate below it. So,
swap them around and try putting a newline in it.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-01-25 09:56:31 -08:00
dd7cb18a31 Update Flock contacts
Justin Flory is the new Fedora Community Architect (formerly FCAIC)

Also, hopefully future-proof these aliases a bit by using the fcaic
alias instead naming the specific person. Fewer edits to make the next
time the role turns over.

Signed-off-by: Ben Cotton <bcotton@fedoraproject.org>
2023-01-25 09:31:32 -05:00
Kevin Fenzi
73c95d5331 base: add tag for basessh role
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-01-24 18:17:44 -08:00
Kevin Fenzi
36eaa708fa rsyncd: fix correctly this time (I hope)
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-01-24 16:22:03 -08:00
Kevin Fenzi
14a7ec3046 rsyncd: fix conditional harder
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-01-24 16:16:11 -08:00
Kevin Fenzi
921add1320 dns: adjust conditionals for dns servers using rhel9
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-01-24 16:07:50 -08:00
Kevin Fenzi
6c540c184f inventory / host_vars: change infrastructure.fedoraproject.org url to iad2 instead of old phx2 ip
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-01-24 14:55:39 -08:00
Kevin Fenzi
40d41ff6ea ns02.iad2: move to rhel9
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-01-24 14:47:23 -08:00
Jakub Kadlcik
07ed75c445 copr: upgrade also python3-copr-common on copr-keygen 2023-01-24 16:23:06 +01:00
Kevin Fenzi
8a4a4469e4 proxies: Override the httpd systemd unit file to up file limit and restart on failure
The proxies seem to be hitting file limits, so try increasing them.
Also, set httpd to restart on failure, this should help mask the problem
if it persists with the higher limit.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-01-22 13:51:23 -08:00
Kevin Fenzi
49e6526151 releng-bot: drop alias in favor of fixing address in fas
Right now releng-bot has a fas address of 'releng-bot@fedoraproject.org'
which is... confusing. The alias overrides this and sends email to
admin, but it results in a duplicate, causing the cron job to send mail
about the duplicate everytime newaliases run.

So, instead drop the alias here and switch the user in fas to be
admin+relengbot. This will still go to admin, not run into problems with
the address already in use in fas and should cause the newaliases to
stop complaining.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-01-21 12:16:37 -08:00
Adam Williamson
292f348bdd Cut openqa-x86-worker03 down to 20 workers
It seems to be struggling with memory exhaustion ATM, and I
think it's causing tests to run slower.

Signed-off-by: Adam Williamson <awilliam@redhat.com>
2023-01-21 08:56:35 -08:00
Kevin Fenzi
98476fcfbf registry-index-sync: move from 30 after to 40 after
Right now this cron is sometimes running while reg is updating the
index on sundries01, so move it out 10min to avoid that window.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-01-20 21:39:46 -08:00
Kevin Fenzi
12033a58af pagure: also have stunnel listen for ipv6 eventsource
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-01-20 12:49:31 -08:00
Kevin Fenzi
be19611e01 pagure: fix conditional in playbook
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-01-20 12:26:42 -08:00
Kevin Fenzi
a2bfd621ed letsencrypt: it is restart, not reload
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-01-20 12:13:54 -08:00
Kevin Fenzi
d44bc3991c pagure: handle stunnel bundled cert in letsencrypt renews
This commit removes the old tasks to try and create a cert/intermediate
bundle file for stunnel in favor of just doing it when we renew/get the
cert. It also fixes stunnel to use the correct bundled cert.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-01-20 11:55:13 -08:00
Kevin Fenzi
ff51231e77 pagure: set env to pagure to make sure it is set to something other than pagure-stg or nothing at all
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-01-20 10:58:39 -08:00
Kevin Fenzi
3bbfded937 cloud-noc-os01: add group and access to sysadmin-copr
This noc host in rdu-cc is used for pxe installs, dhcp server and also
to access the private mgmt network for hosts in rdu-cc. Setup it's own
group and add sysadmin-copr to access it so they can get to the mgmt
interfaces on various copr machines there to reboot them.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-01-20 10:11:42 -08:00
Kevin Fenzi
acaff7aff9 bodhi: also enable the prod infra-tags in stg because we move builds to prod from stg and then they are no longer there
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-01-19 19:03:12 -08:00
Adam Williamson
8445bb6e26 bodhi: switch to grouped JSON critical path data on staging
This switches the Bodhi staging instance to use (and regularly
update) its own grouped critical path data, instead of consuming
the data from PDC that is non-grouped and irregularly updated by
releng. If this works out well, we'll also apply it to prod.
This requires Bodhi 7 or higher.

Signed-off-by: Adam Williamson <awilliam@redhat.com>
2023-01-19 14:07:47 -08:00
Kevin Fenzi
d4bcf3b670 communishift: add communishift-planet for working on new planet
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-01-19 12:32:39 -08:00
Kevin Fenzi
296a5a4771 robosignatory: sign mass rebuild tag as builds happen
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-01-18 09:31:03 -08:00
David Kirwan
f8f758be97 mdapi: rollout to prod
Signed-off-by: David Kirwan <dkirwan@redhat.com>
Signed-off-by: Akashdeep Dhar <akashdeep.dhar@gmail.com>
2023-01-18 13:43:02 +09:00
Kevin Fenzi
c3718a166a pagure: put websites block back in place
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-01-17 16:10:04 -08:00
Stephen Smoogen
b5e7db6b83 Fix sshd_config for Fedora 36+
I have probably made this a bit wordy but I have found long jinja2
logic statements to be a bit hard to debug or go off the rails. So I
copied the section for Red Hat and used it for Fedora.

Signed-off-by: Stephen Smoogen <ssmoogen@redhat.com>
2023-01-17 17:43:18 -05:00
Pedro Moura
d3b4521668 Fix Upload TypeError #10877
Signed-off-by: Pedro Moura <pmoura@redhat.com>
2023-01-17 20:56:40 +00:00
Kevin Fenzi
88596cb852 pagure-dist-git: more config to quash info/debug output on https push
We also have fedora-messaging spewing out on pushes, so lets set that
back to warning, and we also need to change the default pagure logging
root to WARN.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-01-17 12:55:24 -08:00
Stephen Smoogen
0863d2c8a9 Fix item in sshd_config that Kevin found in review 2023-01-17 15:43:33 -05:00
Stephen Smoogen
1618137592 retry to make a branch with just the sshd config change. 2023-01-17 20:37:34 +00:00