Commit graph

1072 commits

Author SHA1 Message Date
Dusty Mabe
ab3ecc3bd7
fedora-coreos-pipeline: add alert_users
Not all the users that have access to the pipeline need to get alerted
when things fail.
2023-02-27 09:59:15 -05:00
Dusty Mabe
d9dd564d30
compose-tracker: update instructions for deleting project
The staging node is now os_control_stg[0].
2023-02-27 09:49:54 -05:00
Kevin Fenzi
5c9a733da9 openshift / fedocal / staging: disable cron job in staging
The fedocal cron jobs fail in staging because they try to send to
'localhost' for smtp server. We could redirect them to use bastion, but
then people would get a bunch of reminders from prod and staging and get
confused by it. Ideally, fedocal would have a way to just print emails
to stdout instead of sending to SMTP server to use for testing them in
staging, but for now, until we have that just disable the cron job in
staging.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-02-14 09:04:21 -08:00
Aurélien Bompard
c17a563d9b
Deploy the new FMN to prod
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2023-02-14 13:39:38 +01:00
Aurélien Bompard
92a71abfe9
FMN: add a cron job
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2023-02-14 11:07:44 +01:00
Kevin Fenzi
ef2b0d9435 blockerbugs: by request drop adamw from blockerbugs monitoring
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-02-13 15:15:37 -08:00
Kevin Fenzi
ac1fc1d617 openshift / various apps: add myself to appowners/alert users
I like alerts. Do you like alerts?
I like getting them so I can fix things.
So, adding myself here to all these apps so I can tell when pods are
crashing or builds are failing or whatever. :)

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-02-13 11:21:22 -08:00
d0704659a7 docsbuilding: set alert_users list 2023-02-05 22:49:44 +00:00
Kevin Fenzi
f0b98b904c Openshift apps: drop app owners that have left us
Before we enable any monitoring, we should clean up app owners some so
we do not spam people who arent around anymore and no longer care about
the app. ;)

If I removed anyone here who is still around and does care, we can
easily add you back in.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-01-26 17:19:33 -08:00
02cb912277 lint fix 2023-01-27 00:08:30 +00:00
96c64b7ee4 docsbuilding: add alerting 2023-01-27 00:08:30 +00:00
caa4f85ed2 ocp_monitoring: move rules to openshift/project role 2023-01-27 00:08:30 +00:00
Adam Williamson
fea60aab95 Bodhi: try and do critical path data generation in openshift
The previous attempt to have staging Bodhi use grouped JSON
critical path data didn't work because we did it in the backend
(i.e. the composer), not the openshift pods where the web UI
and the consumers run.

We need at least the web UI and consumer pods to have the
critpath data, as both those pods may create updates (Bodhi
decides if the update is critical path at creation time). This
attempts to handle that by having a daily openshift cron job
that runs a simple container with the necessary packages in it,
checks out the script, and runs it. It's run on a persistent
storage volume which is also mounted by all the Bodhi pods in
the place where Bodhi will look for the data.

The cron job frequency is temporarily set to once an hour; this
is so it will run soon after initial deployment. Once it has
run once we can cut it back to once a day.

Signed-off-by: Adam Williamson <awilliam@redhat.com>
2023-01-26 11:52:28 -08:00
66726137ae websites: add alerts for pod/job/build errors 2023-01-25 22:41:39 +00:00
Kevin Fenzi
31f11df469 koschei: increase timeout from 30s to 180s to allow loading larger queries
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-01-08 11:12:00 -08:00
Kevin Fenzi
5ca2b2eb36 os.fedoraproject.org / app.os.fedoraproject.org: remove more old openshift 3.11 cluster stuff
It may be that having this on some of the proxies is causing problems
because it's trying to ping the old openshift 3.11 cluster and filling
up apache slots with it. We do not need this stuff anymore, so remove
it.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2022-12-16 10:15:27 -08:00
Dusty Mabe
caf71e42b7
openshift-apps: remove build step from a few apps
Everytime we run the playbook a new build kicks off, but
the app was just restarted. So what happens is we end up
with the app getting started twice (once when the
deploymentconfig gets updated and once when the build finishes).
This could be bad if the app has some startup steps that need
to not be interrupted.

Let's just manually trigger builds since we have the permissions
to do that in the web interface and via the CLI.
2022-12-14 15:29:04 -05:00
Michal Konečný
222d670bf0 [mdapi] Add t0xic0der to appowners
https://pagure.io/fedora-infrastructure/issue/11053

Signed-off-by: Michal Konečný <mkonecny@redhat.com>
2022-12-14 10:43:32 +01:00
bd4c0ae2a1
websites: add missing deployment 2022-12-09 01:31:27 +01:00
dc0dcb3db9 websites: s3 creds will be manually fetched 2022-12-09 00:14:44 +00:00
5b5edd11eb websites: new main website deployment on stg 2022-12-09 00:14:44 +00:00
Dusty Mabe
8d70d287ca
fedora-coreos-pipeline: add Adam Piasecki (c4rt0) drop some others
The few we're dropping are either inactive or have moved on.
2022-12-06 09:59:24 -05:00
Aurélien Bompard
6871f63bdf
Register FMN with IPA
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2022-11-21 12:21:40 +01:00
Aurélien Bompard
a060cef52e
FMN: add a keytab for FASJSON access
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2022-11-21 10:56:42 +01:00
Aurélien Bompard
2b53f7bd03
Don't use the rabbit/queue role for vhost-specific queues
There are naming constraints on those queues and we don't want them.

Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2022-11-21 10:40:10 +01:00
Aurélien Bompard
c8c4dfe72c
FMN: setup the fmn vhost in rabbitmq
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2022-11-21 10:30:43 +01:00
Aurélien Bompard
5afe93f783
FMN: create the RabbitMQ user
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2022-11-21 09:49:19 +01:00
Timothée Ravier
95eaf4f2f1 playbooks/openshift-apps/silverblue: Minor whitespace fix 2022-11-19 02:23:49 +00:00
Timothée Ravier
0acae10155 playbooks/openshift-apps/silverblue: Add siosm & tpopela
Give @siosm and @tpopela access to the Silverblue website deployment.
2022-11-19 02:23:49 +00:00
Kevin Fenzi
310ddb9b36 greenwave: rabbitmq queue no longer needed
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2022-11-18 13:10:36 -08:00
Timothée Ravier
7da090f98e playbooks/openshift-apps/coreos-cincinnati: Fix yamllint warnings 2022-11-18 19:46:24 +00:00
Timothée Ravier
c41faa09b7 playbooks/openshift-apps/coreos-cincinnati: Add siosm to owners
Add @siosm to appowners for the Fedora CoreOS cincinnati project.
2022-11-18 19:46:24 +00:00
Aurélien Bompard
b03a8c6a93
First attempt at a FMN deployment in staging
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2022-11-18 11:40:00 +01:00
František Zatloukal
0d170a73de oraculum: Flower: try without rewrite-target 2022-10-06 16:17:16 +02:00
František Zatloukal
d928771ee4 oraculum: Flower: try different rewrite-target 2022-10-06 15:56:49 +02:00
Michal Konečný
10bca6af9e [release-monitoring] Sync production with staging for 1.6.0
With the release of 1.6.0 we can now remove the poetry specific changes for
staging and instead use the same for staging and production.

Signed-off-by: Michal Konečný <mkonecny@redhat.com>
2022-10-05 14:32:22 +02:00
Lukas Holecek
a59f582c09 greenwave: Update OpenShift templates for the new release 2022-09-30 16:23:24 +00:00
Dusty Mabe
61cb7a5f2c
openshift-apps/coreos-ostree-importer: drop changes for debugging
We had these settings in place while we were debugging. Let's drop
them now.
2022-09-29 15:00:41 -04:00
Dusty Mabe
a57e71e0b1
openshift-apps/coreos-ostree-importer: don't start build when deploying
We're debugging something right now and need no new builds to get
created so we don't lose some existing images that we need to go
back to for reference/debugging.
2022-09-29 14:44:51 -04:00
František Zatloukal
9d0b5b6d8a Blockerbugs: vars 2022-09-27 21:16:55 +02:00
František Zatloukal
e901716916 Blockerbugs: more tweaking of vars 2022-09-27 21:14:32 +02:00
František Zatloukal
e24d06b1f6 Blockerbugs: spearate stg/prod credentials 2022-09-27 20:58:21 +02:00
František Zatloukal
272f488d23 qa-landing: Move the project under the blockerbugs app 2022-09-27 19:03:25 +02:00
Michal Konečný
2326ab1da9 [release-monitoring] Scripts are named differently in poetry
Poetry creates it's own script file in /usr/local/ and the name is without .py.
Reflect that in ansible.

Signed-off-by: Michal Konečný <mkonecny@redhat.com>
2022-09-27 12:15:41 +02:00
Dusty Mabe
6f3f08942b
fedora-coreos-pipeline: add marmijo to FCOS pipeline users 2022-08-22 12:18:48 -04:00
Dusty Mabe
b9c6fb8916
openshift-apps: add comments about how to deploy CoreOS projects
All of these already had a comment about how to delete the projects.
Now we added a comment about how to deploy the project too, which
is nice for copy/pasting.

Also fixed up some of the other comments throughout.
2022-08-22 12:15:47 -04:00
Kevin Fenzi
209ca2f9d5 greenwave: set the route timeout on the correct cluster
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2022-08-11 10:04:28 -07:00
Steven Presti
248fbfd169 fedora-coreos-pipeline: add spresti to FCOS pipeline users 2022-08-08 10:50:38 -04:00
Dusty Mabe
f94ab9a54b
fedora-coreos-pipeline: fix yamllint errors
Was getting this error from zuul:

playbooks/openshift-apps/fedora-coreos-pipeline.yml
  41:12     error    too many spaces inside brackets  (brackets)
  41:26     error    too many spaces inside brackets  (brackets)
2022-07-29 11:09:00 -04:00
Dusty Mabe
86fc86faca
fedora-coreos-pipeline: add Sumantro to FCOS pipeline users
He is going to be working with us more helping chase down test
failures. Let's get him access to the pipeline.
2022-07-29 11:09:00 -04:00