Commit graph

1083 commits

Author SHA1 Message Date
Aurélien Bompard
e4deeb7731
Comment the FMN playbook and role
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2023-04-11 11:53:35 +02:00
be32dcebb4 websites: new main website deployment on prod 2023-03-31 16:58:12 +00:00
Aurélien Bompard
5c0ab4d9d0
Revert "messaging-bridge: leverage the rabbit/queue role and add monitoring to the queues"
The messaging bridge queues have very specific setup, we can't use the
rabbit/queue role because it binds all queues to both amq.topic and
zmq.topic and we don't wan't that for the bridges.

This reverts commit 649eec104d.
2023-03-31 14:59:02 +02:00
Kevin Fenzi
8b3d4cd54e waiverdb: remove the playbook call to the now removed secrets
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-03-30 13:33:19 -07:00
Adam Saleh
6a554ace71 Fix alignment in monitor_gating 2023-03-24 18:06:43 +01:00
Adam Saleh
a6c9cf7b9c Fix alignment in monitor_gating 2023-03-24 18:00:42 +01:00
Adam Saleh
46af5b0b48 Added dashboard to monitor-gating 2023-03-24 17:57:38 +01:00
Adam Saleh
f913c55e33 Limit the dashboard project just to graphana 2023-03-24 11:46:36 +01:00
Adam Saleh
c975c74610 Updating monitor-dashboard to ocp4 2023-03-23 14:53:34 +01:00
Dusty Mabe
7627f7682a
compose-tracker: implement suggestions by yamllint 2023-02-27 10:23:07 -05:00
Dusty Mabe
424b882aad
fedora-coreos-pipeline: add Shilpi Das (shilpi-das1) 2023-02-27 09:59:15 -05:00
Dusty Mabe
ab3ecc3bd7
fedora-coreos-pipeline: add alert_users
Not all the users that have access to the pipeline need to get alerted
when things fail.
2023-02-27 09:59:15 -05:00
Dusty Mabe
d9dd564d30
compose-tracker: update instructions for deleting project
The staging node is now os_control_stg[0].
2023-02-27 09:49:54 -05:00
Kevin Fenzi
5c9a733da9 openshift / fedocal / staging: disable cron job in staging
The fedocal cron jobs fail in staging because they try to send to
'localhost' for smtp server. We could redirect them to use bastion, but
then people would get a bunch of reminders from prod and staging and get
confused by it. Ideally, fedocal would have a way to just print emails
to stdout instead of sending to SMTP server to use for testing them in
staging, but for now, until we have that just disable the cron job in
staging.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-02-14 09:04:21 -08:00
Aurélien Bompard
c17a563d9b
Deploy the new FMN to prod
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2023-02-14 13:39:38 +01:00
Aurélien Bompard
92a71abfe9
FMN: add a cron job
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2023-02-14 11:07:44 +01:00
Kevin Fenzi
ef2b0d9435 blockerbugs: by request drop adamw from blockerbugs monitoring
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-02-13 15:15:37 -08:00
Kevin Fenzi
ac1fc1d617 openshift / various apps: add myself to appowners/alert users
I like alerts. Do you like alerts?
I like getting them so I can fix things.
So, adding myself here to all these apps so I can tell when pods are
crashing or builds are failing or whatever. :)

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-02-13 11:21:22 -08:00
d0704659a7 docsbuilding: set alert_users list 2023-02-05 22:49:44 +00:00
Kevin Fenzi
f0b98b904c Openshift apps: drop app owners that have left us
Before we enable any monitoring, we should clean up app owners some so
we do not spam people who arent around anymore and no longer care about
the app. ;)

If I removed anyone here who is still around and does care, we can
easily add you back in.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-01-26 17:19:33 -08:00
02cb912277 lint fix 2023-01-27 00:08:30 +00:00
96c64b7ee4 docsbuilding: add alerting 2023-01-27 00:08:30 +00:00
caa4f85ed2 ocp_monitoring: move rules to openshift/project role 2023-01-27 00:08:30 +00:00
Adam Williamson
fea60aab95 Bodhi: try and do critical path data generation in openshift
The previous attempt to have staging Bodhi use grouped JSON
critical path data didn't work because we did it in the backend
(i.e. the composer), not the openshift pods where the web UI
and the consumers run.

We need at least the web UI and consumer pods to have the
critpath data, as both those pods may create updates (Bodhi
decides if the update is critical path at creation time). This
attempts to handle that by having a daily openshift cron job
that runs a simple container with the necessary packages in it,
checks out the script, and runs it. It's run on a persistent
storage volume which is also mounted by all the Bodhi pods in
the place where Bodhi will look for the data.

The cron job frequency is temporarily set to once an hour; this
is so it will run soon after initial deployment. Once it has
run once we can cut it back to once a day.

Signed-off-by: Adam Williamson <awilliam@redhat.com>
2023-01-26 11:52:28 -08:00
66726137ae websites: add alerts for pod/job/build errors 2023-01-25 22:41:39 +00:00
Kevin Fenzi
31f11df469 koschei: increase timeout from 30s to 180s to allow loading larger queries
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2023-01-08 11:12:00 -08:00
Kevin Fenzi
5ca2b2eb36 os.fedoraproject.org / app.os.fedoraproject.org: remove more old openshift 3.11 cluster stuff
It may be that having this on some of the proxies is causing problems
because it's trying to ping the old openshift 3.11 cluster and filling
up apache slots with it. We do not need this stuff anymore, so remove
it.

Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2022-12-16 10:15:27 -08:00
Dusty Mabe
caf71e42b7
openshift-apps: remove build step from a few apps
Everytime we run the playbook a new build kicks off, but
the app was just restarted. So what happens is we end up
with the app getting started twice (once when the
deploymentconfig gets updated and once when the build finishes).
This could be bad if the app has some startup steps that need
to not be interrupted.

Let's just manually trigger builds since we have the permissions
to do that in the web interface and via the CLI.
2022-12-14 15:29:04 -05:00
Michal Konečný
222d670bf0 [mdapi] Add t0xic0der to appowners
https://pagure.io/fedora-infrastructure/issue/11053

Signed-off-by: Michal Konečný <mkonecny@redhat.com>
2022-12-14 10:43:32 +01:00
bd4c0ae2a1
websites: add missing deployment 2022-12-09 01:31:27 +01:00
dc0dcb3db9 websites: s3 creds will be manually fetched 2022-12-09 00:14:44 +00:00
5b5edd11eb websites: new main website deployment on stg 2022-12-09 00:14:44 +00:00
Dusty Mabe
8d70d287ca
fedora-coreos-pipeline: add Adam Piasecki (c4rt0) drop some others
The few we're dropping are either inactive or have moved on.
2022-12-06 09:59:24 -05:00
Aurélien Bompard
6871f63bdf
Register FMN with IPA
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2022-11-21 12:21:40 +01:00
Aurélien Bompard
a060cef52e
FMN: add a keytab for FASJSON access
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2022-11-21 10:56:42 +01:00
Aurélien Bompard
2b53f7bd03
Don't use the rabbit/queue role for vhost-specific queues
There are naming constraints on those queues and we don't want them.

Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2022-11-21 10:40:10 +01:00
Aurélien Bompard
c8c4dfe72c
FMN: setup the fmn vhost in rabbitmq
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2022-11-21 10:30:43 +01:00
Aurélien Bompard
5afe93f783
FMN: create the RabbitMQ user
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2022-11-21 09:49:19 +01:00
Timothée Ravier
95eaf4f2f1 playbooks/openshift-apps/silverblue: Minor whitespace fix 2022-11-19 02:23:49 +00:00
Timothée Ravier
0acae10155 playbooks/openshift-apps/silverblue: Add siosm & tpopela
Give @siosm and @tpopela access to the Silverblue website deployment.
2022-11-19 02:23:49 +00:00
Kevin Fenzi
310ddb9b36 greenwave: rabbitmq queue no longer needed
Signed-off-by: Kevin Fenzi <kevin@scrye.com>
2022-11-18 13:10:36 -08:00
Timothée Ravier
7da090f98e playbooks/openshift-apps/coreos-cincinnati: Fix yamllint warnings 2022-11-18 19:46:24 +00:00
Timothée Ravier
c41faa09b7 playbooks/openshift-apps/coreos-cincinnati: Add siosm to owners
Add @siosm to appowners for the Fedora CoreOS cincinnati project.
2022-11-18 19:46:24 +00:00
Aurélien Bompard
b03a8c6a93
First attempt at a FMN deployment in staging
Signed-off-by: Aurélien Bompard <aurelien@bompard.org>
2022-11-18 11:40:00 +01:00
František Zatloukal
0d170a73de oraculum: Flower: try without rewrite-target 2022-10-06 16:17:16 +02:00
František Zatloukal
d928771ee4 oraculum: Flower: try different rewrite-target 2022-10-06 15:56:49 +02:00
Michal Konečný
10bca6af9e [release-monitoring] Sync production with staging for 1.6.0
With the release of 1.6.0 we can now remove the poetry specific changes for
staging and instead use the same for staging and production.

Signed-off-by: Michal Konečný <mkonecny@redhat.com>
2022-10-05 14:32:22 +02:00
Lukas Holecek
a59f582c09 greenwave: Update OpenShift templates for the new release 2022-09-30 16:23:24 +00:00
Dusty Mabe
61cb7a5f2c
openshift-apps/coreos-ostree-importer: drop changes for debugging
We had these settings in place while we were debugging. Let's drop
them now.
2022-09-29 15:00:41 -04:00
Dusty Mabe
a57e71e0b1
openshift-apps/coreos-ostree-importer: don't start build when deploying
We're debugging something right now and need no new builds to get
created so we don't lose some existing images that we need to go
back to for reference/debugging.
2022-09-29 14:44:51 -04:00