bodhi can't copy flatpaks to stable registry #9027
4 participants
Reference: Infrastructure/fedora-infrastructure#9027
Our updates flow is blocked by a flatpak stable push:
https://bodhi.fedoraproject.org/composes/F32F/stable
As far as I understand it, all bodhi does in this case is copy the flatpak from the candidate registry to the normal one and update any updates/etc.
The copy is failing, and when I manually run the command that bodhi is running I get:
The registry box is Fedora 32 now in iad2, where it was Fedora 30 in phx2. But the version of docker-distribution seems pretty much the same. :(
@cverna @mohanboddu @otaylor @kalev
Can any of you see what's going on here? We really need to unblock our flow of updates...
I'll note that if you go to the URL it gets a 503 on, it says:
"OCI manifest found, but accept header does not support OCI manifests"
old registry: docker-distribution-2.6.2-9.git48294d9.fc30.x86_64
new registry: docker-distribution-2.6.2-11.git48294d9.fc32.x86_64
That's because you need to send a request with an appropriate Accept header (see ebb886e739/bodhi/server/scripts/skopeo_lite.py (L333)). Doing so from my PC works fine, so maybe there's some DNS problem connecting the Bodhi server to the registry box?
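For illustration, a minimal sketch of a manifest request that advertises OCI support in its Accept header. This is not the code from skopeo_lite.py; the function name, the example hostname, and the repository name are hypothetical, and the media types are the standard OCI and Docker v2 manifest types.

```python
import urllib.request

# Standard manifest media types (OCI image spec and Docker registry v2).
MANIFEST_MEDIA_TYPES = [
    "application/vnd.oci.image.manifest.v1+json",
    "application/vnd.oci.image.index.v1+json",
    "application/vnd.docker.distribution.manifest.v2+json",
    "application/vnd.docker.distribution.manifest.list.v2+json",
]


def build_manifest_request(registry, repository, reference):
    """Build (but do not send) a GET request for a manifest.

    Hypothetical helper: without OCI types in Accept, a registry holding an
    OCI manifest can answer with an error like the one quoted above.
    """
    url = f"https://{registry}/v2/{repository}/manifests/{reference}"
    headers = {"Accept": ", ".join(MANIFEST_MEDIA_TYPES)}
    return urllib.request.Request(url, headers=headers)


if __name__ == "__main__":
    # Hostname and repository are placeholders, not real Fedora endpoints.
    req = build_manifest_request("registry.example.org", "myapp", "latest")
    print(req.full_url)
    print(req.get_header("Accept"))
```

Passing the resulting request to urllib.request.urlopen would perform the actual fetch; the sketch stops short of that so it can be run without network access.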
It appears that the problem is possibly that the image is already on the destination (public registry) and skopeo-lite doesn't handle that. I'm not sure why a 503 is being generated by the HTTPD proxy rather than passing back the actual status code / error message, which is probably more informative.
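One way a copy tool could guard against the "already on the destination" case is to probe the destination with a HEAD request before pushing. The sketch below is an assumption about how that might look, not how skopeo-lite behaves; the function name and the injectable opener parameter are hypothetical.

```python
import urllib.error
import urllib.request


def destination_has_manifest(url, accept, opener=urllib.request.urlopen):
    """Return True if the manifest URL already exists on the destination.

    A 404 means "not there yet, safe to copy". Any other HTTP error
    (for example a 503 from a proxy) is re-raised so it is not mistaken
    for a missing image.
    """
    req = urllib.request.Request(url, method="HEAD", headers={"Accept": accept})
    try:
        with opener(req) as resp:
            return resp.status == 200
    except urllib.error.HTTPError as err:
        if err.code == 404:
            return False
        raise
```

Re-raising everything except 404 keeps a proxy failure visible to the caller instead of folding it into "image missing" or "image present".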
I don't have much of an idea why the image would already be there - maybe a previous container push failed because of relocation stuff? Worth looking in the logs to see what the first failure on this push was - it might be different.
I probably won't have time to investigate further or come up with a fix for skopeo-lite until Monday. Is it possible to unqueue this one update for 0ad and see what happens with the next?
ok, that was a saga. ;) After many hours it's working and I was able to complete the push.
Along the way:
The 503 errors were due to firewall / proxy issues in the new datacenter. First on candidate registry, then on the final one. Got all those worked around until we can properly fix them next week by routing over our vpn instead of trying to reach those hosts directly.
Then I saw that the flatpak was already copied over, so I thought: why not delete it from the registry and let bodhi copy it again? First I had to get oci-registry02 working as well, as all the deletes and writes go to it instead of 01. After that I ran into a side problem: docker-distribution was configured to allow deletes but was still refusing them. Finally I realized the config had been put in place, but docker-distribution was never restarted. I restarted it and was able to delete the flatpak from the registry.
Then, on running bodhi, it could no longer find the content in the candidate registry. Turns out the candidate registry somehow had /srv/oci_registry mounted. IT HAD THE SAME CONTENT AS PROD! I unmounted that and found that on our old candidate registry it was just local disk.
So, luckily I was still able to get into our old datacenter and copy all the old content off the old candidate registry onto the new one. As an aside, the instance had too small a disk and I had to resize it to get the old content to fit.
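As a rough sketch of the delete step: the registry v2 API deletes a manifest by digest (not by tag) with a DELETE request. The helper below only builds such a request; its name and the placeholder hostname are hypothetical, and the thread does not show the exact command that was used. In upstream distribution the relevant setting is storage.delete.enabled, which is presumably the config mentioned above, and it only takes effect once the service is restarted.

```python
import re
import urllib.request

# Manifests are deleted by content digest; this sketch accepts sha256 only.
DIGEST_RE = re.compile(r"^sha256:[0-9a-f]{64}$")


def build_delete_request(registry, repository, digest):
    """Build (but do not send) a DELETE request for a manifest digest."""
    if not DIGEST_RE.match(digest):
        raise ValueError("manifest deletes need a sha256 digest, not a tag")
    url = f"https://{registry}/v2/{repository}/manifests/{digest}"
    return urllib.request.Request(url, method="DELETE")


if __name__ == "__main__":
    # Placeholder digest and hostname, for illustration only.
    req = build_delete_request("registry.example.org", "myapp", "sha256:" + "0" * 64)
    print(req.get_method(), req.full_url)
```

A registry that still refuses the request after the config change is a hint that the running process has not picked up the new configuration.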
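A quick check like the following could flag this kind of surprise, where a storage path turns out to be a mount rather than local disk. It is only a sketch; the function name is hypothetical and the path is the one mentioned above.

```python
import os


def describe_storage(path):
    """Report whether a path is missing, a mount point, or a plain directory."""
    if not os.path.isdir(path):
        return "missing"
    if os.path.ismount(path):
        return "mount point"
    return "local directory"


if __name__ == "__main__":
    # On the candidate registry this was expected to be local disk.
    print(describe_storage("/srv/oci_registry"))
```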
Finally the update push worked as expected. Whew.
Thanks, @kevin, for figuring this out!