Add more spiders which do not seem to honour robots.txt #2135
No reviewers
Labels
No labels
freeze-break-request
post-freeze
No milestone
No project
No assignees
3 participants
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference: Infrastructure/ansible#2135
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "spiders-gone-wild-20240708"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
I went through the last couple of logs afer the first round of 'turn
off the spiders' went out. I looked at the areas which the /robots.txt
disregard and then looked for the bots which ignored it and still
looked up stuff in 'accounts'. This may cut down CPU spikes as these
are looking at dynamic data which can 'blow' things up.
It might be good to add similar tooling to pagure and src since they
seem to be hit a lot in the logs also.
Signed-off-by: Stephen Smoogen ssmoogen@redhat.com
Build succeeded.
https://fedora.softwarefactory-project.io/zuul/buildset/6cbcf10e569b4d75b369dac91ef6beea
2 new commits added
Add blocks to nagios.conf httpd
Add blockers to dl.fedoraproject.org
Build succeeded.
https://fedora.softwarefactory-project.io/zuul/buildset/dcb195cb44d54fae8de7fdb7babfe217
rebased onto
377e83fdd1
rebased onto
377e83fdd1
Pull-Request has been merged by zlopez
Merged and deployed by running: