Pause crawls instead of stopping when quotas are reached or archiving is disabled #2997

tw4l · 2025-11-18T19:18:39Z

Full backend and frontend implementation, with a new email notification to org admins when a crawl is paused because an org quota has been reached.

Backend changes

Modify operator to auto-pause crawls when quotas are reached or archiving is disabled rather than stopping the crawls
Add new crawl states: paused_storage_quota_reached, paused_time_quota_reached, paused_org_readonly
Add uploaded WACZs to org storage totals immediately after upload so that auto-paused crawls will actually put the org's bytesStored above the storage quota
Send an email from new template to all org admins when a crawl is auto-paused with information about what to do
Fix datetime deprecation in tests

Frontend changes

Add new paused crawl states
Update checks throughout frontend for whether crawl is paused to compare against all paused states

Needs attention

There is a bug/race condition where sometimes when a crawl is pausing, the uploaded WACZ's size is added to status.filesAddedSize, then added again to stats.size (see TODO comment in crawl operator code) again, which effectively doubles the stats.size of the crawl and results in the crawl seeming larger than it is. I've attempted a few solutions for this such as not adding status.filesAddedSize to stats.size is the crawl is pausing, but no solution I've attempted has consistently resolved the issue without introducing other side effects. I think this may have a downstream effect at times on the storage quota check in is_crawl_stopping - I have that check now subtracting the size of already-uploaded WACZs from the active crawl size that's used in checking whether active crawls will put the org over its storage quota, but if it's inconsistent whether the previously-uploaded WACZs are included in stats.size or not, the check might become inaccurate at times.
In commit 217e935, I've attempted to fix how workflow crawl counts are handled - previously, every crawl (whether successful or failed) would increment crawlSuccessfulCount - this change could use a second pair of eyes to make sure it makes sense - I'm not entirely sure crawlSuccessfulCount is intended to mean crawls that ended with a successful state, or just crawls that completed in any form. This field does not appear to be used in the frontend in any form, and might be inconsistent if we switch how it's counted now without a migration, so maybe this is handled better separately?

Needs to be tested, just pushing as-is so that I can pick it up next week. There's an issue in local testing where crawls sometimes appear to be twice as big as they really are, which is making Browsertrix think the storage quota is reached prematurely. I haven't yet pinned down the cause of this and it seems intermittent.

…failed

tw4l added 21 commits November 18, 2025 15:13

Pause crawls instead of stopping when quotas are reached

6cd525a

Update nightly tests

34f9e2d

Update frontend for new paused states

46d4a79

Fix comments

dedbfd6

Fix status.stopReason handling for paused states

d4a9a11

Fix datetime deprecation in nightly test fixture

7c51f2d

WIP: Mark current issues with some TODOs

d41c1a4

WIP: Add debug logging to beginning of sync_crawls

2ca3bcc

Modify execution time test to account for pausing

8d18277

WIP: Add email notification

2cb9146

Inc org bytes stored when crawl files are added, not at end of crawl

7129d45

More incremental storage work

68ef7c8

One more TODO

94e6807

Move paused with no stop reason condition below quota checks

6d9f2c8

Decrement org in delete_failed_crawl_files

3468552

Shorten docstring

ab5ff3a

Fix email sending (but still not yet idempotent)

22a7bc5

Only send auto-paused emails once

b17ecd2

Add TODO to address already-existing bug that now matters more

db80a95

Fix bug where all crawls are added to workflow as successful even if …

217e935

…failed

tw4l force-pushed the issue-2957-pause-crawl-on-quota-reached branch from f7568a3 to 217e935 Compare November 18, 2025 20:13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Pause crawls instead of stopping when quotas are reached or archiving is disabled #2997

Pause crawls instead of stopping when quotas are reached or archiving is disabled #2997

Uh oh!

tw4l commented Nov 18, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Pause crawls instead of stopping when quotas are reached or archiving is disabled #2997

Are you sure you want to change the base?

Pause crawls instead of stopping when quotas are reached or archiving is disabled #2997

Uh oh!

Conversation

tw4l commented Nov 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Backend changes

Frontend changes

Needs attention

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

tw4l commented Nov 18, 2025 •

edited

Loading