Skip to content

fix(helm): exclude history, linker_output, webpages_text from main Mo…#3140

Merged
yodem merged 1 commit intomasterfrom
feature/sc-42542/exclude-linker-output-webpages-text-and
Mar 10, 2026
Merged

fix(helm): exclude history, linker_output, webpages_text from main Mo…#3140
yodem merged 1 commit intomasterfrom
feature/sc-42542/exclude-linker-output-webpages-text-and

Conversation

@yodem
Copy link
Collaborator

@yodem yodem commented Mar 10, 2026

Description

Excludes three large collections from the main mongodump command to reduce the primary backup size by ~26GB (from ~53GB to ~27GB).

Code Changes

helm-chart/sefaria/templates/configmap/create-mongo-dumps.yaml

  • Added --excludeCollection=history, --excludeCollection=linker_output, and --excludeCollection=webpages_text to the mongodump command alongside the existing django_cache and user_history exclusions

Notes

  • Agreed by Akiva, Noah, Yotam
  • history (18G) is constantly a superset of previous dumps and is backed up separately (sc-42544)
  • linker_output (6.5G) and webpages_text (1.9G) are linker website data not needed in the main backup, backed up separately (sc-42543)
  • Closes sc-42542

…ngoDB backup

Reduces main dump from ~53GB to ~27GB by excluding three large collections
agreed by Akiva, Noah, Yotam (sc-42542).

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
@mergify
Copy link

mergify bot commented Mar 10, 2026

🧪 CI Insights

Here's what we observed from your CI run for ee33f5b.

🟢 All jobs passed!

But CI Insights is watching 👀

@yodem yodem added this pull request to the merge queue Mar 10, 2026
Merged via the queue into master with commit c2548ee Mar 10, 2026
20 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants