feat(linker): optimize text modification by processing only changed segments by nsantacruz · Pull Request #3104 · Sefaria/Sefaria-Project

nsantacruz · 2026-02-18T20:47:56Z

This pull request refactors how text modifications are tracked and processed in sefaria/tracker.py. The main change is to only process and log segments of text that have actually changed, rather than re-processing the entire text every time. This should improve efficiency and accuracy in change tracking and downstream processing.

Key changes:

Segment-level change tracking:

Added a new helper function _post_modify_changed_segments that recursively compares the old and new text, and only calls post_modify_text for segments that have changed. This ensures that only modified segments are processed and logged, rather than the entire text.
Updated modify_text to use _post_modify_changed_segments instead of calling post_modify_text directly, and adjusted how the count_after parameter is handled to avoid redundant counting and indexing.
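
For reviewers who want a picture of the recursion described above, here is a minimal, self-contained sketch. It is not the code in sefaria/tracker.py: the names `changed_segments` and `post_modify_changed`, the use of index tuples in place of refs, and the choice to treat a missing segment as an empty string are all assumptions made for illustration.

```python
from typing import Callable, List, Tuple, Union

# A text is either a single segment (str) or a nested list of texts.
Text = Union[str, List["Text"]]
Change = Tuple[Tuple[int, ...], str, str]


def changed_segments(old: Text, new: Text, path: Tuple[int, ...] = ()) -> List[Change]:
    """Return (index_path, old_segment, new_segment) for each leaf that differs.

    Assumption: a segment missing on one side is compared as "".
    Assumption: if one side is a list and the other is a string, the string
    side is treated as an empty list (its content is ignored in this sketch).
    """
    if isinstance(old, list) or isinstance(new, list):
        old_items = old if isinstance(old, list) else []
        new_items = new if isinstance(new, list) else []
        changes: List[Change] = []
        for i in range(max(len(old_items), len(new_items))):
            old_child = old_items[i] if i < len(old_items) else ""
            new_child = new_items[i] if i < len(new_items) else ""
            changes.extend(changed_segments(old_child, new_child, path + (i,)))
        return changes
    old_seg = old or ""
    new_seg = new or ""
    return [] if old_seg == new_seg else [(path, old_seg, new_seg)]


def post_modify_changed(old: Text, new: Text,
                        hook: Callable[[Tuple[int, ...], str, str], None]) -> int:
    """Call the (hypothetical) per-segment hook only for changed leaves."""
    changes = changed_segments(old, new)
    for path, old_seg, new_seg in changes:
        hook(path, old_seg, new_seg)
    return len(changes)


old_text = [["a", "b"], ["c"]]
new_text = [["a", "B"], ["c", "d"]]
seen: List[Change] = []
count = post_modify_changed(old_text, new_text, lambda p, o, n: seen.append((p, o, n)))
```

In this sketch only the edited segment and the appended segment reach the hook; the two unchanged segments are skipped.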


Copilot

Pull request overview

This pull request refactors text modification tracking in sefaria/tracker.py to optimize performance by processing only changed segments rather than the entire text on each modification. The implementation introduces a new recursive helper function _post_modify_changed_segments that compares old and new text structures and selectively calls post_modify_text for segments that have changed.

Changes:

Added _post_modify_changed_segments function to recursively compare and process only changed segments
Modified modify_text to use the new segment-level tracking instead of processing the entire text
Adjusted count_after parameter handling to prevent redundant counting during segment iteration


mergify · 2026-02-18T21:29:12Z

🧪 CI Insights

Here's what we observed from your CI run for 55e96b5.

❌ Job Failures

Pipeline	Job	Health on `master`	Retries	🔍 CI Insights	📄 Logs
`Continuous`	`Continuous Testing: PyTest`		`0`	View	View

yonadavGit · 2026-02-25T11:47:29Z

sefaria/tracker.py

+        orig_count_after = kwargs.get("count_after", 1)
+        kwargs['count_after'] = False
+
+        _post_modify_changed_segments(user, action, oref, lang, vtitle, old_text, text, version_id, **kwargs)


The "kwargs all the way down" style here is a bit confusing, especially now that the counting logic has been modified. While we're at it, it may be worthwhile to take the counting-related vars out of kwargs land so it's clearer what's going on (concretely, making these vars explicit as much as possible).
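
One way this suggestion could look, as a rough sketch only: `count_after` becomes an explicit keyword-only parameter, each per-segment call is told explicitly not to count, and counting runs once at the end if the caller asked for it. The names `modify_segments` and `post_modify_segment`, the flat list of segments, and the `log` list standing in for real side effects are all hypothetical and not taken from sefaria/tracker.py.

```python
from typing import List, Optional, Tuple

Event = Tuple[str, Optional[int], bool]


def post_modify_segment(index: int, new_segment: str, *,
                        count_after: bool, log: List[Event]) -> None:
    """Hypothetical per-segment hook; records what it was asked to do."""
    log.append(("segment", index, count_after))


def modify_segments(old: List[str], new: List[str], *,
                    count_after: bool = True, log: List[Event]) -> None:
    """Process only changed segments, then count once at the end if requested."""
    for i in range(max(len(old), len(new))):
        old_seg = old[i] if i < len(old) else ""
        new_seg = new[i] if i < len(new) else ""
        if old_seg != new_seg:
            # Explicit argument instead of mutating kwargs['count_after'].
            post_modify_segment(i, new_seg, count_after=False, log=log)
    if count_after:
        # Stand-in for a single counting/indexing pass over the whole text.
        log.append(("count", None, True))


events: List[Event] = []
modify_segments(["a", "b", "c"], ["a", "x", "c", "d"], log=events)
```

With the flag in the signature, the caller's original preference never has to be saved and restored around the loop, which is the bookkeeping the diff above does with `orig_count_after`.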

feat(linker): optimize text modification by processing only changed s…

55e96b5


nsantacruz requested review from Copilot and yonadavGit February 18, 2026 20:48

Copilot started reviewing on behalf of nsantacruz February 18, 2026 20:49

nsantacruz wants to merge 1 commit into master from post-modify-text-only-on-changed
