Skip to content

Conversation

@WilDoane
Copy link

@WilDoane WilDoane commented Oct 2, 2021

I've run across a valid DOCX XML structure where the commentStartRange and commentEndRange nodes are not siblings:

trimmed-down

This is an attempt to allow for accurate comment and anchor text extraction from such documents.

All original tests pass. devtools::check(args = c('--as-cran'), build_args = c('--resave-data')) passes

I've added a few additional tests to verify accurate anchor text, author, and initials extraction as well as a new test document that has this alternative XML structure.

I did not update the NEWS or DESCRIPTION files, since I didn't know whether you already had other updates in process.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants