feat(superctguam): new scraper#2021
Conversation
Closes freelawproject#1939 New scraper for the Superior Court of Guam. Made a small change in the `guam` scraper so `superctguam` can inherit most of the code, they share a similar structure.
for more information, see https://pre-commit.ci
|
@claude review |
There was a problem hiding this comment.
Code review is billed via overage credits. To resume reviews, an organization admin can raise the monthly limit at claude.ai/admin-settings/claude-code.
Once credits are available, comment @claude review on this pull request to trigger a review.
grossir
left a comment
There was a problem hiding this comment.
This is working. Tested both the regular scraper and the backscraper
Some code style comments and the "status" comment
| from juriscraper.opinions.united_states.territories import guam | ||
|
|
||
|
|
||
| # Most of this site is the same as guam.py |
There was a problem hiding this comment.
I think we can drop this comment, since this Site is inheriting from guam, so its implicit (I know Zen of Python says otherwise, but we can do it this time)
| ) | ||
| docket = docket_spans[0].text_content() if docket_spans else "" | ||
| # \xc2\xa0 is mojibake, breaks the date regex, same as in guam.py | ||
| docket = re.sub(r"[\xa0\xc2]+", " ", docket).strip() |
There was a problem hiding this comment.
this regex is used again in _process_legacy_html, so you could re.compile as a class attribute, and reuse it
There was a problem hiding this comment.
used again in this method, when parsing description
| "case_dates": "2026-06-23", | ||
| "case_names": "People v. Paulis", | ||
| "download_urls": "https://guamcourts.gov/sites/default/files/CF0513-23-06-23-2026.pdf", | ||
| "precedential_statuses": "Published", |
There was a problem hiding this comment.
I am not sure if everything is "Published" . Some reading on the website may confirm it, but if we don't know we should use "Unknown"
| if not anchors: | ||
| continue |
There was a problem hiding this comment.
it's best to logger.warning, so a dev can inspect skipped rows
Solves #1939
New scraper for the Superior Court of Guam
Made a small change in the
guamscraper sosuperctguamcan inherit most of the code, they share a similar structure