feat(superctguam): new scraper#2021

Open
giancohs wants to merge 3 commits into
freelawproject:mainfrom
giancohs:1939-implement-guam-superior-court

Conversation

@giancohs

Contributor

Solves #1939

New scraper for the Superior Court of Guam

Made a small change in the `guam` scraper so that `superctguam` can inherit most of its code, since the two sites share a similar structure.

@claude claude Bot left a comment

Claude Code Review

This pull request is from a fork — automated review is disabled. A repository maintainer can comment @claude review to run a one-time review.

@grossir grossir moved this to PRs to Review in Sprint (Case Law) Jun 29, 2026
@grossir

grossir commented Jun 30, 2026

Contributor

@claude review

@claude claude Bot left a comment

⚠️ Code review skipped — your organization's overage spend limit has been reached.

Code review is billed via overage credits. To resume reviews, an organization admin can raise the monthly limit at claude.ai/admin-settings/claude-code.

Once credits are available, comment @claude review on this pull request to trigger a review.

@grossir grossir left a comment

Contributor

This is working. I tested both the regular scraper and the backscraper.

I left some code style comments, plus the "status" comment.

from juriscraper.opinions.united_states.territories import guam


# Most of this site is the same as guam.py

I think we can drop this comment: this Site inherits from `guam`, so it's implicit (I know the Zen of Python says otherwise, but we can do it this time).

)
docket = docket_spans[0].text_content() if docket_spans else ""
# \xc2\xa0 is mojibake, breaks the date regex, same as in guam.py
docket = re.sub(r"[\xa0\xc2]+", " ", docket).strip()

This regex is used again in `_process_legacy_html`, so you could `re.compile` it as a class attribute and reuse it.

It is also used again in this method, when parsing the description.
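
For illustration, compiling the pattern once and reusing it could look roughly like the sketch below. The class name `SiteSketch`, the attribute name `nbsp_regex`, and the helper `clean_text` are hypothetical and only stand in for whatever the scraper actually uses; the pattern itself is the one from the diff.

```python
import re


class SiteSketch:
    # Hypothetical attribute name. Compiled once at class creation and
    # shared by every method that needs to strip the stray characters.
    nbsp_regex = re.compile(r"[\xa0\xc2]+")

    def clean_text(self, text: str) -> str:
        # Hypothetical helper: replace runs of the stray characters with
        # a single space, then trim the ends.
        return self.nbsp_regex.sub(" ", text).strip()


site = SiteSketch()
docket = site.clean_text("CF0513-23\xc2\xa0 ")
```

With something like this in place, both the docket parsing and the description parsing could call the same compiled pattern instead of repeating the `re.sub` literal.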

"case_dates": "2026-06-23",
"case_names": "People v. Paulis",
"download_urls": "https://guamcourts.gov/sites/default/files/CF0513-23-06-23-2026.pdf",
"precedential_statuses": "Published",

I am not sure that everything is "Published". Some reading on the website may confirm it, but if we don't know, we should use "Unknown".

Comment on lines +119 to +120
if not anchors:
    continue

It's best to call `logger.warning` here, so a dev can inspect skipped rows.
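
As a rough sketch of what that might look like: the logger below is a plain standard-library logger standing in for the project's own, and the function name `rows_with_anchors` and the `(label, anchors)` row shape are assumptions made only for this example, not the structure used in the PR.

```python
import logging

# Stand-in logger for the example; the scraper would use the project's logger.
logger = logging.getLogger("superctguam_sketch")


def rows_with_anchors(rows):
    """Keep rows that have at least one anchor, warning about the rest.

    `rows` is assumed to be a list of (label, anchors) pairs.
    """
    kept = []
    for label, anchors in rows:
        if not anchors:
            # Record which row was skipped so it can be inspected later.
            logger.warning("Skipping row without a link: %s", label)
            continue
        kept.append((label, anchors))
    return kept
```

The control flow is the same as the `continue` in the diff; the only difference is that a skipped row leaves a trace in the logs.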

@grossir grossir linked an issue Jul 1, 2026 that may be closed by this pull request

Labels

None yet

Projects

Status: PRs to Review

Development

Successfully merging this pull request may close these issues.

Create a scraper for Guam's Superior Court

3 participants