
Add submittedDate and lastUpdatedDate to the Wiki description of arXiv API #180

@sedimentation-fault

Please add the following to

https://github.com/ContentMine/getpapers/wiki/arxiv-query-format

Dates and ranges of dates

The terms submittedDate (date/time of first submission) and lastUpdatedDate (date/time of last revision) are particularly useful for limiting a search with many results, so that you may combine multiple searches together, each within some window of time, to get the full results.

The date/time information has the form YYYYMMDDHHMMSS, for example 20071018122534 for 2007-10-18 12:25:34. You can use * as a wildcard for the time portion. For example, to get all manuscripts with initial submission on 2007-10-18:

getpapers --api 'arxiv' --query "cat:math.AG AND submittedDate:20071018*"
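If the query strings are built by a script, the YYYYMMDDHHMMSS form maps directly onto a `strftime` pattern. The following is a minimal Python sketch; the helper names `arxiv_timestamp` and `arxiv_day_wildcard` are made up for illustration and are not part of getpapers or the arXiv API.

```python
from datetime import date, datetime


def arxiv_timestamp(moment: datetime) -> str:
    """Format a datetime as YYYYMMDDHHMMSS (hypothetical helper)."""
    return moment.strftime("%Y%m%d%H%M%S")


def arxiv_day_wildcard(day: date) -> str:
    """Format a date as YYYYMMDD followed by * for the time portion."""
    return day.strftime("%Y%m%d") + "*"


stamp = arxiv_timestamp(datetime(2007, 10, 18, 12, 25, 34))
query = f"cat:math.AG AND submittedDate:{arxiv_day_wildcard(date(2007, 10, 18))}"

print(stamp)
print(query)
```

The resulting `query` string is what would be passed to `--query` in the command above.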

But you can't use the wildcard within the date part itself. For example, the following does not work:

getpapers --api 'arxiv' --query "cat:math.AG AND submittedDate:2007*"

To get all manuscripts that were updated in a date range, use [from_date TO to_date]. (If you give a partial date, it's treated as the earliest date/time that matches, and the range appears to be up to but not including the second date/time.)

getpapers --api 'arxiv' --query "cat:math.AG AND lastUpdatedDate:[20170101* TO 20190827*]"
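To combine several searches, each limited to its own window of time, the date ranges can be generated rather than typed by hand. The sketch below splits a period into month-sized windows and prints one getpapers command per window. It assumes the behaviour described above, namely that a range stops just before its second date, so adjacent windows that share a boundary should not overlap. The function names `month_windows` and `windowed_queries` are hypothetical.

```python
from datetime import date


def month_windows(start: date, end: date):
    """Yield (window_start, window_end) pairs covering [start, end)."""
    current = start
    while current < end:
        # First day of the month after `current`.
        if current.month == 12:
            next_month = date(current.year + 1, 1, 1)
        else:
            next_month = date(current.year, current.month + 1, 1)
        window_end = min(next_month, end)
        yield current, window_end
        current = window_end


def windowed_queries(base_query: str, field: str, start: date, end: date):
    """Build one arXiv query string per window (hypothetical helper)."""
    return [
        f"{base_query} AND {field}:[{a:%Y%m%d}* TO {b:%Y%m%d}*]"
        for a, b in month_windows(start, end)
    ]


queries = windowed_queries(
    "cat:math.AG", "lastUpdatedDate", date(2017, 1, 1), date(2017, 4, 1)
)
for q in queries:
    # Print the command instead of running it, so the sketch has no side effects.
    print(f"getpapers --api 'arxiv' --query \"{q}\"")
```

Smaller windows (weeks or days) may be needed for categories with many results; the month size here is only an example.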

NOTE: To update a local collection, use lastUpdatedDate with a date range as above. submittedDate matches only the first submission, so it will not catch all updates, whereas lastUpdatedDate catches new submissions as well as updates.
(adapted from: https://cran.r-project.org/web/packages/aRxiv/vignettes/aRxiv.html)
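For the local-collection update described in the note, the query can be derived from the date of the last sync. This is only a sketch: `update_query` is a hypothetical helper, and pushing the end of the range to the following day is an assumption based on the range appearing to exclude its second date.

```python
from datetime import date, timedelta


def update_query(base_query: str, last_sync: date, today: date) -> str:
    """Build a lastUpdatedDate range from the last sync through today."""
    # Assumption: the range end is exclusive, so use tomorrow to include today.
    range_end = today + timedelta(days=1)
    return (
        f"{base_query} AND "
        f"lastUpdatedDate:[{last_sync:%Y%m%d}* TO {range_end:%Y%m%d}*]"
    )


example = update_query("cat:math.AG", date(2019, 8, 20), date(2019, 8, 27))
print(f"getpapers --api 'arxiv' --query \"{example}\"")
```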
