-
Notifications
You must be signed in to change notification settings - Fork 37
Description
Please add the following to
https://github.com/ContentMine/getpapers/wiki/arxiv-query-format
Dates and ranges of dates
The terms submittedDate (date/time of first submission) and lastUpdatedDate (date/time of last revision) are particularly useful for limiting a search with many results, so that you may combine multiple searches together, each within some window of time, to get the full results.
The date/time information is of the form YYYYMMDDHHMMSS, for example 20071018122534 for 2007-10-18 12:25:34. You can use * for a wildcard for the times. For example, to get all manuscripts with initial submission on 2007-10-18:
getpapers --api 'arxiv' --query "cat:math.AG AND submittedDate:20071018*"
But you can't use the wildcard within the dates.
getpapers --api 'arxiv' --query "cat:math.AG AND submittedDate:2007*"
To get all manuscripts that were updated in a date range, use [from_date TO to_date]. (If you give a partial date, it's treated as the earliest date/time that matches, and the range appears to be up to but not including the second date/time.)
getpapers --api 'arxiv' --query "cat:math.AG AND lastUpdatedDate:[20170101* TO 20190827*]"
NOTE: For updates of a local collection the lastUpdatedDate parameter and a date range as above are the right options - submittedDate will not catch all updates, whereas lastUpdatedDate will also catch submissions, in addition to updates.
(adapted from: https://cran.r-project.org/web/packages/aRxiv/vignettes/aRxiv.html)