Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Searches with submittedDate ranges give varying results #13

Open
kbroman opened this issue Sep 9, 2014 · 3 comments
Open

Searches with submittedDate ranges give varying results #13

kbroman opened this issue Sep 9, 2014 · 3 comments
Labels

Comments

@kbroman
Copy link
Member

kbroman commented Sep 9, 2014

The tests of sorted results are giving sporadic errors, in which the arxiv_search seems to retrieve fewer than the expected number of results.

I put some example results here. That's for three successive runs of test(), with no other changes. The first two gave errors (but not exactly the same errors), while the last run was clean.

Note that the error message

 Lengths (2, 1) differ (string compare on first 1)

means that the expected result had length 2 but the code was giving a result with length 1.

@kbroman kbroman added the bug label Sep 9, 2014
@kbroman kbroman changed the title Tests of API's sortBy and sortOrder give sporadic errors Tests of the arXiv API's sortBy and sortOrder give sporadic errors Sep 9, 2014
@kbroman kbroman changed the title Tests of the arXiv API's sortBy and sortOrder give sporadic errors Searches with submittedDate ranges give varying results Sep 9, 2014
@kbroman
Copy link
Member Author

kbroman commented Sep 9, 2014

I added some better R code with the example results with the problem, and also posted XML responses for a search that sometimes gives a single record and sometimes gives two records.

Here's the search that showed the problem:

http://export.arxiv.org/api/query?search_query=ti:deconvolution+AND+submittedDate:[199001010000+TO+201409062400]&max_results=2

I'd initially thought this issue occurred when I used sortBy or sortOrder, but rather it seems to happen when I use a range for submittedDate (hence the change in the title of this issue). For example, the following does not show any problems.

http://export.arxiv.org/api/query?search_query=ti:deconvolution&max_results=2

@kbroman
Copy link
Member Author

kbroman commented Sep 9, 2014

I posted a question on this to the arxiv-api discussion list.

@kbroman
Copy link
Member Author

kbroman commented Sep 10, 2014

I think that this problem is the same as this question at the arxiv-api discussion list. There's a bug in the case of repeated identical queries that has to do with the caching of search results.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

1 participant