Skip to content

Seems now garbage is returned no matterwhat #20

@Universal-Invariant

Description

@Universal-Invariant

I can no longer get archive ripper to download valid data. Returns garbage files so nothing works. As per my previous issue, it was sometimes working and now it no longer works at all(even to download just a few good files).

I'm wondering if something has been changed as it seems they have changed some of their code with borrowing.

DEBUG:root:attempting login with user-supplied credentials
DEBUG:urllib3.connectionpool:Starting new HTTPS connection (1): archive.org:443
DEBUG:urllib3.connectionpool:https://archive.org:443 "GET /account/login HTTP/1.1" 200 None
DEBUG:urllib3.connectionpool:https://archive.org:443 "POST /account/login HTTP/1.1" 200 None
DEBUG:root:user has logged in successfully
DEBUG:root:attempting to start scheduler
DEBUG:root:scheduler running borrow/renew for the first time
DEBUG:root:attempting to borrow book
DEBUG:urllib3.connectionpool:https://archive.org:443 "POST /services/loans/loan/ HTTP/1.1" 200 None
DEBUG:urllib3.connectionpool:https://archive.org:443 "POST /services/loans/loan/searchInside.php HTTP/1.1" 200 None
DEBUG:root:received book token:
DEBUG:root:time is 1757717951, time to renew book again
DEBUG:root:attempting to renew book
DEBUG:urllib3.connectionpool:https://archive.org:443 "POST /services/loans/loan/ HTTP/1.1" 200 None
DEBUG:root:renewed book token:
DEBUG:root:creating output dir "z:/fearfulstatepowe0000alis"
DEBUG:urllib3.connectionpool:https://archive.org:443 "GET /details/fearfulstatepowe0000alis HTTP/1.1" 200 None
DEBUG:urllib3.connectionpool:Starting new HTTPS connection (1): ia601701.us.archive.org:443
DEBUG:urllib3.connectionpool:https://ia601701.us.archive.org:443 "GET /BookReader/BookReaderJSIA.php?id=fearfulstatepowe0000alis&itemPath=/4/items/fearfulstatepowe0000alis&server=ia601701.us.archive.org&format=jsonp&subPrefix=fearfulstatepowe0000alis&requestUri=/details/fearfulstatepowe0000alis HTTP/1.1" 200 None
DEBUG:root:title: The fearful state : power, people, and internal war in South Asia, imagecount: 282
DEBUG:root:planning on fetching pages 0 thru 280

I have tried to borrow directly and allow the script to do it but always the same results. I have tried multiple books. Manually doing:

https://ia601701.us.archive.org/BookReader/BookReaderJSIA.php?id=fearfulstatepowe0000alis&itemPath=/4/items/fearfulstatepowe0000alis&server=ia601701.us.archive.org&format=jsonp&subPrefix=fearfulstatepowe0000alis&requestUri=/details/fearfulstatepowe0000alis

If I try to get the page directly using one of the links in the json return then

"uri": "https://ia601701.us.archive.org/BookReader/BookReaderImages.php?zip=/4/items/fearfulstatepowe0000alis/fearfulstatepowe0000alis_jp2.zip&file=fearfulstatepowe0000alis_jp2/fearfulstatepowe0000alis_0277.jp2&id=fearfulstatepowe0000alis",

Error serving request:
Image error: Image stack does not exist at

or

Error serving request:
Image error: not found

Page not found
We're sorry, the page you have requested is not available.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions