I can no longer get archive ripper to download valid data. Returns garbage files so nothing works. As per my previous issue, it was sometimes working and now it no longer works at all(even to download just a few good files).
I'm wondering if something has been changed as it seems they have changed some of their code with borrowing.
DEBUG:root:attempting login with user-supplied credentials
DEBUG:urllib3.connectionpool:Starting new HTTPS connection (1): archive.org:443
DEBUG:urllib3.connectionpool:https://archive.org:443 "GET /account/login HTTP/1.1" 200 None
DEBUG:urllib3.connectionpool:https://archive.org:443 "POST /account/login HTTP/1.1" 200 None
DEBUG:root:user has logged in successfully
DEBUG:root:attempting to start scheduler
DEBUG:root:scheduler running borrow/renew for the first time
DEBUG:root:attempting to borrow book
DEBUG:urllib3.connectionpool:https://archive.org:443 "POST /services/loans/loan/ HTTP/1.1" 200 None
DEBUG:urllib3.connectionpool:https://archive.org:443 "POST /services/loans/loan/searchInside.php HTTP/1.1" 200 None
DEBUG:root:received book token:
DEBUG:root:time is 1757717951, time to renew book again
DEBUG:root:attempting to renew book
DEBUG:urllib3.connectionpool:https://archive.org:443 "POST /services/loans/loan/ HTTP/1.1" 200 None
DEBUG:root:renewed book token:
DEBUG:root:creating output dir "z:/fearfulstatepowe0000alis"
DEBUG:urllib3.connectionpool:https://archive.org:443 "GET /details/fearfulstatepowe0000alis HTTP/1.1" 200 None
DEBUG:urllib3.connectionpool:Starting new HTTPS connection (1): ia601701.us.archive.org:443
DEBUG:urllib3.connectionpool:https://ia601701.us.archive.org:443 "GET /BookReader/BookReaderJSIA.php?id=fearfulstatepowe0000alis&itemPath=/4/items/fearfulstatepowe0000alis&server=ia601701.us.archive.org&format=jsonp&subPrefix=fearfulstatepowe0000alis&requestUri=/details/fearfulstatepowe0000alis HTTP/1.1" 200 None
DEBUG:root:title: The fearful state : power, people, and internal war in South Asia, imagecount: 282
DEBUG:root:planning on fetching pages 0 thru 280
I have tried to borrow directly and allow the script to do it but always the same results. I have tried multiple books. Manually doing:
https://ia601701.us.archive.org/BookReader/BookReaderJSIA.php?id=fearfulstatepowe0000alis&itemPath=/4/items/fearfulstatepowe0000alis&server=ia601701.us.archive.org&format=jsonp&subPrefix=fearfulstatepowe0000alis&requestUri=/details/fearfulstatepowe0000alis
If I try to get the page directly using one of the links in the json return then
"uri": "https://ia601701.us.archive.org/BookReader/BookReaderImages.php?zip=/4/items/fearfulstatepowe0000alis/fearfulstatepowe0000alis_jp2.zip&file=fearfulstatepowe0000alis_jp2/fearfulstatepowe0000alis_0277.jp2&id=fearfulstatepowe0000alis",
Error serving request:
Image error: Image stack does not exist at
or
Error serving request:
Image error: not found
Page not found
We're sorry, the page you have requested is not available.
I can no longer get archive ripper to download valid data. Returns garbage files so nothing works. As per my previous issue, it was sometimes working and now it no longer works at all(even to download just a few good files).
I'm wondering if something has been changed as it seems they have changed some of their code with borrowing.
DEBUG:root:attempting login with user-supplied credentials
DEBUG:urllib3.connectionpool:Starting new HTTPS connection (1): archive.org:443
DEBUG:urllib3.connectionpool:https://archive.org:443 "GET /account/login HTTP/1.1" 200 None
DEBUG:urllib3.connectionpool:https://archive.org:443 "POST /account/login HTTP/1.1" 200 None
DEBUG:root:user has logged in successfully
DEBUG:root:attempting to start scheduler
DEBUG:root:scheduler running borrow/renew for the first time
DEBUG:root:attempting to borrow book
DEBUG:urllib3.connectionpool:https://archive.org:443 "POST /services/loans/loan/ HTTP/1.1" 200 None
DEBUG:urllib3.connectionpool:https://archive.org:443 "POST /services/loans/loan/searchInside.php HTTP/1.1" 200 None
DEBUG:root:received book token:
DEBUG:root:time is 1757717951, time to renew book again
DEBUG:root:attempting to renew book
DEBUG:urllib3.connectionpool:https://archive.org:443 "POST /services/loans/loan/ HTTP/1.1" 200 None
DEBUG:root:renewed book token:
DEBUG:root:creating output dir "z:/fearfulstatepowe0000alis"
DEBUG:urllib3.connectionpool:https://archive.org:443 "GET /details/fearfulstatepowe0000alis HTTP/1.1" 200 None
DEBUG:urllib3.connectionpool:Starting new HTTPS connection (1): ia601701.us.archive.org:443
DEBUG:urllib3.connectionpool:https://ia601701.us.archive.org:443 "GET /BookReader/BookReaderJSIA.php?id=fearfulstatepowe0000alis&itemPath=/4/items/fearfulstatepowe0000alis&server=ia601701.us.archive.org&format=jsonp&subPrefix=fearfulstatepowe0000alis&requestUri=/details/fearfulstatepowe0000alis HTTP/1.1" 200 None
DEBUG:root:title: The fearful state : power, people, and internal war in South Asia, imagecount: 282
DEBUG:root:planning on fetching pages 0 thru 280
I have tried to borrow directly and allow the script to do it but always the same results. I have tried multiple books. Manually doing:
https://ia601701.us.archive.org/BookReader/BookReaderJSIA.php?id=fearfulstatepowe0000alis&itemPath=/4/items/fearfulstatepowe0000alis&server=ia601701.us.archive.org&format=jsonp&subPrefix=fearfulstatepowe0000alis&requestUri=/details/fearfulstatepowe0000alis
If I try to get the page directly using one of the links in the json return then
"uri": "https://ia601701.us.archive.org/BookReader/BookReaderImages.php?zip=/4/items/fearfulstatepowe0000alis/fearfulstatepowe0000alis_jp2.zip&file=fearfulstatepowe0000alis_jp2/fearfulstatepowe0000alis_0277.jp2&id=fearfulstatepowe0000alis",
Error serving request:
Image error: Image stack does not exist at
or
Error serving request:
Image error: not found
Page not found
We're sorry, the page you have requested is not available.