Handles nested pages, so two sets of pages, a list page and then a download sub-page.

Contributors woodbine blablupcom

Last run failed with status code 1.

Console output of last run

Injecting configuration and compiling... Injecting scraper and running... Traceback (most recent call last): File "scraper.py", line 92, in <module> html = urllib2.urlopen(url) File "/app/.heroku/python/lib/python2.7/urllib2.py", line 127, in urlopen return _opener.open(url, data, timeout) File "/app/.heroku/python/lib/python2.7/urllib2.py", line 410, in open response = meth(req, response) File "/app/.heroku/python/lib/python2.7/urllib2.py", line 523, in http_response 'http', request, response, code, msg, hdrs) File "/app/.heroku/python/lib/python2.7/urllib2.py", line 448, in error return self._call_chain(*args) File "/app/.heroku/python/lib/python2.7/urllib2.py", line 382, in _call_chain result = func(*args) File "/app/.heroku/python/lib/python2.7/urllib2.py", line 531, in http_error_default raise HTTPError(req.get_full_url(), code, msg, hdrs, fp) urllib2.HTTPError: HTTP Error 404: Not Found

Data

Downloaded 810 times by SimKennedy MikeRalphson woodbine

To download data sign in with GitHub

Download table (as CSV) Download SQLite database (27 KB) Use the API

rows 10 / 90

Statistics

Average successful run time: 5 minutes

Total run time: 18 days

Total cpu time used: about 5 hours

Total disk space used: 54.8 KB

History

  • Auto ran revision aa43d8d4 and failed .
    nothing changed in the database
  • Auto ran revision aa43d8d4 and failed .
    nothing changed in the database
  • Auto ran revision aa43d8d4 and failed .
    nothing changed in the database
  • Auto ran revision aa43d8d4 and failed .
    nothing changed in the database
  • Auto ran revision aa43d8d4 and failed .
    nothing changed in the database
  • ...
  • Created on morph.io

Show complete history

Scraper code

Python

sp_E5048_SLBC_gov / scraper.py