fgregg / knight_news_challenge_popularity_scraper

Knight News Challenge Popularity Scraper Page by Page


Last run failed with status code 1.

Console output of last run

Scraping page 1
Scraping page 2
Scraping page 3
Scraping page 4
Scraping page 5
Scraping page 6
Scraping page 7
Scraping page 8
Scraping page 9
Scraping page 10
Scraping page 11
Scraping page 12
Scraping page 13
Scraping page 14
Scraping page 15
Scraping page 16
Scraping page 17
Scraping page 18
Scraping page 19
Scraping page 20
Scraping page 21
Scraping page 22
Scraping page 23
Scraping page 24
Traceback (most recent call last):
  File "/repo/scraper.py", line 39, in <module>
    entryHTML = scraperwiki.scrape(url)
  File "/usr/local/lib/python2.7/dist-packages/scraperwiki-0.3.7-py2.7.egg/scraperwiki/utils.py", line 31, in scrape
    f = urllib2.urlopen(req)
  File "/usr/lib/python2.7/urllib2.py", line 126, in urlopen
    return _opener.open(url, data, timeout)
  File "/usr/lib/python2.7/urllib2.py", line 406, in open
    response = meth(req, response)
  File "/usr/lib/python2.7/urllib2.py", line 519, in http_response
    'http', request, response, code, msg, hdrs)
  File "/usr/lib/python2.7/urllib2.py", line 444, in error
    return self._call_chain(*args)
  File "/usr/lib/python2.7/urllib2.py", line 378, in _call_chain
    result = func(*args)
  File "/usr/lib/python2.7/urllib2.py", line 527, in http_error_default
    raise HTTPError(req.get_full_url(), code, msg, hdrs, fp)
urllib2.HTTPError: HTTP Error 504: Gateway Timeout
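
The traceback shows the run ending on a single HTTP 504 from the remote server while fetching an entry page. One possible way to make such a scraper tolerate transient gateway errors is to retry the request with exponential backoff. The following is a minimal sketch under stated assumptions, not the scraper's actual code: it uses Python 3's urllib.request in place of the Python 2.7 urllib2 seen in the traceback, and the names fetch and scrape_with_retry, the retry count, the delays, and the set of retried status codes are all hypothetical.

```python
import time
import urllib.error
import urllib.request


def fetch(url, timeout=30):
    """Fetch a URL and return the response body as bytes."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return response.read()


def scrape_with_retry(url, fetcher=fetch, attempts=4, base_delay=2.0,
                      retry_codes=(502, 503, 504), sleep=time.sleep):
    """Call fetcher(url), retrying transient HTTP errors with exponential backoff.

    All defaults here are illustrative assumptions, not values from the scraper.
    """
    for attempt in range(attempts):
        try:
            return fetcher(url)
        except urllib.error.HTTPError as err:
            # Re-raise errors that are not considered transient,
            # and give up once the final attempt has failed.
            if err.code not in retry_codes or attempt == attempts - 1:
                raise
            # Wait base_delay, then 2x, then 4x, ... before the next attempt.
            sleep(base_delay * 2 ** attempt)
```

In a scraper like this one, the call on line 39 of scraper.py could be wrapped in a helper of this kind, so that one slow response would not necessarily abort the whole run. Whether retrying would have helped in this particular run cannot be determined from the log.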

Data

Downloaded 1 time by MikeRalphson


Download table (as CSV) Download SQLite database (509 KB) Use the API

rows 10 / 1185

Statistics

Total run time: 4 minutes

Total cpu time used: half a minute

Total disk space used: 528 KB

History

  • Manually ran revision 7d0b0a65 and failed.
    355 records added to the database
  • Created on morph.io