Gijsbertdalen / First-test-scraper

Google news

Scrapes news.google.com

Comprehensive up-to-date news coverage, aggregated from sources all over the world by Google News.


Contributors Gijsbertdalen

Last run failed with status code 1.

Console output of last run

Injecting configuration and compiling... Injecting scraper and running... Traceback (most recent call last): File "scraper.py", line 20, in <module> items = get_google_new_results( 'prinsjesdag', 2 ) File "scraper.py", line 6, in get_google_new_results obj = parseString( urllib2.urlopen('http://news.google.com/news?q=%s&output=rss' % term).read() ) File "/app/.heroku/python/lib/python2.7/urllib2.py", line 154, in urlopen return opener.open(url, data, timeout) File "/app/.heroku/python/lib/python2.7/urllib2.py", line 437, in open response = meth(req, response) File "/app/.heroku/python/lib/python2.7/urllib2.py", line 550, in http_response 'http', request, response, code, msg, hdrs) File "/app/.heroku/python/lib/python2.7/urllib2.py", line 475, in error return self._call_chain(*args) File "/app/.heroku/python/lib/python2.7/urllib2.py", line 409, in _call_chain result = func(*args) File "/app/.heroku/python/lib/python2.7/urllib2.py", line 558, in http_error_default raise HTTPError(req.get_full_url(), code, msg, hdrs, fp) urllib2.HTTPError: HTTP Error 502: Bad Gateway

Statistics

Total run time: 2 minutes

Total cpu time used: less than 5 seconds

Total disk space used: 27.8 KB

History

  • Manually ran revision 2f9a587f and failed .
    1 page scraped
  • Manually ran revision 3f994932 and failed .
  • Manually ran revision 4cddf164 and failed .
    18 pages scraped
  • Manually ran revision 93024c22 and failed .
    7 pages scraped
  • Created on morph.io

Scraper code

First-test-scraper