mtega / mps_tanzania

Members of the Tanzania Parliament

Scrapes www.parliament.go.tz

Parliament of Tanzania


This scraper scrapes the members of the tanzanian parliament from http://www.paliament.go.tz it includes numbers of contributions and questions asked. Written for a workshop in Tanzania in 10-2012 Updated weekly

Contributors mihi-tr

Last run failed with status code 1.

Console output of last run

Injecting configuration and compiling... Injecting scraper and running... Traceback (most recent call last): File "scraper.py", line 57, in <module> members=set(reduce(lambda x,y: x+y,[get_member_ids_from_page(n) for n in pages])) # Only have each member once! File "scraper.py", line 32, in get_member_ids_from_page pg=scraperwiki.scrape("%s%02d"%(baseurl,number)) File "/app/.heroku/src/scraperwiki/scraperwiki/utils.py", line 31, in scrape f = urllib2.urlopen(req) File "/app/.heroku/python/lib/python2.7/urllib2.py", line 127, in urlopen return _opener.open(url, data, timeout) File "/app/.heroku/python/lib/python2.7/urllib2.py", line 410, in open response = meth(req, response) File "/app/.heroku/python/lib/python2.7/urllib2.py", line 523, in http_response 'http', request, response, code, msg, hdrs) File "/app/.heroku/python/lib/python2.7/urllib2.py", line 448, in error return self._call_chain(*args) File "/app/.heroku/python/lib/python2.7/urllib2.py", line 382, in _call_chain result = func(*args) File "/app/.heroku/python/lib/python2.7/urllib2.py", line 531, in http_error_default raise HTTPError(req.get_full_url(), code, msg, hdrs, fp) urllib2.HTTPError: HTTP Error 404: Not Found

Statistics

Total run time: 1 minute

Total cpu time used: less than 5 seconds

Total disk space used: 19.8 KB

History

  • Manually ran revision c6ea1c95 and failed .
    nothing changed in the database
    1 page scraped
  • Created on morph.io

Scraper code

Python

mps_tanzania / scraper.py