carlplant / NHS_choices_GP_data

Performance data for GPs test


First attempt at scraping specific performance data for GP practices from nhs.uk.

I’ve only scraped a small number of GP practices to save hammering the nhs.uk server.

Data refers to:

  • GP survey data: Opening hours
  • GP survey data: Consultation with Doctors
  • % Score for Organisation of Practice

I might just scrape the lot if there’s a good reason to.

    I’ve collected the nhs.uk ID number for each practice via an earlier scraper (https://scraperwiki.com/scrapers/nhs_choices_gp_recommended/) as well as here.
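As a rough illustration of how the collected IDs turn into pages to scrape, here is a minimal sketch. The URL pattern is copied from the single URL shown in the console output of the last run; that every practice page follows the same pattern is an assumption. The helper names (`performance_url`, `iter_performance_urls`) and the two-second pause are hypothetical and not taken from scraper.py.

```python
import time

# URL pattern taken from the one URL printed in the console output of the
# last run; assuming it holds for every practice ID.
PERFORMANCE_URL = "http://www.nhs.uk/Services/GP/Performance/DefaultView.aspx?id={id}"


def performance_url(practice_id):
    """Build the performance-page URL for one nhs.uk practice ID."""
    return PERFORMANCE_URL.format(id=practice_id)


def iter_performance_urls(practice_ids, delay_seconds=2.0, sleep=time.sleep):
    """Yield one URL per practice ID, pausing between them to go easy on the server."""
    for index, practice_id in enumerate(practice_ids):
        if index:  # no pause before the first URL
            sleep(delay_seconds)
        yield performance_url(practice_id)
```

Passing a short list of IDs keeps the run small; the `sleep` parameter is only there so the pause can be swapped out when trying the function without waiting.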

    Forked from ScraperWiki

    Contributors carlplant

    Last run failed with status code 1.

    Console output of last run

    Injecting configuration and compiling...
    Injecting scraper and running...
    http://www.nhs.uk/Services/GP/Performance/DefaultView.aspx?id=46929
    Traceback (most recent call last):
      File "scraper.py", line 22, in <module>
        html = scraperwiki.scrape(perf_links)
      File "/app/.heroku/src/scraperwiki/scraperwiki/utils.py", line 31, in scrape
        f = urllib2.urlopen(req)
      File "/app/.heroku/python/lib/python2.7/urllib2.py", line 127, in urlopen
        return _opener.open(url, data, timeout)
      File "/app/.heroku/python/lib/python2.7/urllib2.py", line 410, in open
        response = meth(req, response)
      File "/app/.heroku/python/lib/python2.7/urllib2.py", line 523, in http_response
        'http', request, response, code, msg, hdrs)
      File "/app/.heroku/python/lib/python2.7/urllib2.py", line 448, in error
        return self._call_chain(*args)
      File "/app/.heroku/python/lib/python2.7/urllib2.py", line 382, in _call_chain
        result = func(*args)
      File "/app/.heroku/python/lib/python2.7/urllib2.py", line 531, in http_error_default
        raise HTTPError(req.get_full_url(), code, msg, hdrs, fp)
    urllib2.HTTPError: HTTP Error 403: Forbidden
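The traceback shows the HTTPError escaping from line 22 of scraper.py uncaught, so a single refused page ends the whole run. A minimal sketch of catching the error and skipping the page instead is below. It uses Python 3's `urllib` rather than the Python 2.7 `urllib2` / `scraperwiki.scrape` calls in the traceback, and `fetch_or_skip` is a hypothetical helper, not part of the original scraper. It does not address why the server returns 403, only how to stop that response from aborting the run.

```python
import urllib.error
import urllib.request


def fetch_or_skip(url, opener=urllib.request.urlopen):
    """Return the page body as bytes, or None if the server answers with an HTTP error."""
    try:
        response = opener(url)
        try:
            return response.read()
        finally:
            response.close()
    except urllib.error.HTTPError as error:
        # For example "HTTP 403 Forbidden", as in the console output above.
        print("Skipping %s: HTTP %s %s" % (url, error.code, error.reason))
        return None
```

The `opener` argument defaults to `urllib.request.urlopen` and exists only so the function can be exercised without touching the network. A caller would check for `None` and move on to the next practice.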

    Statistics

    Average successful run time: about 6 hours

    Total run time: 4 months

    Total cpu time used: about 9 hours

    Total disk space used: 160 MB

    History

    • Auto ran revision d2cd8f6c and failed.
    • Auto ran revision d2cd8f6c and failed.
    • Auto ran revision d2cd8f6c and failed.
    • Auto ran revision d2cd8f6c and failed.
    • Auto ran revision d2cd8f6c and failed.
    • ...
    • Forked from ScraperWiki


    Scraper code

    Python

    NHS_choices_GP_data / scraper.py