carlplant / NHS_choices_GP_data

Performance data for GPs test

Scrapes docs.google.com, www.nhs.uk, and www.nhsbsa.nhs.uk

Redirecting...


First attempt of scraping specific performance data for GP practices via nhs.uk.

I’ve only scraped a small number of GP practices to save hammering the nhs.uk server.

Data refers to:

  • GP survey data- Opening hours
  • GP survey data- Consultation with Doctors
  • % Score for Organisation of Practice
  • I might just scrape the lot if there’s a good reason too.

    I’ve collected the nhs.uk ID number for each practice via <a href"https://scraperwiki.com/scrapers/nhs_choices_gp_recommended/">an earlier scraper as well as here.

    Forked from ScraperWiki

    Contributors carlplant

    Last run failed with status code 1.

    Console output of last run

    Injecting configuration and compiling... Injecting scraper and running... http://www.nhs.uk/Services/GP/Performance/DefaultView.aspx?id=46929 Traceback (most recent call last): File "scraper.py", line 22, in <module> html = scraperwiki.scrape(perf_links) File "/app/.heroku/src/scraperwiki/scraperwiki/utils.py", line 31, in scrape f = urllib2.urlopen(req) File "/app/.heroku/python/lib/python2.7/urllib2.py", line 127, in urlopen return _opener.open(url, data, timeout) File "/app/.heroku/python/lib/python2.7/urllib2.py", line 410, in open response = meth(req, response) File "/app/.heroku/python/lib/python2.7/urllib2.py", line 523, in http_response 'http', request, response, code, msg, hdrs) File "/app/.heroku/python/lib/python2.7/urllib2.py", line 448, in error return self._call_chain(*args) File "/app/.heroku/python/lib/python2.7/urllib2.py", line 382, in _call_chain result = func(*args) File "/app/.heroku/python/lib/python2.7/urllib2.py", line 531, in http_error_default raise HTTPError(req.get_full_url(), code, msg, hdrs, fp) urllib2.HTTPError: HTTP Error 403: Forbidden

    Data

    Downloaded 0 times

    To download data sign in with GitHub

    Download table (as CSV) Download SQLite database (450 MB) Use the API

    rows 0 / 0

    To download data sign in with GitHub

    Download table (as CSV) Download SQLite database (450 MB) Use the API

    rows 10 / 2541205

    name Opening_hours Consultation link address percent_Score_org_of_practice id
    The Shehadeh Medical Centre
    4.2
    7.8
    Telephone:
    99%
    46929
    Maghull PCT Practice
    Telephone:
    99%
    46912
    Hurley Surgery
    3.7
    8.2
    Telephone:
    99%
    46886
    Hambleden Surgery
    3.7
    8.2
    Telephone:
    99%
    46885
    The Lane End Surgery
    3.7
    8.2
    Telephone:
    99%
    46884
    Kingston Health (Branch)
    3.8
    8.2
    Telephone:
    100%
    46883
    Whitefields Surgery
    4.3
    8.2
    Telephone:
    94%
    46836
    North Trafford Group Practice
    3.5
    7.9
    Telephone: 0161 848 8180
    99%
    46812
    BAGARY'S MEDICAL PRACTICE
    3.3
    8.9
    Telephone: 01902 728861
    100%
    46808
    Whitaker Mj
    3.1
    8.9
    Telephone: 0161 445-3731
    100%
    46718

    Statistics

    Average successful run time: about 6 hours

    Total run time: 4 months

    Total cpu time used: about 9 hours

    Total disk space used: 450 MB

    History

    • Auto ran revision d2cd8f6c and failed .
      nothing changed in the database
      68 pages scraped
    • Auto ran revision d2cd8f6c and failed .
      nothing changed in the database
      3 pages scraped
    • Auto ran revision d2cd8f6c and failed .
      nothing changed in the database
      3 pages scraped
    • Auto ran revision d2cd8f6c and failed .
      nothing changed in the database
    • Auto ran revision d2cd8f6c and failed .
      nothing changed in the database
      3 pages scraped
    • ...
    • Forked from ScraperWiki

    Show complete history

    Scraper code

    Python

    NHS_choices_GP_data / scraper.py