woodbine / sp_FTRCDX_HDNHSFT_gov

Scrapes www.hdft.nhs.uk

Harrogate District Hospital - Welcome to Harrogate and District NHS Foundation Trust website


Contributors blablupcom woodbine

Last run completed successfully .

Console output of last run

Injecting configuration and compiling...  -----> Python app detected  ! The latest version of Python 2 is python-2.7.14 (you are using python-2.7.13, which is unsupported).  ! We recommend upgrading by specifying the latest version (python-2.7.14).  Learn More: https://devcenter.heroku.com/articles/python-runtimes -----> Installing python-2.7.13 -----> Installing pip -----> Installing requirements with pip  Obtaining scraperwiki from git+http://github.com/openaustralia/scraperwiki-python.git@morph_defaults#egg=scraperwiki (from -r /tmp/build/requirements.txt (line 1))  Cloning http://github.com/openaustralia/scraperwiki-python.git (to revision morph_defaults) to /app/.heroku/src/scraperwiki  Collecting lxml==3.4.4 (from -r /tmp/build/requirements.txt (line 2))  Downloading https://files.pythonhosted.org/packages/63/c7/4f2a2a4ad6c6fa99b14be6b3c1cece9142e2d915aa7c43c908677afc8fa4/lxml-3.4.4.tar.gz (3.5MB)  Collecting cssselect==0.9.1 (from -r /tmp/build/requirements.txt (line 3))  Downloading https://files.pythonhosted.org/packages/aa/e5/9ee1460d485b94a6d55732eb7ad5b6c084caf73dd6f9cb0bb7d2a78fafe8/cssselect-0.9.1.tar.gz  Collecting beautifulsoup4 (from -r /tmp/build/requirements.txt (line 4))  Downloading https://files.pythonhosted.org/packages/a6/29/bcbd41a916ad3faf517780a0af7d0254e8d6722ff6414723eedba4334531/beautifulsoup4-4.6.0-py2-none-any.whl (86kB)  Collecting dumptruck>=0.1.2 (from scraperwiki->-r /tmp/build/requirements.txt (line 1))  Downloading https://files.pythonhosted.org/packages/15/27/3330a343de80d6849545b6c7723f8c9a08b4b104de964ac366e7e6b318df/dumptruck-0.1.6.tar.gz  Collecting requests (from scraperwiki->-r /tmp/build/requirements.txt (line 1))  Downloading https://files.pythonhosted.org/packages/49/df/50aa1999ab9bde74656c2919d9c0c085fd2b3775fd3eca826012bef76d8c/requests-2.18.4-py2.py3-none-any.whl (88kB)  Collecting idna<2.7,>=2.5 (from requests->scraperwiki->-r /tmp/build/requirements.txt (line 1))  Downloading https://files.pythonhosted.org/packages/27/cc/6dd9a3869f15c2edfab863b992838277279ce92663d334df9ecf5106f5c6/idna-2.6-py2.py3-none-any.whl (56kB)  Collecting urllib3<1.23,>=1.21.1 (from requests->scraperwiki->-r /tmp/build/requirements.txt (line 1))  Downloading https://files.pythonhosted.org/packages/63/cb/6965947c13a94236f6d4b8223e21beb4d576dc72e8130bd7880f600839b8/urllib3-1.22-py2.py3-none-any.whl (132kB)  Collecting certifi>=2017.4.17 (from requests->scraperwiki->-r /tmp/build/requirements.txt (line 1))  Downloading https://files.pythonhosted.org/packages/7c/e6/92ad559b7192d846975fc916b65f667c7b8c3a32bea7372340bfe9a15fa5/certifi-2018.4.16-py2.py3-none-any.whl (150kB)  Collecting chardet<3.1.0,>=3.0.2 (from requests->scraperwiki->-r /tmp/build/requirements.txt (line 1))  Downloading https://files.pythonhosted.org/packages/bc/a9/01ffebfb562e4274b6487b4bb1ddec7ca55ec7510b22e4c51f14098443b8/chardet-3.0.4-py2.py3-none-any.whl (133kB)  Installing collected packages: dumptruck, idna, urllib3, certifi, chardet, requests, scraperwiki, lxml, cssselect, beautifulsoup4  Running setup.py install for dumptruck: started  Running setup.py install for dumptruck: finished with status 'done'  Running setup.py develop for scraperwiki  Running setup.py install for lxml: started  Running setup.py install for lxml: still running...  Running setup.py install for lxml: finished with status 'done'  Running setup.py install for cssselect: started  Running setup.py install for cssselect: finished with status 'done'  Successfully installed beautifulsoup4-4.6.0 certifi-2018.4.16 chardet-3.0.4 cssselect-0.9.1 dumptruck-0.1.6 idna-2.6 lxml-3.4.4 requests-2.18.4 scraperwiki urllib3-1.22   -----> Discovering process types  Procfile declares types -> scraper Injecting scraper and running... FTRCDX_HDNHSFT_gov_2018_03 FTRCDX_HDNHSFT_gov_2018_02 FTRCDX_HDNHSFT_gov_2018_01 FTRCDX_HDNHSFT_gov_2017_12 FTRCDX_HDNHSFT_gov_2017_11 FTRCDX_HDNHSFT_gov_2017_10 FTRCDX_HDNHSFT_gov_2017_09 FTRCDX_HDNHSFT_gov_2017_08 FTRCDX_HDNHSFT_gov_2017_07 FTRCDX_HDNHSFT_gov_2017_06 FTRCDX_HDNHSFT_gov_2017_05 FTRCDX_HDNHSFT_gov_2017_04 FTRCDX_HDNHSFT_gov_2017_03 FTRCDX_HDNHSFT_gov_2017_02 FTRCDX_HDNHSFT_gov_2017_01 FTRCDX_HDNHSFT_gov_2016_12 FTRCDX_HDNHSFT_gov_2016_11 FTRCDX_HDNHSFT_gov_2016_10 FTRCDX_HDNHSFT_gov_2016_09 FTRCDX_HDNHSFT_gov_2016_08 FTRCDX_HDNHSFT_gov_2016_07 FTRCDX_HDNHSFT_gov_2016_06 FTRCDX_HDNHSFT_gov_2016_05 FTRCDX_HDNHSFT_gov_2016_04 FTRCDX_HDNHSFT_gov_2015_Y1 FTRCDX_HDNHSFT_gov_2014_Y1

Data

Downloaded 497 times by SimKennedy MikeRalphson woodbine

To download data sign in with GitHub

Download table (as CSV) Download SQLite database (11 KB) Use the API

rows 10 / 26

f l d
FTRCDX_HDNHSFT_gov_2018_03
https://www.hdft.nhs.uk/content/uploads/2018/04/Over-£25k-March-2018.csv
2018-05-27 09:02:01.679242
FTRCDX_HDNHSFT_gov_2018_02
https://www.hdft.nhs.uk/content/uploads/2018/04/Over-£25k-February-2018.csv
2018-05-27 09:02:03.426072
FTRCDX_HDNHSFT_gov_2018_01
https://www.hdft.nhs.uk/content/uploads/2016/05/Over-£25k-January-2018.csv
2018-05-27 09:02:04.869871
FTRCDX_HDNHSFT_gov_2017_12
https://www.hdft.nhs.uk/content/uploads/2016/05/Over-£25k-December-2017.csv
2018-05-27 09:02:06.429227
FTRCDX_HDNHSFT_gov_2017_11
https://www.hdft.nhs.uk/content/uploads/2016/05/Over-£25k-November-2017.csv
2018-05-27 09:02:08.058384
FTRCDX_HDNHSFT_gov_2017_10
https://www.hdft.nhs.uk/content/uploads/2016/05/Over-£25k-October-2017.csv
2018-05-27 09:02:09.568787
FTRCDX_HDNHSFT_gov_2017_09
https://www.hdft.nhs.uk/content/uploads/2016/05/Over-£25k-September-2017.csv
2018-05-27 09:02:11.319394
FTRCDX_HDNHSFT_gov_2017_08
https://www.hdft.nhs.uk/content/uploads/2016/05/Over-£25k-August-2017.csv
2018-05-27 09:02:12.890696
FTRCDX_HDNHSFT_gov_2017_07
https://www.hdft.nhs.uk/content/uploads/2016/05/Over-£25k-July-2017.csv
2018-05-27 09:02:14.366495
FTRCDX_HDNHSFT_gov_2017_06
https://www.hdft.nhs.uk/content/uploads/2016/05/Over-£25k-June-2017.csv
2018-05-27 09:02:15.926385

Statistics

Average successful run time: 1 minute

Total run time: about 12 hours

Total cpu time used: 13 minutes

Total disk space used: 42.6 KB

History

  • Auto ran revision 65d478de and completed successfully .
    26 records added, 26 records removed in the database
    27 pages scraped
  • Auto ran revision 65d478de and completed successfully .
    26 records added, 26 records removed in the database
  • Auto ran revision 65d478de and completed successfully .
    26 records added, 26 records removed in the database
  • Auto ran revision 65d478de and completed successfully .
    26 records added, 26 records removed in the database
    27 pages scraped
  • Auto ran revision 65d478de and completed successfully .
    26 records added, 26 records removed in the database
    27 pages scraped
  • ...
  • Created on morph.io

Show complete history

Scraper code

Python

sp_FTRCDX_HDNHSFT_gov / scraper.py