Contributors blablupcom

Last run completed successfully.

Console output of last run

Injecting configuration and compiling...
-----> Python app detected
 ! The latest version of Python 2 is python-2.7.14 (you are using python-2.7.6, which is unsupported).
 ! We recommend upgrading by specifying the latest version (python-2.7.14).
   Learn More: https://devcenter.heroku.com/articles/python-runtimes
-----> Installing python-2.7.6
-----> Installing pip
-----> Installing requirements with pip
  Obtaining scraperwiki from git+http://github.com/openaustralia/scraperwiki-python.git@morph_defaults#egg=scraperwiki (from -r /tmp/build/requirements.txt (line 1))
  Cloning http://github.com/openaustralia/scraperwiki-python.git (to revision morph_defaults) to /app/.heroku/src/scraperwiki
  Collecting lxml==3.4.4 (from -r /tmp/build/requirements.txt (line 2))
  /app/.heroku/python/lib/python2.7/site-packages/pip/_vendor/urllib3/util/ssl_.py:339: SNIMissingWarning: An HTTPS request has been made, but the SNI (Subject Name Indication) extension to TLS is not available on this platform. This may cause the server to present an incorrect TLS certificate, which can cause validation failures. You can upgrade to a newer version of Python to solve this. For more information, see https://urllib3.readthedocs.io/en/latest/advanced-usage.html#ssl-warnings
  SNIMissingWarning
  /app/.heroku/python/lib/python2.7/site-packages/pip/_vendor/urllib3/util/ssl_.py:137: InsecurePlatformWarning: A true SSLContext object is not available. This prevents urllib3 from configuring SSL appropriately and may cause certain SSL connections to fail. You can upgrade to a newer version of Python to solve this. For more information, see https://urllib3.readthedocs.io/en/latest/advanced-usage.html#ssl-warnings
  InsecurePlatformWarning
  /app/.heroku/python/lib/python2.7/site-packages/pip/_vendor/urllib3/util/ssl_.py:137: InsecurePlatformWarning: A true SSLContext object is not available. This prevents urllib3 from configuring SSL appropriately and may cause certain SSL connections to fail. You can upgrade to a newer version of Python to solve this. For more information, see https://urllib3.readthedocs.io/en/latest/advanced-usage.html#ssl-warnings
  InsecurePlatformWarning
  Downloading https://files.pythonhosted.org/packages/63/c7/4f2a2a4ad6c6fa99b14be6b3c1cece9142e2d915aa7c43c908677afc8fa4/lxml-3.4.4.tar.gz (3.5MB)
  Collecting cssselect==0.9.1 (from -r /tmp/build/requirements.txt (line 3))
  Downloading https://files.pythonhosted.org/packages/aa/e5/9ee1460d485b94a6d55732eb7ad5b6c084caf73dd6f9cb0bb7d2a78fafe8/cssselect-0.9.1.tar.gz
  Collecting beautifulsoup4 (from -r /tmp/build/requirements.txt (line 4))
  Downloading https://files.pythonhosted.org/packages/a6/29/bcbd41a916ad3faf517780a0af7d0254e8d6722ff6414723eedba4334531/beautifulsoup4-4.6.0-py2-none-any.whl (86kB)
  Collecting dumptruck>=0.1.2 (from scraperwiki->-r /tmp/build/requirements.txt (line 1))
  Downloading https://files.pythonhosted.org/packages/15/27/3330a343de80d6849545b6c7723f8c9a08b4b104de964ac366e7e6b318df/dumptruck-0.1.6.tar.gz
  Collecting requests (from scraperwiki->-r /tmp/build/requirements.txt (line 1))
  Downloading https://files.pythonhosted.org/packages/65/47/7e02164a2a3db50ed6d8a6ab1d6d60b69c4c3fdf57a284257925dfc12bda/requests-2.19.1-py2.py3-none-any.whl (91kB)
  Collecting chardet<3.1.0,>=3.0.2 (from requests->scraperwiki->-r /tmp/build/requirements.txt (line 1))
  Downloading https://files.pythonhosted.org/packages/bc/a9/01ffebfb562e4274b6487b4bb1ddec7ca55ec7510b22e4c51f14098443b8/chardet-3.0.4-py2.py3-none-any.whl (133kB)
  Collecting urllib3<1.24,>=1.21.1 (from requests->scraperwiki->-r /tmp/build/requirements.txt (line 1))
  Downloading https://files.pythonhosted.org/packages/bd/c9/6fdd990019071a4a32a5e7cb78a1d92c53851ef4f56f62a3486e6a7d8ffb/urllib3-1.23-py2.py3-none-any.whl (133kB)
  Collecting certifi>=2017.4.17 (from requests->scraperwiki->-r /tmp/build/requirements.txt (line 1))
  Downloading https://files.pythonhosted.org/packages/7c/e6/92ad559b7192d846975fc916b65f667c7b8c3a32bea7372340bfe9a15fa5/certifi-2018.4.16-py2.py3-none-any.whl (150kB)
  Collecting idna<2.8,>=2.5 (from requests->scraperwiki->-r /tmp/build/requirements.txt (line 1))
  Downloading https://files.pythonhosted.org/packages/4b/2a/0276479a4b3caeb8a8c1af2f8e4355746a97fab05a372e4a2c6a6b876165/idna-2.7-py2.py3-none-any.whl (58kB)
  Installing collected packages: dumptruck, chardet, urllib3, certifi, idna, requests, scraperwiki, lxml, cssselect, beautifulsoup4
  Running setup.py install for dumptruck: started
  Running setup.py install for dumptruck: finished with status 'done'
  Running setup.py develop for scraperwiki
  Running setup.py install for lxml: started
  Running setup.py install for lxml: still running...
  Running setup.py install for lxml: finished with status 'done'
  Running setup.py install for cssselect: started
  Running setup.py install for cssselect: finished with status 'done'
  Successfully installed beautifulsoup4-4.6.0 certifi-2018.4.16 chardet-3.0.4 cssselect-0.9.1 dumptruck-0.1.6 idna-2.7 lxml-3.4.4 requests-2.19.1 scraperwiki urllib3-1.23
 ! Hello! It looks like your application is using an outdated version of Python.
 ! This caused the security warning you saw above during the 'pip install' step.
 ! We recommend 'python-3.6.2', which you can specify in a 'runtime.txt' file.
 ! -- Much Love, Heroku.
-----> Discovering process types
  Procfile declares types -> scraper
Injecting scraper and running...

Data

Downloaded 484 times by SimKennedy and MikeRalphson

Download table (as CSV) Download SQLite database (28 KB) Use the API
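
Once the SQLite database has been downloaded, its rows can be read with Python's standard sqlite3 module. The following is a minimal sketch, not the scraper's own code. It assumes the file was saved locally (the path is up to you) and that the rows live in a table named data, which is only an assumption here; the column names d, f and l are the ones shown on this page. The function name read_rows is hypothetical.

```python
import sqlite3


def read_rows(db_path, table="data", limit=10):
    """Return up to `limit` (d, f, l) rows from a downloaded database.

    `table` defaults to "data", which is an assumption about how the
    scraper stores its rows; pass another name if yours differs.
    The table name is interpolated into the SQL, so only pass trusted values.
    """
    conn = sqlite3.connect(db_path)
    try:
        query = 'SELECT d, f, l FROM "{}" ORDER BY d LIMIT ?'.format(table)
        return conn.execute(query, (limit,)).fetchall()
    finally:
        # Always release the file handle, even if the query fails.
        conn.close()
```

Each returned item is a tuple of the three columns, ordered by the d timestamp.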

Showing 10 of 44 rows

d | f | l
2017-02-27 21:51:36.781562 | NHTRT1FT_CAPNFT_gov_2016_04 | http://www.cambridgeshireandpeterboroughccg.nhs.uk/downloads/CCG/Payments over 25k/2016-2017/201604 NHS Cambs Peterborough CCG Expenditure over 25k April 2016.xlsx
2017-02-27 21:51:37.619057 | NHTRT1FT_CAPNFT_gov_2016_05 | http://www.cambridgeshireandpeterboroughccg.nhs.uk/downloads/CCG/Payments over 25k/2016-2017/201605 NHS Cambs Peterborough CCG Expenditure over 25k May 2016.xlsx
2017-02-27 21:51:38.442988 | NHTRT1FT_CAPNFT_gov_2016_06 | http://www.cambridgeshireandpeterboroughccg.nhs.uk/downloads/CCG/Payments over 25k/2016-2017/201606 NHS Cambs Peterborough CCG Expenditure over 25k June 2016.xlsx
2017-02-27 21:51:39.238586 | NHTRT1FT_CAPNFT_gov_2016_07 | http://www.cambridgeshireandpeterboroughccg.nhs.uk/downloads/CCG/Payments over 25k/2016-2017/201607 NHS Cambs Peterborough CCG Expenditure over 25k July 2016.xlsx
2017-02-27 21:51:40.016710 | NHTRT1FT_CAPNFT_gov_2016_08 | http://www.cambridgeshireandpeterboroughccg.nhs.uk/downloads/CCG/Payments over 25k/2016-2017/201608 NHS Cambs Peterborough CCG Expenditure over 25k August 2016.xlsx
2017-02-27 21:51:40.859427 | NHTRT1FT_CAPNFT_gov_2016_09 | http://www.cambridgeshireandpeterboroughccg.nhs.uk/downloads/CCG/Payments over 25k/2016-2017/201609 NHS Cambs Peterborough CCG Expenditure over 25k September 2016.xlsx
2017-02-27 21:51:41.641336 | NHTRT1FT_CAPNFT_gov_2016_10 | http://www.cambridgeshireandpeterboroughccg.nhs.uk/downloads/CCG/Payments over 25k/2016-2017/201610 NHS Cambs Peterborough CCG Expenditure over 25k October 2016.xlsx
2017-02-27 21:51:42.448326 | NHTRT1FT_CAPNFT_gov_2016_11 | http://www.cambridgeshireandpeterboroughccg.nhs.uk/downloads/CCG/Payments over 25k/2016-2017/201611 NHS Cambs Peterborough CCG Expenditure over 25k November 2016.xlsx
2017-02-27 21:51:43.174482 | NHTRT1FT_CAPNFT_gov_2015_04 | http://www.cambridgeshireandpeterboroughccg.nhs.uk/downloads/CCG/Payments over 25k/2015-2016/201504 NHS Cambs Peterborough CCG Expenditure over 25k April 2015.xls
2017-02-27 21:51:44.158255 | NHTRT1FT_CAPNFT_gov_2015_05 | http://www.cambridgeshireandpeterboroughccg.nhs.uk/downloads/CCG/Payments over 25k/2015-2016/201505 NHS Cambs Peterborough CCG Expenditure over 25k May 2015.xls
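
In the rows above, each f value ends with the year and month that open the spreadsheet's file name in l (for example 201604 gives _2016_04), and the links contain unescaped spaces. The sketch below illustrates one way that relationship could be reproduced and the links made safe to request. It is not taken from scraper.py, which is not shown here; the function names file_id_from_link and encode_link are hypothetical, and it is written for Python 3 even though the logged run used Python 2.7.

```python
import re
from urllib.parse import quote

# Prefix copied from the f column on this page.
PREFIX = "NHTRT1FT_CAPNFT_gov"


def file_id_from_link(link, prefix=PREFIX):
    """Build an identifier like NHTRT1FT_CAPNFT_gov_2016_04 from a link.

    Assumes the file name starts with a six-digit YYYYMM stamp, as in
    every row shown on this page.
    """
    filename = link.rsplit("/", 1)[-1]
    match = re.match(r"(\d{4})(\d{2})\b", filename)
    if match is None:
        raise ValueError("no YYYYMM stamp at start of: " + filename)
    year, month = match.groups()
    return "{}_{}_{}".format(prefix, year, month)


def encode_link(link):
    """Percent-encode spaces and other unsafe characters, keeping ':' and '/'."""
    return quote(link, safe=":/")
```

Calling encode_link on a link before fetching it replaces each space with %20, so the path "Payments over 25k" becomes "Payments%20over%2025k".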

Statistics

Average successful run time: less than a minute

Total run time: about 11 hours

Total cpu time used: 8 minutes

Total disk space used: 73.9 KB

History

  • Auto ran revision 59da4b2c and completed successfully.
    nothing changed in the database
  • Auto ran revision 59da4b2c and completed successfully.
    nothing changed in the database
  • Auto ran revision 59da4b2c and completed successfully.
    nothing changed in the database
    2 pages scraped
  • Auto ran revision 59da4b2c and completed successfully.
    nothing changed in the database
    2 pages scraped
  • Auto ran revision 59da4b2c and completed successfully.
    nothing changed in the database
    2 pages scraped
  • ...
  • Created on morph.io

Scraper code

Python

sp_NHTRT1FT_CAPNFT_gov / scraper.py