woodbine / sp_FTRAXX_KHNHSFT_gov

Scrapes www.kingstonhospital.nhs.uk

Kingston Hospital | Home


Contributors: blablupcom, woodbine

Last run failed with status code 1.

Console output of last run

Injecting configuration and compiling...
-----> Python app detected
 ! The latest version of Python 2 is python-2.7.14 (you are using python-2.7.6, which is unsupported).
 ! We recommend upgrading by specifying the latest version (python-2.7.14).
   Learn More: https://devcenter.heroku.com/articles/python-runtimes
-----> Installing python-2.7.6
-----> Installing pip
-----> Installing requirements with pip
  Obtaining scraperwiki from git+http://github.com/openaustralia/scraperwiki-python.git@morph_defaults#egg=scraperwiki (from -r /tmp/build/requirements.txt (line 1))
  Cloning http://github.com/openaustralia/scraperwiki-python.git (to morph_defaults) to /app/.heroku/src/scraperwiki
  Collecting lxml==3.4.4 (from -r /tmp/build/requirements.txt (line 2))
  /app/.heroku/python/lib/python2.7/site-packages/pip/_vendor/requests/packages/urllib3/util/ssl_.py:318: SNIMissingWarning: An HTTPS request has been made, but the SNI (Subject Name Indication) extension to TLS is not available on this platform. This may cause the server to present an incorrect TLS certificate, which can cause validation failures. You can upgrade to a newer version of Python to solve this. For more information, see https://urllib3.readthedocs.io/en/latest/security.html#snimissingwarning.
  SNIMissingWarning
  /app/.heroku/python/lib/python2.7/site-packages/pip/_vendor/requests/packages/urllib3/util/ssl_.py:122: InsecurePlatformWarning: A true SSLContext object is not available. This prevents urllib3 from configuring SSL appropriately and may cause certain SSL connections to fail. You can upgrade to a newer version of Python to solve this. For more information, see https://urllib3.readthedocs.io/en/latest/security.html#insecureplatformwarning.
  InsecurePlatformWarning
  Downloading lxml-3.4.4.tar.gz (3.5MB)
  Collecting cssselect==0.9.1 (from -r /tmp/build/requirements.txt (line 3))
  Downloading cssselect-0.9.1.tar.gz
  Collecting beautifulsoup4 (from -r /tmp/build/requirements.txt (line 4))
  Downloading beautifulsoup4-4.6.0-py2-none-any.whl (86kB)
  Collecting requests[security] (from -r /tmp/build/requirements.txt (line 5))
  Downloading requests-2.18.4-py2.py3-none-any.whl (88kB)
  Collecting dumptruck>=0.1.2 (from scraperwiki->-r /tmp/build/requirements.txt (line 1))
  Downloading dumptruck-0.1.6.tar.gz
  Collecting chardet<3.1.0,>=3.0.2 (from requests[security]->-r /tmp/build/requirements.txt (line 5))
  Downloading chardet-3.0.4-py2.py3-none-any.whl (133kB)
  Collecting certifi>=2017.4.17 (from requests[security]->-r /tmp/build/requirements.txt (line 5))
  Downloading certifi-2018.1.18-py2.py3-none-any.whl (151kB)
  Collecting urllib3<1.23,>=1.21.1 (from requests[security]->-r /tmp/build/requirements.txt (line 5))
  Downloading urllib3-1.22-py2.py3-none-any.whl (132kB)
  Collecting idna<2.7,>=2.5 (from requests[security]->-r /tmp/build/requirements.txt (line 5))
  Downloading idna-2.6-py2.py3-none-any.whl (56kB)
  Collecting cryptography>=1.3.4; extra == "security" (from requests[security]->-r /tmp/build/requirements.txt (line 5))
  Downloading cryptography-2.1.4-cp27-cp27m-manylinux1_x86_64.whl (2.2MB)
  Collecting pyOpenSSL>=0.14; extra == "security" (from requests[security]->-r /tmp/build/requirements.txt (line 5))
  Downloading pyOpenSSL-17.5.0-py2.py3-none-any.whl (53kB)
  Collecting cffi>=1.7; platform_python_implementation != "PyPy" (from cryptography>=1.3.4; extra == "security"->requests[security]->-r /tmp/build/requirements.txt (line 5))
  Downloading cffi-1.11.4-cp27-cp27m-manylinux1_x86_64.whl (407kB)
  Collecting enum34; python_version < "3" (from cryptography>=1.3.4; extra == "security"->requests[security]->-r /tmp/build/requirements.txt (line 5))
  Downloading enum34-1.1.6-py2-none-any.whl
  Collecting six>=1.4.1 (from cryptography>=1.3.4; extra == "security"->requests[security]->-r /tmp/build/requirements.txt (line 5))
  Downloading six-1.11.0-py2.py3-none-any.whl
  Collecting ipaddress; python_version < "3" (from cryptography>=1.3.4; extra == "security"->requests[security]->-r /tmp/build/requirements.txt (line 5))
  Downloading ipaddress-1.0.19.tar.gz
  Collecting asn1crypto>=0.21.0 (from cryptography>=1.3.4; extra == "security"->requests[security]->-r /tmp/build/requirements.txt (line 5))
  Downloading asn1crypto-0.24.0-py2.py3-none-any.whl (101kB)
  Collecting pycparser (from cffi>=1.7; platform_python_implementation != "PyPy"->cryptography>=1.3.4; extra == "security"->requests[security]->-r /tmp/build/requirements.txt (line 5))
  Downloading pycparser-2.18.tar.gz (245kB)
  Installing collected packages: dumptruck, chardet, certifi, urllib3, idna, pycparser, cffi, enum34, six, ipaddress, asn1crypto, cryptography, pyOpenSSL, requests, scraperwiki, lxml, cssselect, beautifulsoup4
  Running setup.py install for dumptruck: started
  Running setup.py install for dumptruck: finished with status 'done'
  Running setup.py install for pycparser: started
  Running setup.py install for pycparser: finished with status 'done'
  Running setup.py install for ipaddress: started
  Running setup.py install for ipaddress: finished with status 'done'
  Running setup.py develop for scraperwiki
  Running setup.py install for lxml: started
  Running setup.py install for lxml: still running...
  Running setup.py install for lxml: finished with status 'done'
  Running setup.py install for cssselect: started
  Running setup.py install for cssselect: finished with status 'done'
  Successfully installed asn1crypto-0.24.0 beautifulsoup4-4.6.0 certifi-2018.1.18 cffi-1.11.4 chardet-3.0.4 cryptography-2.1.4 cssselect-0.9.1 dumptruck-0.1.6 enum34-1.1.6 idna-2.6 ipaddress-1.0.19 lxml-3.4.4 pyOpenSSL-17.5.0 pycparser-2.18 requests-2.18.4 scraperwiki six-1.11.0 urllib3-1.22
 ! Hello! It looks like your application is using an outdated version of Python.
 ! This caused the security warning you saw above during the 'pip install' step.
 ! We recommend 'python-3.6.2', which you can specify in a 'runtime.txt' file.
 ! -- Much Love, Heroku.
-----> Discovering process types
  Procfile declares types -> scraper
Injecting scraper and running...
FTRAXX_KHNHSFT_gov_ices_201 *Error: Invalid filename* https://www.kingstonhospital.nhs.uk/media/276227/m08-november-2017.xlsx
FTRAXX_KHNHSFT_gov_ices_201 *Error: Invalid filename* https://www.kingstonhospital.nhs.uk/media/276223/m07-october-2017.xlsx
FTRAXX_KHNHSFT_gov_ices_201 *Error: Invalid filename* https://www.kingstonhospital.nhs.uk/media/276219/m06-september-2017.xlsx
FTRAXX_KHNHSFT_gov_ices_201 *Error: Invalid filename* https://www.kingstonhospital.nhs.uk/media/276215/m05-august-2017.xlsx
FTRAXX_KHNHSFT_gov_ices_201 *Error: Invalid filename* https://www.kingstonhospital.nhs.uk/media/264425/1718-over-25k.xlsx
FTRAXX_KHNHSFT_gov_ices_201 *Error: Invalid filename* https://www.kingstonhospital.nhs.uk/media/264425/1718-over-25k.xlsx
FTRAXX_KHNHSFT_gov_ices_201 *Error: Invalid filename* https://www.kingstonhospital.nhs.uk/media/264425/1718-over-25k.xlsx
FTRAXX_KHNHSFT_gov_ices_201 *Error: Invalid filename* https://www.kingstonhospital.nhs.uk/media/264425/1718-over-25k.xlsx
FTRAXX_KHNHSFT_gov_ices_201 *Error: Invalid filename* https://www.kingstonhospital.nhs.uk/media/253258/m12-march-2017.xlsx
FTRAXX_KHNHSFT_gov_ices_201 *Error: Invalid filename* https://www.kingstonhospital.nhs.uk/media/253254/m11-february-2017.xlsx
FTRAXX_KHNHSFT_gov_ices_01 *Error: Invalid filename* https://www.kingstonhospital.nhs.uk/media/253250/m10-january-2017.xlsx
FTRAXX_KHNHSFT_gov_ices_201 *Error: Invalid filename* https://www.kingstonhospital.nhs.uk/media/242873/m08-november-2016.xlsx
FTRAXX_KHNHSFT_gov_2013_12
FTRAXX_KHNHSFT_gov_ices_201 *Error: Invalid filename* https://www.kingstonhospital.nhs.uk/media/236566/m07-october-2016.xlsx
FTRAXX_KHNHSFT_gov_ices_201 *Error: Invalid filename* https://www.kingstonhospital.nhs.uk/media/233643/m06-september-2016.xlsx
FTRAXX_KHNHSFT_gov_ices_201 *Error: Invalid filename* https://www.kingstonhospital.nhs.uk/media/233639/m05-august-2016.xlsx
FTRAXX_KHNHSFT_gov_ices_201 *Error: Invalid filename* https://www.kingstonhospital.nhs.uk/media/249460/m04-july-2016.xlsx
FTRAXX_KHNHSFT_gov_ices_201 *Error: Invalid filename* https://www.kingstonhospital.nhs.uk/media/229634/m03-june-2016.xlsx
FTRAXX_KHNHSFT_gov_ices_201 *Error: Invalid filename* https://www.kingstonhospital.nhs.uk/media/244371/m02-may-2016.xlsx
FTRAXX_KHNHSFT_gov_ices_201 *Error: Invalid filename* https://www.kingstonhospital.nhs.uk/media/244367/m01-april-2016.xlsx
FTRAXX_KHNHSFT_gov_2016_03
FTRAXX_KHNHSFT_gov_2016_02
FTRAXX_KHNHSFT_gov_2016_01
FTRAXX_KHNHSFT_gov_2013_12
FTRAXX_KHNHSFT_gov_2015_11
FTRAXX_KHNHSFT_gov_2015_10
FTRAXX_KHNHSFT_gov_2015_09
FTRAXX_KHNHSFT_gov_2015_08
FTRAXX_KHNHSFT_gov_2015_07
FTRAXX_KHNHSFT_gov_2015_06
FTRAXX_KHNHSFT_gov_2015_05
FTRAXX_KHNHSFT_gov_2015_04
FTRAXX_KHNHSFT_gov_2013_12
FTRAXX_KHNHSFT_gov_2014_01
FTRAXX_KHNHSFT_gov_2014_04
FTRAXX_KHNHSFT_gov_2014_07
FTRAXX_KHNHSFT_gov_2014_08
Traceback (most recent call last):
  File "scraper.py", line 134, in <module>
    raise Exception("%d errors occurred during scrape." % errors)
Exception: 19 errors occurred during scrape.
FTRAXX_KHNHSFT_gov_2014_09
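
In the console output above, every name reported as an invalid filename fails to end in a four-digit year followed by a two-digit month, while the accepted names (for example FTRAXX_KHNHSFT_gov_2015_11) do. A minimal sketch of a check with that behaviour follows. The pattern, `FILENAME_RE`, `is_valid_filename` and `count_invalid` are assumptions made for illustration and are not taken from scraper.py, whose actual rule is not shown on this page.

```python
import re

# Hypothetical pattern (an assumption, not the scraper's own rule):
# underscore-separated alphanumeric parts, then a 4-digit year and a
# 2-digit month in the range 01-12.
FILENAME_RE = re.compile(
    r"^[A-Za-z0-9]+(?:_[A-Za-z0-9]+)*_\d{4}_(?:0[1-9]|1[0-2])$"
)


def is_valid_filename(name):
    """Return True if name ends in _YYYY_MM under the assumed pattern."""
    return FILENAME_RE.match(name) is not None


def count_invalid(names):
    """Print one line per rejected name and return how many were rejected."""
    errors = 0
    for name in names:
        if not is_valid_filename(name):
            print("%s *Error: Invalid filename*" % name)
            errors += 1
    return errors


if __name__ == "__main__":
    sample = [
        "FTRAXX_KHNHSFT_gov_2015_11",
        "FTRAXX_KHNHSFT_gov_ices_201",
        "FTRAXX_KHNHSFT_gov_ices_01",
    ]
    print("%d errors occurred during scrape." % count_invalid(sample))
```

A scraper built this way could raise an exception when the count is non-zero, which would be consistent with the traceback shown in the log.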

Data

Downloaded 503 times by SimKennedy, MikeRalphson


rows 10 / 20

f | l | d
FTRAXX_KHNHSFT_gov_2015_11 | https://www.kingstonhospital.nhs.uk/media/208588/08-november-2015-invoices-over-£25k.xls | 2017-01-24 12:48:29.015801
FTRAXX_KHNHSFT_gov_2013_12 | | 2018-01-20 19:11:09.617937
FTRAXX_KHNHSFT_gov_2016_03 | https://www.kingstonhospital.nhs.uk/media/220780/12-march-2016-invoices-over-£25k.xls | 2018-01-20 19:11:25.515109
FTRAXX_KHNHSFT_gov_2016_02 | https://www.kingstonhospital.nhs.uk/media/220767/11-february-2016-invoices-over-£25k.xls | 2018-01-20 19:11:27.879732
FTRAXX_KHNHSFT_gov_2016_01 | https://www.kingstonhospital.nhs.uk/media/208596/10-january-2015-invoices-over-£25k.xls | 2018-01-20 19:11:29.573450
FTRAXX_KHNHSFT_gov_2013_12 | https://www.kingstonhospital.nhs.uk/media/208592/09-december-2015-invoices-over-£25k.xls | 2018-01-20 19:11:31.141700
FTRAXX_KHNHSFT_gov_2015_11 | | 2018-01-20 19:11:32.752096
FTRAXX_KHNHSFT_gov_2015_10 | https://www.kingstonhospital.nhs.uk/media/194900/07-october-2015-invoices-over-£25k.xls | 2018-01-20 19:11:34.193055
FTRAXX_KHNHSFT_gov_2015_09 | https://www.kingstonhospital.nhs.uk/media/192992/06-september-2015-invoices-over-£25k.xls | 2018-01-20 19:11:35.766477
FTRAXX_KHNHSFT_gov_2015_08 | https://www.kingstonhospital.nhs.uk/media/192988/05-august-2015-invoices-over-£25k.xls | 2018-01-20 19:11:37.457209

Statistics

Average successful run time: 11 minutes

Total run time: about 1 month

Total cpu time used: 19 minutes

Total disk space used: 33.8 KB

History

  • Auto ran revision b4713770 and failed.
    19 records added, 19 records removed in the database
    39 pages scraped
  • Auto ran revision b4713770 and failed.
    19 records added, 19 records removed in the database
    39 pages scraped
  • Auto ran revision b4713770 and failed.
    nothing changed in the database
  • Auto ran revision b4713770 and failed.
    nothing changed in the database
  • Auto ran revision b4713770 and failed.
    nothing changed in the database
  • ...
  • Created on morph.io


Scraper code

Python

sp_FTRAXX_KHNHSFT_gov / scraper.py