woodbine / sp_FTRJ1X_GSTNHSFT_gov

Scrapes www.guysandstthomas.nhs.uk

Homepage for Guy's and St Thomas' NHS Foundation Trust


Contributors blablupcom

Last run failed with status code 1.

Console output of last run

Injecting configuration and compiling...  -----> Python app detected  ! The latest version of Python 2 is python-2.7.14 (you are using python-2.7.6, which is unsupported).  ! We recommend upgrading by specifying the latest version (python-2.7.14).  Learn More: https://devcenter.heroku.com/articles/python-runtimes -----> Installing python-2.7.6 -----> Installing pip -----> Installing requirements with pip  Obtaining scraperwiki from git+http://github.com/openaustralia/scraperwiki-python.git@morph_defaults#egg=scraperwiki (from -r /tmp/build/requirements.txt (line 1))  Cloning http://github.com/openaustralia/scraperwiki-python.git (to revision morph_defaults) to /app/.heroku/src/scraperwiki  Collecting lxml==3.4.4 (from -r /tmp/build/requirements.txt (line 2))  /app/.heroku/python/lib/python2.7/site-packages/pip/_vendor/urllib3/util/ssl_.py:339: SNIMissingWarning: An HTTPS request has been made, but the SNI (Subject Name Indication) extension to TLS is not available on this platform. This may cause the server to present an incorrect TLS certificate, which can cause validation failures. You can upgrade to a newer version of Python to solve this. For more information, see https://urllib3.readthedocs.io/en/latest/advanced-usage.html#ssl-warnings  SNIMissingWarning  /app/.heroku/python/lib/python2.7/site-packages/pip/_vendor/urllib3/util/ssl_.py:137: InsecurePlatformWarning: A true SSLContext object is not available. This prevents urllib3 from configuring SSL appropriately and may cause certain SSL connections to fail. You can upgrade to a newer version of Python to solve this. For more information, see https://urllib3.readthedocs.io/en/latest/advanced-usage.html#ssl-warnings  InsecurePlatformWarning  /app/.heroku/python/lib/python2.7/site-packages/pip/_vendor/urllib3/util/ssl_.py:137: InsecurePlatformWarning: A true SSLContext object is not available. This prevents urllib3 from configuring SSL appropriately and may cause certain SSL connections to fail. You can upgrade to a newer version of Python to solve this. For more information, see https://urllib3.readthedocs.io/en/latest/advanced-usage.html#ssl-warnings  InsecurePlatformWarning  Downloading https://files.pythonhosted.org/packages/63/c7/4f2a2a4ad6c6fa99b14be6b3c1cece9142e2d915aa7c43c908677afc8fa4/lxml-3.4.4.tar.gz (3.5MB)  Collecting cssselect==0.9.1 (from -r /tmp/build/requirements.txt (line 3))  Downloading https://files.pythonhosted.org/packages/aa/e5/9ee1460d485b94a6d55732eb7ad5b6c084caf73dd6f9cb0bb7d2a78fafe8/cssselect-0.9.1.tar.gz  Collecting beautifulsoup4 (from -r /tmp/build/requirements.txt (line 4))  Downloading https://files.pythonhosted.org/packages/a6/29/bcbd41a916ad3faf517780a0af7d0254e8d6722ff6414723eedba4334531/beautifulsoup4-4.6.0-py2-none-any.whl (86kB)  Collecting dumptruck>=0.1.2 (from scraperwiki->-r /tmp/build/requirements.txt (line 1))  Downloading https://files.pythonhosted.org/packages/15/27/3330a343de80d6849545b6c7723f8c9a08b4b104de964ac366e7e6b318df/dumptruck-0.1.6.tar.gz  Collecting requests (from scraperwiki->-r /tmp/build/requirements.txt (line 1))  Downloading https://files.pythonhosted.org/packages/49/df/50aa1999ab9bde74656c2919d9c0c085fd2b3775fd3eca826012bef76d8c/requests-2.18.4-py2.py3-none-any.whl (88kB)  Collecting chardet<3.1.0,>=3.0.2 (from requests->scraperwiki->-r /tmp/build/requirements.txt (line 1))  Downloading https://files.pythonhosted.org/packages/bc/a9/01ffebfb562e4274b6487b4bb1ddec7ca55ec7510b22e4c51f14098443b8/chardet-3.0.4-py2.py3-none-any.whl (133kB)  Collecting certifi>=2017.4.17 (from requests->scraperwiki->-r /tmp/build/requirements.txt (line 1))  Downloading https://files.pythonhosted.org/packages/7c/e6/92ad559b7192d846975fc916b65f667c7b8c3a32bea7372340bfe9a15fa5/certifi-2018.4.16-py2.py3-none-any.whl (150kB)  Collecting urllib3<1.23,>=1.21.1 (from requests->scraperwiki->-r /tmp/build/requirements.txt (line 1))  Downloading https://files.pythonhosted.org/packages/63/cb/6965947c13a94236f6d4b8223e21beb4d576dc72e8130bd7880f600839b8/urllib3-1.22-py2.py3-none-any.whl (132kB)  Collecting idna<2.7,>=2.5 (from requests->scraperwiki->-r /tmp/build/requirements.txt (line 1))  Downloading https://files.pythonhosted.org/packages/27/cc/6dd9a3869f15c2edfab863b992838277279ce92663d334df9ecf5106f5c6/idna-2.6-py2.py3-none-any.whl (56kB)  Installing collected packages: dumptruck, chardet, certifi, urllib3, idna, requests, scraperwiki, lxml, cssselect, beautifulsoup4  Running setup.py install for dumptruck: started  Running setup.py install for dumptruck: finished with status 'done'  Running setup.py develop for scraperwiki  Running setup.py install for lxml: started  Running setup.py install for lxml: still running...  Running setup.py install for lxml: finished with status 'done'  Running setup.py install for cssselect: started  Running setup.py install for cssselect: finished with status 'done'  Successfully installed beautifulsoup4-4.6.0 certifi-2018.4.16 chardet-3.0.4 cssselect-0.9.1 dumptruck-0.1.6 idna-2.6 lxml-3.4.4 requests-2.18.4 scraperwiki urllib3-1.22   ! Hello! It looks like your application is using an outdated version of Python.  ! This caused the security warning you saw above during the 'pip install' step.  ! We recommend 'python-3.6.2', which you can specify in a 'runtime.txt' file.  ! -- Much Love, Heroku.   -----> Discovering process types  Procfile declares types -> scraper Injecting scraper and running... Error validating URL. FTRJ1X_GSTNHSFT_gov_2018_02 *Error: Invalid URL* /resources/about-us/foi/expenditure/2018/gstt-spend-february-2018.csv Error validating URL. FTRJ1X_GSTNHSFT_gov_2018_01 *Error: Invalid URL* /resources/about-us/foi/expenditure/2018/gstt-spend-january-2018.csv Error validating URL. FTRJ1X_GSTNHSFT_gov_2017_12 *Error: Invalid URL* /resources/about-us/foi/expenditure/gstt-spend-December-2017.csv Error validating URL. FTRJ1X_GSTNHSFT_gov_2017_11 *Error: Invalid URL* /resources/about-us/foi/expenditure/gstt-spend-november-2017.csv Error validating URL. FTRJ1X_GSTNHSFT_gov_2017_10 *Error: Invalid URL* /resources/about-us/foi/expenditure/gstt-spend-october-2017.csv Error validating URL. FTRJ1X_GSTNHSFT_gov_2017_09 *Error: Invalid URL* /resources/about-us/foi/expenditure/gstt-spend-september-2017.csv Error validating URL. FTRJ1X_GSTNHSFT_gov_2017_08 *Error: Invalid URL* /resources/about-us/foi/expenditure/gstt-spend-august-2017.csv Error validating URL. FTRJ1X_GSTNHSFT_gov_2017_07 *Error: Invalid URL* /resources/about-us/foi/expenditure/gstt-spend-July-2017.csv Error validating URL. FTRJ1X_GSTNHSFT_gov_2017_06 *Error: Invalid URL* /resources/about-us/foi/expenditure/gstt-spend-June-2017.csv Error validating URL. FTRJ1X_GSTNHSFT_gov_2017_05 *Error: Invalid URL* /resources/about-us/foi/expenditure/gstt-spend-may-2017.csv Error validating URL. FTRJ1X_GSTNHSFT_gov_2017_04 *Error: Invalid URL* /resources/about-us/foi/expenditure/gstt-spend-april-2017.csv Error validating URL. FTRJ1X_GSTNHSFT_gov_2017_03 *Error: Invalid URL* /resources/about-us/foi/expenditure/gstt-spend-march-2017.csv Error validating URL. FTRJ1X_GSTNHSFT_gov_2017_02 *Error: Invalid URL* /resources/about-us/foi/expenditure/gstt-spend-February-2017.csv Error validating URL. FTRJ1X_GSTNHSFT_gov_2017_01 *Error: Invalid URL* /resources/about-us/foi/expenditure/gstt-spend-January-2017.csv Error validating URL. FTRJ1X_GSTNHSFT_gov_2016_12 *Error: Invalid URL* /resources/about-us/foi/expenditure/gstt-spend-december-2016.csv Error validating URL. FTRJ1X_GSTNHSFT_gov_2016_11 *Error: Invalid URL* /resources/about-us/foi/expenditure/gstt-spend-november-2016.csv Error validating URL. FTRJ1X_GSTNHSFT_gov_2016_10 *Error: Invalid URL* /resources/about-us/foi/expenditure/gstt-spend-october-2016.csv Error validating URL. FTRJ1X_GSTNHSFT_gov_2016_09 *Error: Invalid URL* /resources/about-us/foi/expenditure/gstt-spend-september-2016.csv Error validating URL. FTRJ1X_GSTNHSFT_gov_2016_08 *Error: Invalid URL* /resources/about-us/foi/expenditure/gstt-spend-August-2016.csv Error validating URL. FTRJ1X_GSTNHSFT_gov_2016_07 *Error: Invalid URL* /resources/about-us/foi/expenditure/gstt-spend-July-2016.csv FTRJ1X_GSTNHSFT_gov_2016_06 FTRJ1X_GSTNHSFT_gov_2016_05 FTRJ1X_GSTNHSFT_gov_2016_04 FTRJ1X_GSTNHSFT_gov_2016_03 FTRJ1X_GSTNHSFT_gov_2016_02 FTRJ1X_GSTNHSFT_gov_2016_01 FTRJ1X_GSTNHSFT_gov_2015_12 FTRJ1X_GSTNHSFT_gov_2015_11 FTRJ1X_GSTNHSFT_gov_2015_10 FTRJ1X_GSTNHSFT_gov_2015_09 FTRJ1X_GSTNHSFT_gov_2015_08 FTRJ1X_GSTNHSFT_gov_2015_07 FTRJ1X_GSTNHSFT_gov_2015_06 FTRJ1X_GSTNHSFT_gov_2015_05 FTRJ1X_GSTNHSFT_gov_2015_04 FTRJ1X_GSTNHSFT_gov_2015_03 FTRJ1X_GSTNHSFT_gov_2015_02 FTRJ1X_GSTNHSFT_gov_2015_01 FTRJ1X_GSTNHSFT_gov_2014_12 FTRJ1X_GSTNHSFT_gov_2014_11 FTRJ1X_GSTNHSFT_gov_2014_10 FTRJ1X_GSTNHSFT_gov_2014_09 FTRJ1X_GSTNHSFT_gov_2014_08 FTRJ1X_GSTNHSFT_gov_2014_07 FTRJ1X_GSTNHSFT_gov_2014_06 FTRJ1X_GSTNHSFT_gov_2014_05 FTRJ1X_GSTNHSFT_gov_2014_04 FTRJ1X_GSTNHSFT_gov_2014_03 FTRJ1X_GSTNHSFT_gov_2014_02 FTRJ1X_GSTNHSFT_gov_2014_01 FTRJ1X_GSTNHSFT_gov_2013_12 FTRJ1X_GSTNHSFT_gov_2013_11 FTRJ1X_GSTNHSFT_gov_2013_10 FTRJ1X_GSTNHSFT_gov_2013_09 FTRJ1X_GSTNHSFT_gov_2013_08 FTRJ1X_GSTNHSFT_gov_2013_07 FTRJ1X_GSTNHSFT_gov_2013_06 FTRJ1X_GSTNHSFT_gov_2013_05 FTRJ1X_GSTNHSFT_gov_2013_04 FTRJ1X_GSTNHSFT_gov_2013_03 FTRJ1X_GSTNHSFT_gov_2013_02 FTRJ1X_GSTNHSFT_gov_2013_01 FTRJ1X_GSTNHSFT_gov_2012_12 FTRJ1X_GSTNHSFT_gov_2012_11 FTRJ1X_GSTNHSFT_gov_2012_10 FTRJ1X_GSTNHSFT_gov_2012_09 FTRJ1X_GSTNHSFT_gov_2012_08 FTRJ1X_GSTNHSFT_gov_2012_07 FTRJ1X_GSTNHSFT_gov_2012_06 FTRJ1X_GSTNHSFT_gov_2012_05 FTRJ1X_GSTNHSFT_gov_2012_04 FTRJ1X_GSTNHSFT_gov_2012_03 FTRJ1X_GSTNHSFT_gov_2012_02 FTRJ1X_GSTNHSFT_gov_2012_01 FTRJ1X_GSTNHSFT_gov_2011_12 FTRJ1X_GSTNHSFT_gov_2011_11 FTRJ1X_GSTNHSFT_gov_2011_10 FTRJ1X_GSTNHSFT_gov_2011_09 FTRJ1X_GSTNHSFT_gov_2011_08 FTRJ1X_GSTNHSFT_gov_2011_07 FTRJ1X_GSTNHSFT_gov_2011_06 FTRJ1X_GSTNHSFT_gov_2011_05 FTRJ1X_GSTNHSFT_gov_2011_04 FTRJ1X_GSTNHSFT_gov_2011_03 FTRJ1X_GSTNHSFT_gov_2011_02 FTRJ1X_GSTNHSFT_gov_2011_01 FTRJ1X_GSTNHSFT_gov_2010_12 FTRJ1X_GSTNHSFT_gov_2010_11 FTRJ1X_GSTNHSFT_gov_2010_10 FTRJ1X_GSTNHSFT_gov_2010_09 FTRJ1X_GSTNHSFT_gov_2010_08 FTRJ1X_GSTNHSFT_gov_2010_07 FTRJ1X_GSTNHSFT_gov_2010_06 FTRJ1X_GSTNHSFT_gov_2010_05 FTRJ1X_GSTNHSFT_gov_2010_04 Traceback (most recent call last): File "scraper.py", line 133, in <module> raise Exception("%d errors occurred during scrape." % errors) Exception: 20 errors occurred during scrape.

Data

Downloaded 505 times by SimKennedy MikeRalphson

To download data sign in with GitHub

Download table (as CSV) Download SQLite database (53 KB) Use the API

rows 10 / 150

d l f
2018-01-23 07:36:38.346315
FTRJ1X_GSTNHSFT_gov_2011_01
2018-02-07 00:45:25.052611
FTRJ1X_GSTNHSFT_gov_2010_12
2018-02-07 00:45:34.549327
FTRJ1X_GSTNHSFT_gov_2010_08
2018-02-07 00:45:37.014290
FTRJ1X_GSTNHSFT_gov_2010_07
2018-02-07 00:45:44.087411
FTRJ1X_GSTNHSFT_gov_2010_04
2018-02-10 12:00:18.359263
FTRJ1X_GSTNHSFT_gov_2016_06
2018-02-10 12:00:20.965270
FTRJ1X_GSTNHSFT_gov_2016_05
2018-02-10 12:00:23.345827
FTRJ1X_GSTNHSFT_gov_2016_04
2018-02-10 12:00:25.942222
FTRJ1X_GSTNHSFT_gov_2016_03
2018-02-10 12:00:28.419394
FTRJ1X_GSTNHSFT_gov_2016_02

Statistics

Average successful run time: 20 minutes

Total run time: 6 days

Total cpu time used: 22 minutes

Total disk space used: 79.2 KB

History

  • Auto ran revision 276b1ef3 and failed .
    75 records added, 75 records removed in the database
    77 pages scraped
  • Auto ran revision 276b1ef3 and failed .
    75 records added, 75 records removed in the database
    77 pages scraped
  • Auto ran revision 276b1ef3 and failed .
    75 records added, 75 records removed in the database
    77 pages scraped
  • Auto ran revision 276b1ef3 and failed .
    75 records added, 75 records removed in the database
    77 pages scraped
  • Auto ran revision 276b1ef3 and failed .
    75 records added, 75 records removed in the database
    77 pages scraped
  • ...
  • Created on morph.io

Show complete history

Scraper code

Python

sp_FTRJ1X_GSTNHSFT_gov / scraper.py