blablupcom / sp_E5022_WCC_gov

Scrapes www.spotlightonspend.org.uk, www.westminster.gov.uk, and transact.westminster.gov.uk

Transparency spotlightonspend Who's In? Below you'll find a list of a few of the public bodies who have so far gone live with spotlightonspend. Click any link to be taken directly to


Contributors blablupcom

Last run completed successfully.

Console output of last run

Injecting configuration and compiling...
-----> Python app detected
-----> Installing python-2.7.6
$ pip install -r requirements.txt
Obtaining scraperwiki from git+http://github.com/openaustralia/scraperwiki-python.git@morph_defaults#egg=scraperwiki (from -r requirements.txt (line 1))
Cloning http://github.com/openaustralia/scraperwiki-python.git (to morph_defaults) to ./.heroku/src/scraperwiki
Collecting lxml==3.4.4 (from -r requirements.txt (line 2))
/app/.heroku/python/lib/python2.7/site-packages/pip-8.1.2-py2.7.egg/pip/_vendor/requests/packages/urllib3/util/ssl_.py:318: SNIMissingWarning: An HTTPS request has been made, but the SNI (Subject Name Indication) extension to TLS is not available on this platform. This may cause the server to present an incorrect TLS certificate, which can cause validation failures. You can upgrade to a newer version of Python to solve this. For more information, see https://urllib3.readthedocs.org/en/latest/security.html#snimissingwarning.
SNIMissingWarning
/app/.heroku/python/lib/python2.7/site-packages/pip-8.1.2-py2.7.egg/pip/_vendor/requests/packages/urllib3/util/ssl_.py:122: InsecurePlatformWarning: A true SSLContext object is not available. This prevents urllib3 from configuring SSL appropriately and may cause certain SSL connections to fail. You can upgrade to a newer version of Python to solve this. For more information, see https://urllib3.readthedocs.org/en/latest/security.html#insecureplatformwarning.
InsecurePlatformWarning
Downloading lxml-3.4.4.tar.gz (3.5MB)
Collecting cssselect==0.9.1 (from -r requirements.txt (line 3))
Downloading cssselect-0.9.1.tar.gz
Collecting beautifulsoup4 (from -r requirements.txt (line 4))
Downloading beautifulsoup4-4.5.1-py2-none-any.whl (83kB)
Collecting python-dateutil (from -r requirements.txt (line 5))
Downloading python_dateutil-2.5.3-py2.py3-none-any.whl (201kB)
Collecting dumptruck>=0.1.2 (from scraperwiki->-r requirements.txt (line 1))
Downloading dumptruck-0.1.6.tar.gz
Collecting requests (from scraperwiki->-r requirements.txt (line 1))
Downloading requests-2.11.1-py2.py3-none-any.whl (514kB)
Collecting six>=1.5 (from python-dateutil->-r requirements.txt (line 5))
Downloading six-1.10.0-py2.py3-none-any.whl
Installing collected packages: dumptruck, requests, scraperwiki, lxml, cssselect, beautifulsoup4, six, python-dateutil
Running setup.py install for dumptruck: started
Running setup.py install for dumptruck: finished with status 'done'
Running setup.py develop for scraperwiki
Running setup.py install for lxml: started
Running setup.py install for lxml: still running...
Running setup.py install for lxml: finished with status 'done'
Running setup.py install for cssselect: started
Running setup.py install for cssselect: finished with status 'done'
Successfully installed beautifulsoup4-4.5.1 cssselect-0.9.1 dumptruck-0.1.6 lxml-3.4.4 python-dateutil-2.5.3 requests-2.11.1 scraperwiki six-1.10.0
! Hello! It looks like your application is using an outdated version of Python.
! This caused the security warning you saw above during the 'pip install' step.
! We recommend 'python-2.7.12', which you can specify in a 'runtime.txt' file.
! -- Much Love, Heroku.
-----> Discovering process types
Procfile declares types -> scraper
Injecting scraper and running...
E5022_WCC_gov_2011_04
E5022_WCC_gov_2011_05
E5022_WCC_gov_2011_06
E5022_WCC_gov_2011_07
E5022_WCC_gov_2011_08
E5022_WCC_gov_2011_09
E5022_WCC_gov_2011_10
E5022_WCC_gov_2011_11
E5022_WCC_gov_2011_12
E5022_WCC_gov_2012_01
E5022_WCC_gov_2012_02
E5022_WCC_gov_2012_03
E5022_WCC_gov_2012_04
E5022_WCC_gov_2012_05
E5022_WCC_gov_2012_06
E5022_WCC_gov_2012_07
E5022_WCC_gov_2012_08
E5022_WCC_gov_2012_09
E5022_WCC_gov_2012_10
E5022_WCC_gov_2012_11
E5022_WCC_gov_2012_12
E5022_WCC_gov_2013_01
E5022_WCC_gov_2013_02
E5022_WCC_gov_2013_03
E5022_WCC_gov_2013_04
E5022_WCC_gov_2013_05
E5022_WCC_gov_2013_06
E5022_WCC_gov_2013_07
E5022_WCC_gov_2013_08
E5022_WCC_gov_2013_09
E5022_WCC_gov_2013_10
E5022_WCC_gov_2013_11
E5022_WCC_gov_2013_12
E5022_WCC_gov_2014_01
E5022_WCC_gov_2014_02
E5022_WCC_gov_2014_03
E5022_WCC_gov_2016_Q1
E5022_WCC_gov_2015_Q1
E5022_WCC_gov_2015_Q2
E5022_WCC_gov_2015_Q3
E5022_WCC_gov_2015_Q4
E5022_WCC_gov_2014_Q2
E5022_WCC_gov_2014_Q3
E5022_WCC_gov_2014_Q4
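
The names printed by the run mix monthly suffixes (such as 2011_04) with quarterly ones (such as 2014_Q2), and the quarterly names do not appear in date order. The sketch below is one possible way to split such a name into its parts and sort a list chronologically. The helper names `parse_identifier` and `sort_key` are hypothetical and not taken from scraper.py, the pattern is inferred only from the names shown above, and treating Q1 as starting in January is an assumption made purely for ordering.

```python
import re

# Pattern inferred from the names in the console output (an assumption,
# not taken from scraper.py): <prefix>_<4-digit year>_<2-digit month or Qn>.
ID_PATTERN = re.compile(r"^(?P<prefix>.+)_(?P<year>\d{4})_(?P<period>Q[1-4]|\d{2})$")


def parse_identifier(name):
    """Split a name into (prefix, year, kind, number); hypothetical helper."""
    match = ID_PATTERN.match(name)
    if match is None:
        raise ValueError("unrecognised identifier: %r" % name)
    period = match.group("period")
    if period.startswith("Q"):
        kind, number = "quarter", int(period[1:])
    else:
        kind, number = "month", int(period)
    return match.group("prefix"), int(match.group("year")), kind, number


def sort_key(name):
    """Order names by year, then by the first month the period covers."""
    _, year, kind, number = parse_identifier(name)
    # Assumption: Q1 starts in January, Q2 in April, and so on.
    first_month = (number - 1) * 3 + 1 if kind == "quarter" else number
    return (year, first_month)


# A few names copied from the console output above.
names = [
    "E5022_WCC_gov_2016_Q1",
    "E5022_WCC_gov_2015_Q1",
    "E5022_WCC_gov_2014_03",
    "E5022_WCC_gov_2014_Q2",
    "E5022_WCC_gov_2011_04",
]
ordered = sorted(names, key=sort_key)
print(ordered)
```

Mapping each quarter to its first month puts months and quarters on one scale, so a single tuple comparison is enough to order both kinds of name.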

Data

Downloaded 5 times by MikeRalphson and woodbine


rows 10 / 44

d l f
2016-11-04 12:03:02.393934  E5022_WCC_gov_2011_04
2016-11-04 12:03:03.507374  E5022_WCC_gov_2011_05
2016-11-04 12:03:04.530300  E5022_WCC_gov_2011_06
2016-11-04 12:03:05.461420  E5022_WCC_gov_2011_07
2016-11-04 12:03:06.485704  E5022_WCC_gov_2011_08
2016-11-04 12:03:07.547293  E5022_WCC_gov_2011_09
2016-11-04 12:03:08.494066  E5022_WCC_gov_2011_10
2016-11-04 12:03:09.473515  E5022_WCC_gov_2011_11
2016-11-04 12:03:10.472578  E5022_WCC_gov_2011_12
2016-11-04 12:03:11.468743  E5022_WCC_gov_2012_01
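
As a rough illustration of how rows like these could be queried once they sit in a SQLite database, here is a minimal sketch using Python's standard `sqlite3` module. Several things in it are assumptions rather than facts from this page: the table name `data`, the placement of the identifier in column `f` (the listing does not show which of `l` and `f` holds it), and the use of an in-memory database in place of a downloaded file.

```python
import sqlite3

# Assumption: an in-memory database stands in for the real SQLite file.
conn = sqlite3.connect(":memory:")

# Assumption: the table is named "data"; the columns d, l, f come from the
# header of the listing above.
conn.execute("CREATE TABLE data (d TEXT, l TEXT, f TEXT)")

# Three rows copied from the listing. The identifier is placed in column f
# as an assumption; column l is left empty because its values are not shown.
rows = [
    ("2016-11-04 12:03:02.393934", None, "E5022_WCC_gov_2011_04"),
    ("2016-11-04 12:03:03.507374", None, "E5022_WCC_gov_2011_05"),
    ("2016-11-04 12:03:04.530300", None, "E5022_WCC_gov_2011_06"),
]
conn.executemany("INSERT INTO data (d, l, f) VALUES (?, ?, ?)", rows)

# Count the records and find the most recently written one. The timestamps
# are fixed-width text, so sorting them as strings is chronological.
total = conn.execute("SELECT COUNT(*) FROM data").fetchone()[0]
latest = conn.execute(
    "SELECT d, f FROM data ORDER BY d DESC LIMIT 1"
).fetchone()

print(total)
print(latest)
conn.close()
```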

Statistics

Average successful run time: 2 minutes

Total run time: 10 minutes

Total cpu time used: less than 5 seconds

Total disk space used: 70.8 KB

History

  • Manually ran revision 1fabf168 and completed successfully.
    44 records added in the database
    46 pages scraped
  • Manually ran revision 1fabf168 and completed successfully.
    44 records added, 36 records removed in the database
    46 pages scraped
  • Manually ran revision 66dd729b and completed successfully.
    36 records added, 24 records removed in the database
    37 pages scraped
  • Manually ran revision f52cd32d and completed successfully.
    36 records added in the database
    37 pages scraped
  • Manually ran revision f52cd32d and completed successfully.
    36 records added in the database
    37 pages scraped
  • Created on morph.io

Scraper code

Python

sp_E5022_WCC_gov / scraper.py