blablupcom / E4505_SCMBC_gov

Scrapes www.sunderland.gov.uk

Sunderland City Council welcomes you


Contributors blablupcom

Last run completed successfully.

Console output of last run

Injecting configuration and compiling...
-----> Python app detected
-----> Stack changed, re-installing runtime
-----> Installing runtime (python-2.7.6)
-----> Installing dependencies with pip
       Obtaining scraperwiki from git+http://github.com/openaustralia/scraperwiki-python.git@morph_defaults#egg=scraperwiki (from -r requirements.txt (line 1))
       Cloning http://github.com/openaustralia/scraperwiki-python.git (to morph_defaults) to ./.heroku/src/scraperwiki
       Collecting lxml==3.4.4 (from -r requirements.txt (line 2))
       Downloading lxml-3.4.4.tar.gz (3.5MB)
       Building lxml version 3.4.4.
       Building without Cython.
       Using build configuration of libxslt 1.1.28
       /app/.heroku/python/lib/python2.7/distutils/dist.py:267: UserWarning: Unknown distribution option: 'bugtrack_url'
       warnings.warn(msg)
       Collecting cssselect==0.9.1 (from -r requirements.txt (line 3))
       Downloading cssselect-0.9.1.tar.gz
       Collecting beautifulsoup4 (from -r requirements.txt (line 4))
       Downloading beautifulsoup4-4.4.0-py2-none-any.whl (81kB)
       Collecting python-dateutil (from -r requirements.txt (line 5))
       Downloading python_dateutil-2.4.2-py2.py3-none-any.whl (188kB)
       Collecting dumptruck>=0.1.2 (from scraperwiki->-r requirements.txt (line 1))
       Downloading dumptruck-0.1.6.tar.gz
       Collecting requests (from scraperwiki->-r requirements.txt (line 1))
       Downloading requests-2.7.0-py2.py3-none-any.whl (470kB)
       Collecting six>=1.5 (from python-dateutil->-r requirements.txt (line 5))
       Downloading six-1.9.0-py2.py3-none-any.whl
       Installing collected packages: six, requests, dumptruck, python-dateutil, beautifulsoup4, cssselect, lxml, scraperwiki
       Running setup.py install for dumptruck
       Running setup.py install for cssselect
       Running setup.py install for lxml
       Building lxml version 3.4.4.
       Building without Cython.
       Using build configuration of libxslt 1.1.28
       /app/.heroku/python/lib/python2.7/distutils/dist.py:267: UserWarning: Unknown distribution option: 'bugtrack_url'
       warnings.warn(msg)
       building 'lxml.etree' extension
       gcc -pthread -fno-strict-aliasing -g -O2 -DNDEBUG -g -fwrapv -O3 -Wall -Wstrict-prototypes -fPIC -I/usr/include/libxml2 -I/tmp/pip-build-XSofYx/lxml/src/lxml/includes -I/app/.heroku/python/include/python2.7 -c src/lxml/lxml.etree.c -o build/temp.linux-x86_64-2.7/src/lxml/lxml.etree.o -w
       gcc -pthread -shared build/temp.linux-x86_64-2.7/src/lxml/lxml.etree.o -L/app/.heroku/python/lib -lxslt -lexslt -lxml2 -lz -lm -lpython2.7 -o build/lib.linux-x86_64-2.7/lxml/etree.so
       building 'lxml.objectify' extension
       gcc -pthread -fno-strict-aliasing -g -O2 -DNDEBUG -g -fwrapv -O3 -Wall -Wstrict-prototypes -fPIC -I/usr/include/libxml2 -I/tmp/pip-build-XSofYx/lxml/src/lxml/includes -I/app/.heroku/python/include/python2.7 -c src/lxml/lxml.objectify.c -o build/temp.linux-x86_64-2.7/src/lxml/lxml.objectify.o -w
       gcc -pthread -shared build/temp.linux-x86_64-2.7/src/lxml/lxml.objectify.o -L/app/.heroku/python/lib -lxslt -lexslt -lxml2 -lz -lm -lpython2.7 -o build/lib.linux-x86_64-2.7/lxml/objectify.so
       Running setup.py develop for scraperwiki
       Creating /app/.heroku/python/lib/python2.7/site-packages/scraperwiki.egg-link (link to .)
       Adding scraperwiki 0.3.7 to easy-install.pth file
       Installed /app/.heroku/src/scraperwiki
       Successfully installed beautifulsoup4-4.4.0 cssselect-0.9.1 dumptruck-0.1.6 lxml-3.4.4 python-dateutil-2.4.2 requests-2.7.0 scraperwiki six-1.9.0
-----> Discovering process types
       Procfile declares types -> scraper
Injecting scraper and running...
/app/.heroku/python/lib/python2.7/site-packages/bs4/__init__.py:166: UserWarning: No parser was explicitly specified, so I'm using the best available HTML parser for this system ("lxml"). This usually isn't a problem, but if you run this code on another system, or in a different virtual environment, it may use a different parser and behave differently.
To get rid of this warning, change this:
 BeautifulSoup([your markup])
to this:
 BeautifulSoup([your markup], "lxml")
  markup_type=markup_type))
E4505_SCMBC_gov_2015_03
E4505_SCMBC_gov_2014_12
E4505_SCMBC_gov_2014_09
E4505_SCMBC_gov_2014_06
E4505_SCMBC_gov_2014_03
E4505_SCMBC_gov_2013_12
E4505_SCMBC_gov_2013_09
E4505_SCMBC_gov_2013_06
E4505_SCMBC_gov_2013_03
E4505_SCMBC_gov_2012_12
E4505_SCMBC_gov_2012_09
E4505_SCMBC_gov_2012_06
E4505_SCMBC_gov_2012_03
E4505_SCMBC_gov_2011_12
E4505_SCMBC_gov_2011_09
E4505_SCMBC_gov_2011_06
E4505_SCMBC_gov_2011_03
E4505_SCMBC_gov_2010_12
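
The UserWarning in the log comes from calling BeautifulSoup without naming a parser. A minimal sketch of the fix it describes is below. The markup and variable names are made up for illustration and are not taken from the scraper's code. The sketch passes "html.parser", which ships with Python, so that it runs without extra packages; the log's own suggestion, "lxml", works the same way when lxml is installed, as it is in this run.

```python
from bs4 import BeautifulSoup

# Hypothetical markup standing in for a page that lists CSV downloads.
html = """
<ul>
  <li><a href="/data/q1.csv">Q1 data</a></li>
  <li><a href="/data/q2.csv">Q2 data</a></li>
</ul>
"""

# Naming the parser as the second argument removes the warning and keeps
# parsing behaviour the same on every machine the scraper runs on.
soup = BeautifulSoup(html, "html.parser")

# Collect the link targets in document order.
links = [a["href"] for a in soup.find_all("a")]
print(links)
```

Whichever parser is chosen, it is worth keeping the same one in every environment, since different parsers can repair malformed HTML differently.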

Data

Downloaded 0 times


Showing 10 of 18 rows

l | f | d
http://www.sunderland.gov.uk/CHttpHandler.ashx?id=16766&p=0&fsize=1Mb&ftype=Expenditure over £500 Q4 Jan to Mar 2015 (CSV).CSV | E4505_SCMBC_gov_2015_03 | 2015-07-16 00:07:54.388791
http://www.sunderland.gov.uk/CHttpHandler.ashx?id=16683&p=0&fsize=1Mb&ftype=Expenditure over £500 Q3 Oct to Dec 2014 (CSV).CSV | E4505_SCMBC_gov_2014_12 | 2015-07-16 00:07:58.239053
http://www.sunderland.gov.uk/CHttpHandler.ashx?id=16682&p=0&fsize=1Mb&ftype=Expenditure over £500 Q2 Jul to Sep 2014 (CSV).CSV | E4505_SCMBC_gov_2014_09 | 2015-07-16 00:08:01.944820
http://www.sunderland.gov.uk/CHttpHandler.ashx?id=16681&p=0&fsize=1Mb&ftype=Expenditure over £500 Q1 Apr to Jun 2014 (CSV).CSV | E4505_SCMBC_gov_2014_06 | 2015-07-16 00:08:05.557675
http://www.sunderland.gov.uk/CHttpHandler.ashx?id=15045&p=0&fsize=1Mb&ftype=Expenditure over £500 Q4 Jan to Mar 2014 (CSV).CSV | E4505_SCMBC_gov_2014_03 | 2015-07-16 00:08:09.032860
http://www.sunderland.gov.uk/CHttpHandler.ashx?id=14726&p=0&fsize=1Mb&ftype=Expenditure over £500 Q3 Oct to Dec 2013 (CSV).CSV | E4505_SCMBC_gov_2013_12 | 2015-07-16 00:08:12.684052
http://www.sunderland.gov.uk/CHttpHandler.ashx?id=14455&p=0&fsize=1Mb&ftype=Expenditure over £500 Q2 Jul to Sep 2013 (CSV).CSV | E4505_SCMBC_gov_2013_09 | 2015-07-16 00:08:16.184875
http://www.sunderland.gov.uk/CHttpHandler.ashx?id=14081&p=0&fsize=1Mb&ftype=Expenditure over £500 Q1 Apr to Jun 2013 (CSV).CSV | E4505_SCMBC_gov_2013_06 | 2015-07-16 00:08:19.813138
http://www.sunderland.gov.uk/CHttpHandler.ashx?id=13766&p=0&fsize=1Mb&ftype=Expenditure over £500 Q4 Jan to Mar 2013 (CSV).CSV | E4505_SCMBC_gov_2013_03 | 2015-07-16 00:08:23.658228
http://www.sunderland.gov.uk/CHttpHandler.ashx?id=13398&p=0&fsize=1Mb&ftype=Expenditure over £500 Q3 Oct to Dec 2012 (CSV).CSV | E4505_SCMBC_gov_2012_12 | 2015-07-16 00:08:26.985118
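
In the rows above, each f value appears to be the prefix E4505_SCMBC_gov followed by the year and closing month of the quarter named in the link (for example "Jan to Mar 2015" gives 2015_03). The sketch below shows one way that mapping could be computed. It is an assumption drawn from the visible data, not the scraper's actual code, and the names file_name_from_link, MONTHS and PREFIX are hypothetical.

```python
# -*- coding: utf-8 -*-
import re

# Three-letter month abbreviations mapped to two-digit month numbers.
MONTHS = {
    "Jan": "01", "Feb": "02", "Mar": "03", "Apr": "04",
    "May": "05", "Jun": "06", "Jul": "07", "Aug": "08",
    "Sep": "09", "Oct": "10", "Nov": "11", "Dec": "12",
}

# Assumed prefix, copied from the f column shown above.
PREFIX = "E4505_SCMBC_gov"

# Matches the closing month and year of a period such as "Jan to Mar 2015".
PERIOD_END = re.compile(r"\bto\s+([A-Z][a-z]{2})\s+(\d{4})\b")


def file_name_from_link(link):
    """Return PREFIX_YYYY_MM for the quarter end named in a link."""
    match = PERIOD_END.search(link)
    if match is None:
        raise ValueError("no quarter period found in: %r" % (link,))
    month, year = match.groups()
    if month not in MONTHS:
        raise ValueError("unrecognised month: %r" % (month,))
    return "{}_{}_{}".format(PREFIX, year, MONTHS[month])


example = ("http://www.sunderland.gov.uk/CHttpHandler.ashx?id=16766&p=0"
           "&fsize=1Mb&ftype=Expenditure over £500 Q4 Jan to Mar 2015 (CSV).CSV")
print(file_name_from_link(example))
```

Raising ValueError on an unexpected link is a deliberate choice in this sketch: a renamed download on the council site would then show up as a failed run rather than as a silently mislabelled row.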

Statistics

Average successful run time: 4 minutes

Total run time: 4 minutes

Total cpu time used: less than 5 seconds

Total disk space used: 37.9 KB

History

  • Manually ran revision bd4de036 and completed successfully.
    18 records added to the database
    19 pages scraped
  • Created on morph.io

Scraper code

E4505_SCMBC_gov