This is a scraper that runs on Morph. To get started, see the documentation.

Contributors: woodbine, blablupcom

Last run completed successfully.

Console output of last run

Injecting configuration and compiling...
-----> Python app detected
 !     The latest version of Python 2 is python-2.7.14 (you are using python-2.7.9, which is unsupported).
 !     We recommend upgrading by specifying the latest version (python-2.7.14).
       Learn More: https://devcenter.heroku.com/articles/python-runtimes
-----> Installing python-2.7.9
-----> Installing pip
-----> Installing requirements with pip
       DEPRECATION: Python 2.7 reached the end of its life on January 1st, 2020. Please upgrade your Python as Python 2.7 is no longer maintained. pip 21.0 will drop support for Python 2.7 in January 2021. More details about Python 2 support in pip can be found at https://pip.pypa.io/en/latest/development/release-process/#python-2-support
       Obtaining scraperwiki from git+http://github.com/openaustralia/scraperwiki-python.git@morph_defaults#egg=scraperwiki (from -r /tmp/build/requirements.txt (line 1))
         Cloning http://github.com/openaustralia/scraperwiki-python.git (to revision morph_defaults) to /app/.heroku/src/scraperwiki
       Collecting lxml==3.4.4
         Downloading lxml-3.4.4.tar.gz (3.5 MB)
       Collecting cssselect==0.9.1
         Downloading cssselect-0.9.1.tar.gz (32 kB)
       Collecting beautifulsoup4
         Downloading beautifulsoup4-4.9.1-py2-none-any.whl (111 kB)
       Collecting dumptruck>=0.1.2
         Downloading dumptruck-0.1.6.tar.gz (15 kB)
       Collecting requests
         Downloading requests-2.24.0-py2.py3-none-any.whl (61 kB)
       Collecting soupsieve<2.0
         Downloading soupsieve-1.9.6-py2.py3-none-any.whl (33 kB)
       Collecting urllib3!=1.25.0,!=1.25.1,<1.26,>=1.21.1
         Downloading urllib3-1.25.10-py2.py3-none-any.whl (127 kB)
       Collecting certifi>=2017.4.17
         Downloading certifi-2020.6.20-py2.py3-none-any.whl (156 kB)
       Collecting chardet<4,>=3.0.2
         Downloading chardet-3.0.4-py2.py3-none-any.whl (133 kB)
       Collecting idna<3,>=2.5
         Downloading idna-2.10-py2.py3-none-any.whl (58 kB)
       Collecting backports.functools-lru-cache; python_version < "3"
         Downloading backports.functools_lru_cache-1.6.1-py2.py3-none-any.whl (5.7 kB)
       Building wheels for collected packages: lxml, cssselect, dumptruck
         Building wheel for lxml (setup.py): started
         Building wheel for lxml (setup.py): still running...
         Building wheel for lxml (setup.py): finished with status 'done'
         Created wheel for lxml: filename=lxml-3.4.4-cp27-cp27m-linux_x86_64.whl size=2989850 sha256=826c14191d9c77ab27511dcab5e592e8a69a2e27258e6a9363d9b9681d512a24
         Stored in directory: /tmp/pip-ephem-wheel-cache-rFQoF7/wheels/d6/de/81/11ae6edd05c75aac677e67dd154c85da758ba6f3e8e80e962e
         Building wheel for cssselect (setup.py): started
         Building wheel for cssselect (setup.py): finished with status 'done'
         Created wheel for cssselect: filename=cssselect-0.9.1-py2-none-any.whl size=26993 sha256=46fa408a3f5538e5646bffb9fb3d18f80b7d6995d533ec2050c8fc8285fdaec0
         Stored in directory: /tmp/pip-ephem-wheel-cache-rFQoF7/wheels/85/fe/00/b94036d8583cec9791d8cda24c184f2d2ac1397822f7f0e8d4
         Building wheel for dumptruck (setup.py): started
         Building wheel for dumptruck (setup.py): finished with status 'done'
         Created wheel for dumptruck: filename=dumptruck-0.1.6-py2-none-any.whl size=11842 sha256=ce0fb71602a3ecdbc8fd2fceb020bf89514ec6d8d2cacdbb44e12fb9e0efa2fa
         Stored in directory: /tmp/pip-ephem-wheel-cache-rFQoF7/wheels/dc/75/e9/1e61c4080c73e7bda99614549591f83b53bcc2d682f26fce62
       Successfully built lxml cssselect dumptruck
       Installing collected packages: dumptruck, urllib3, certifi, chardet, idna, requests, scraperwiki, lxml, cssselect, backports.functools-lru-cache, soupsieve, beautifulsoup4
         Running setup.py develop for scraperwiki
       Successfully installed backports.functools-lru-cache-1.6.1 beautifulsoup4-4.9.1 certifi-2020.6.20 chardet-3.0.4 cssselect-0.9.1 dumptruck-0.1.6 idna-2.10 lxml-3.4.4 requests-2.24.0 scraperwiki soupsieve-1.9.6 urllib3-1.25.10
       DEPRECATION: Python 2.7 reached the end of its life on January 1st, 2020. Please upgrade your Python as Python 2.7 is no longer maintained. pip 21.0 will drop support for Python 2.7 in January 2021. More details about Python 2 support in pip can be found at https://pip.pypa.io/en/latest/development/release-process/#python-2-support
-----> Discovering process types
       Procfile declares types -> scraper
Injecting scraper and running...
E3720_WCC_gov_2018_07
E3720_WCC_gov_2011_04
E3720_WCC_gov_2012_04
E3720_WCC_gov_2013_04
E3720_WCC_gov_2014_04
E3720_WCC_gov_2015_04
E3720_WCC_gov_2016_04
E3720_WCC_gov_2017_04
E3720_WCC_gov_2018_04
E3720_WCC_gov_2019_04
E3720_WCC_gov_2011_08
E3720_WCC_gov_2012_08
E3720_WCC_gov_2013_08
E3720_WCC_gov_2014_08
E3720_WCC_gov_2015_08
E3720_WCC_gov_2016_08
E3720_WCC_gov_2017_08
E3720_WCC_gov_2018_08
E3720_WCC_gov_2019_08
E3720_WCC_gov_2010_12
E3720_WCC_gov_2011_12
E3720_WCC_gov_2011_12
E3720_WCC_gov_2012_12
E3720_WCC_gov_2013_12
E3720_WCC_gov_2014_12
E3720_WCC_gov_2015_12
E3720_WCC_gov_2016_12
E3720_WCC_gov_2017_12
E3720_WCC_gov_2018_12
E3720_WCC_gov_2019_12
E3720_WCC_gov_2011_02
E3720_WCC_gov_2012_02
E3720_WCC_gov_2012_02
E3720_WCC_gov_2013_02
E3720_WCC_gov_2014_02
E3720_WCC_gov_2015_02
E3720_WCC_gov_2016_02
E3720_WCC_gov_2017_02
E3720_WCC_gov_2018_02
E3720_WCC_gov_2019_02
E3720_WCC_gov_2020_02
E3720_WCC_gov_2011_01
E3720_WCC_gov_2012_01
E3720_WCC_gov_2012_01
E3720_WCC_gov_2013_01
E3720_WCC_gov_2014_01
E3720_WCC_gov_2015_01
E3720_WCC_gov_2016_01
E3720_WCC_gov_2017_01
E3720_WCC_gov_2018_01
E3720_WCC_gov_2019_01
E3720_WCC_gov_2011_07
E3720_WCC_gov_2012_07
E3720_WCC_gov_2013_07
E3720_WCC_gov_2014_07
E3720_WCC_gov_2014_07
E3720_WCC_gov_2015_07
E3720_WCC_gov_2016_07
E3720_WCC_gov_2017_07
E3720_WCC_gov_2019_07
E3720_WCC_gov_2011_06
E3720_WCC_gov_2012_06
E3720_WCC_gov_2013_06
E3720_WCC_gov_2014_06
E3720_WCC_gov_2015_06
E3720_WCC_gov_2016_06
E3720_WCC_gov_2017_06
E3720_WCC_gov_2018_06
E3720_WCC_gov_2019_06
E3720_WCC_gov_2020_06
E3720_WCC_gov_2011_03
E3720_WCC_gov_2012_03
E3720_WCC_gov_2012_03
E3720_WCC_gov_2013_03
E3720_WCC_gov_2014_03
E3720_WCC_gov_2015_03
E3720_WCC_gov_2016_03
E3720_WCC_gov_2017_03
E3720_WCC_gov_2018_03
E3720_WCC_gov_2019_03
E3720_WCC_gov_2020_03
E3720_WCC_gov_2011_05
E3720_WCC_gov_2012_05
E3720_WCC_gov_2013_05
E3720_WCC_gov_2014_05
E3720_WCC_gov_2015_05
E3720_WCC_gov_2016_05
E3720_WCC_gov_2017_05
E3720_WCC_gov_2018_05
E3720_WCC_gov_2019_05
E3720_WCC_gov_2020_05
E3720_WCC_gov_2010_11
E3720_WCC_gov_2011_11
E3720_WCC_gov_2011_11
E3720_WCC_gov_2012_11
E3720_WCC_gov_2013_11
E3720_WCC_gov_2014_11
E3720_WCC_gov_2015_11

Data

Downloaded 833 times by SimKennedy, woodbine, MikeRalphson, blablupcom


Download table (as CSV), download SQLite database (66 KB), or use the API
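As a rough illustration of working with the downloaded SQLite database, the sketch below reads the newest rows with Python's standard `sqlite3` module. The table name `data`, the helper names `latest_rows` and `demo_connection`, and the placement of the identifier in column `f` are assumptions made for this example; only the column names `d`, `f` and `l` come from the table shown on this page.

```python
import sqlite3


def latest_rows(conn, limit=10):
    """Return the newest rows as (d, f, l) tuples, newest first.

    Assumes a table called `data` with text columns d, f and l.
    The timestamps are ISO-style strings, so sorting them as text
    also sorts them chronologically.
    """
    cur = conn.execute(
        "SELECT d, f, l FROM data ORDER BY d DESC LIMIT ?", (limit,)
    )
    return cur.fetchall()


def demo_connection():
    """Build a small in-memory stand-in for the downloaded database."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE data (d TEXT, f TEXT, l TEXT)")
    conn.executemany(
        "INSERT INTO data VALUES (?, ?, ?)",
        [
            # Values copied from the table on this page; l is left empty.
            ("2019-02-04 10:11:44.285298", "E3720_WCC_gov_2018_07", None),
            ("2019-02-04 10:11:46.296672", "E3720_WCC_gov_2011_04", None),
        ],
    )
    return conn


if __name__ == "__main__":
    for row in latest_rows(demo_connection(), limit=2):
        print(row)
```

To try this against the real download, `demo_connection()` could be swapped for `sqlite3.connect(path)` with the path of the saved file, after checking the actual table name in that file.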

rows 10 / 218

d                           f                      l
2018-02-08 23:31:53.251748  E3720_WCC_gov_2015_02
2018-02-08 23:32:27.337050  E3720_WCC_gov_2014_07  https://s3-eu-west-1.amazonaws.com/opendata/supplierpayments/all-july-2014.csv
2018-11-26 02:56:21.216661  E3720_WCC_gov_2016_09
2018-11-26 02:56:24.384195  E3720_WCC_gov_2017_09
2019-02-04 10:11:44.285298  E3720_WCC_gov_2018_07
2019-02-04 10:11:46.296672  E3720_WCC_gov_2011_04
2019-02-04 10:11:48.467975  E3720_WCC_gov_2012_04
2019-02-04 10:11:50.421670  E3720_WCC_gov_2013_04
2019-02-04 10:11:52.671064  E3720_WCC_gov_2014_04
2019-02-04 10:11:54.774276  E3720_WCC_gov_2015_04
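
The identifiers in the table appear to follow a fixed shape: a code, two short labels, a four-digit year and a two-digit month, joined by underscores. A minimal sketch of splitting one into its parts is given below. The pattern, the field names (`entity`, `org`, `kind`) and the helper `parse_identifier` are guesses made for illustration and are not taken from the scraper's code.

```python
import re
from collections import namedtuple

# Hypothetical pattern inferred from values such as "E3720_WCC_gov_2015_04".
ID_PATTERN = re.compile(
    r"^(?P<entity>[A-Z]\d+)_(?P<org>[A-Za-z]+)_(?P<kind>[a-z]+)"
    r"_(?P<year>\d{4})_(?P<month>\d{2})$"
)

ParsedId = namedtuple("ParsedId", "entity org kind year month")


def parse_identifier(identifier):
    """Split an identifier into its parts, with year and month as ints.

    Raises ValueError if the identifier does not match the assumed
    pattern or if the month is outside 1..12.
    """
    match = ID_PATTERN.match(identifier)
    if match is None:
        raise ValueError("unrecognised identifier: %r" % (identifier,))
    month = int(match.group("month"))
    if not 1 <= month <= 12:
        raise ValueError("month out of range in %r" % (identifier,))
    return ParsedId(
        match.group("entity"),
        match.group("org"),
        match.group("kind"),
        int(match.group("year")),
        month,
    )


if __name__ == "__main__":
    print(parse_identifier("E3720_WCC_gov_2015_04"))
```

Parsing the year and month into integers makes it straightforward to sort rows by period or to spot months that are missing from a run.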

Statistics

Average successful run time: 9 minutes

Total run time: about 1 month

Total cpu time used: about 10 hours

Total disk space used: 104 KB

History

  • Auto ran revision 45d3ebf9 and completed successfully.
    97 records added, 97 records removed in the database
  • Auto ran revision 45d3ebf9 and completed successfully.
    97 records added, 96 records removed in the database
  • Auto ran revision 45d3ebf9 and completed successfully.
    97 records added, 97 records removed in the database
  • Auto ran revision 45d3ebf9 and completed successfully.
    97 records added, 97 records removed in the database
  • Auto ran revision 45d3ebf9 and completed successfully.
    97 records added, 97 records removed in the database
  • ...
  • Created on morph.io


Scraper code

Python

sp_E3720_WCC_gov / scraper.py