This is a scraper that runs on Morph. To get started, see the documentation.

Contributors: woodbine, blablupcom

Last run completed successfully.

Console output of last run

Injecting configuration and compiling...
-----> Python app detected
 !     The latest version of Python 2 is python-2.7.14 (you are using python-2.7.9, which is unsupported).
 !     We recommend upgrading by specifying the latest version (python-2.7.14).
       Learn More: https://devcenter.heroku.com/articles/python-runtimes
-----> Installing python-2.7.9
-----> Installing pip
-----> Installing requirements with pip
       DEPRECATION: Python 2.7 reached the end of its life on January 1st, 2020. Please upgrade your Python as Python 2.7 is no longer maintained. A future version of pip will drop support for Python 2.7. More details about Python 2 support in pip, can be found at https://pip.pypa.io/en/latest/development/release-process/#python-2-support
       Obtaining scraperwiki from git+http://github.com/openaustralia/scraperwiki-python.git@morph_defaults#egg=scraperwiki (from -r /tmp/build/requirements.txt (line 1))
         Cloning http://github.com/openaustralia/scraperwiki-python.git (to revision morph_defaults) to /app/.heroku/src/scraperwiki
         Running command git clone -q http://github.com/openaustralia/scraperwiki-python.git /app/.heroku/src/scraperwiki
         Running command git checkout -b morph_defaults --track origin/morph_defaults
         Switched to a new branch 'morph_defaults'
         Branch morph_defaults set up to track remote branch morph_defaults from origin.
       Collecting lxml==3.4.4
         Downloading lxml-3.4.4.tar.gz (3.5 MB)
       Collecting cssselect==0.9.1
         Downloading cssselect-0.9.1.tar.gz (32 kB)
       Collecting beautifulsoup4
         Downloading beautifulsoup4-4.8.2-py2-none-any.whl (106 kB)
       Collecting dumptruck>=0.1.2
         Downloading dumptruck-0.1.6.tar.gz (15 kB)
       Collecting requests
         Downloading requests-2.22.0-py2.py3-none-any.whl (57 kB)
       Collecting soupsieve>=1.2
         Downloading soupsieve-1.9.5-py2.py3-none-any.whl (33 kB)
       Collecting certifi>=2017.4.17
         Downloading certifi-2019.11.28-py2.py3-none-any.whl (156 kB)
       Collecting urllib3!=1.25.0,!=1.25.1,<1.26,>=1.21.1
         Downloading urllib3-1.25.8-py2.py3-none-any.whl (125 kB)
       Collecting chardet<3.1.0,>=3.0.2
         Downloading chardet-3.0.4-py2.py3-none-any.whl (133 kB)
       Collecting idna<2.9,>=2.5
         Downloading idna-2.8-py2.py3-none-any.whl (58 kB)
       Collecting backports.functools-lru-cache; python_version < "3"
         Downloading backports.functools_lru_cache-1.6.1-py2.py3-none-any.whl (5.7 kB)
       Building wheels for collected packages: lxml, cssselect, dumptruck
         Building wheel for lxml (setup.py): started
         Building wheel for lxml (setup.py): still running...
         Building wheel for lxml (setup.py): finished with status 'done'
         Created wheel for lxml: filename=lxml-3.4.4-cp27-cp27m-linux_x86_64.whl size=2989835 sha256=f3157015c5cdbf29ba003b14f0d9f298f087eb9c8e84aed59d6b046bf785a958
         Stored in directory: /tmp/pip-ephem-wheel-cache-fOqd8T/wheels/d6/de/81/11ae6edd05c75aac677e67dd154c85da758ba6f3e8e80e962e
         Building wheel for cssselect (setup.py): started
         Building wheel for cssselect (setup.py): finished with status 'done'
         Created wheel for cssselect: filename=cssselect-0.9.1-py2-none-any.whl size=26994 sha256=efdb36937e5857f9e35f642466fbd857469ec700a9d0f74c371e1655d95f4506
         Stored in directory: /tmp/pip-ephem-wheel-cache-fOqd8T/wheels/85/fe/00/b94036d8583cec9791d8cda24c184f2d2ac1397822f7f0e8d4
         Building wheel for dumptruck (setup.py): started
         Building wheel for dumptruck (setup.py): finished with status 'done'
         Created wheel for dumptruck: filename=dumptruck-0.1.6-py2-none-any.whl size=11842 sha256=ed86811a93e3807eac36337174d6599d51f77031cba64b9149751ff0da030c02
         Stored in directory: /tmp/pip-ephem-wheel-cache-fOqd8T/wheels/dc/75/e9/1e61c4080c73e7bda99614549591f83b53bcc2d682f26fce62
       Successfully built lxml cssselect dumptruck
       Installing collected packages: dumptruck, certifi, urllib3, chardet, idna, requests, scraperwiki, lxml, cssselect, backports.functools-lru-cache, soupsieve, beautifulsoup4
         Running setup.py develop for scraperwiki
       Successfully installed backports.functools-lru-cache-1.6.1 beautifulsoup4-4.8.2 certifi-2019.11.28 chardet-3.0.4 cssselect-0.9.1 dumptruck-0.1.6 idna-2.8 lxml-3.4.4 requests-2.22.0 scraperwiki soupsieve-1.9.5 urllib3-1.25.8
       DEPRECATION: Python 2.7 reached the end of its life on January 1st, 2020. Please upgrade your Python as Python 2.7 is no longer maintained. A future version of pip will drop support for Python 2.7. More details about Python 2 support in pip, can be found at https://pip.pypa.io/en/latest/development/release-process/#python-2-support
-----> Discovering process types
       Procfile declares types -> scraper
Injecting scraper and running...
E3720_WCC_gov_2018_07
E3720_WCC_gov_2011_04
E3720_WCC_gov_2012_04
E3720_WCC_gov_2013_04
E3720_WCC_gov_2014_04
E3720_WCC_gov_2015_04
E3720_WCC_gov_2016_04
E3720_WCC_gov_2017_04
E3720_WCC_gov_2018_04
E3720_WCC_gov_2019_04
E3720_WCC_gov_2011_08
E3720_WCC_gov_2012_08
E3720_WCC_gov_2013_08
E3720_WCC_gov_2014_08
E3720_WCC_gov_2015_08
E3720_WCC_gov_2016_08
E3720_WCC_gov_2017_08
E3720_WCC_gov_2018_08
E3720_WCC_gov_2019_08
E3720_WCC_gov_2010_12
E3720_WCC_gov_2011_12
E3720_WCC_gov_2011_12
E3720_WCC_gov_2012_12
E3720_WCC_gov_2013_12
E3720_WCC_gov_2014_12
E3720_WCC_gov_2015_12
E3720_WCC_gov_2016_12
E3720_WCC_gov_2017_12
E3720_WCC_gov_2018_12
E3720_WCC_gov_2011_02
E3720_WCC_gov_2012_02
E3720_WCC_gov_2012_02
E3720_WCC_gov_2013_02
E3720_WCC_gov_2014_02
E3720_WCC_gov_2015_02
E3720_WCC_gov_2016_02
E3720_WCC_gov_2017_02
E3720_WCC_gov_2018_02
E3720_WCC_gov_2019_02
E3720_WCC_gov_2011_01
E3720_WCC_gov_2012_01
E3720_WCC_gov_2012_01
E3720_WCC_gov_2013_01
E3720_WCC_gov_2014_01
E3720_WCC_gov_2015_01
E3720_WCC_gov_2016_01
E3720_WCC_gov_2017_01
E3720_WCC_gov_2018_01
E3720_WCC_gov_2019_01
E3720_WCC_gov_2011_07
E3720_WCC_gov_2012_07
E3720_WCC_gov_2013_07
E3720_WCC_gov_2014_07
E3720_WCC_gov_2014_07
E3720_WCC_gov_2015_07
E3720_WCC_gov_2016_07
E3720_WCC_gov_2017_07
E3720_WCC_gov_2019_07
E3720_WCC_gov_2011_06
E3720_WCC_gov_2012_06
E3720_WCC_gov_2013_06
E3720_WCC_gov_2014_06
E3720_WCC_gov_2015_06
E3720_WCC_gov_2016_06
E3720_WCC_gov_2017_06
E3720_WCC_gov_2018_06
E3720_WCC_gov_2019_06
E3720_WCC_gov_2011_03
E3720_WCC_gov_2012_03
E3720_WCC_gov_2012_03
E3720_WCC_gov_2013_03
E3720_WCC_gov_2014_03
E3720_WCC_gov_2015_03
E3720_WCC_gov_2016_03
E3720_WCC_gov_2017_03
E3720_WCC_gov_2018_03
E3720_WCC_gov_2019_03
E3720_WCC_gov_2011_05
E3720_WCC_gov_2012_05
E3720_WCC_gov_2013_05
E3720_WCC_gov_2014_05
E3720_WCC_gov_2015_05
E3720_WCC_gov_2016_05
E3720_WCC_gov_2017_05
E3720_WCC_gov_2018_05
E3720_WCC_gov_2019_05
E3720_WCC_gov_2010_11
E3720_WCC_gov_2011_11
E3720_WCC_gov_2011_11
E3720_WCC_gov_2012_11
E3720_WCC_gov_2013_11
E3720_WCC_gov_2014_11
E3720_WCC_gov_2015_11
E3720_WCC_gov_2016_11
E3720_WCC_gov_2017_11
E3720_WCC_gov_2018_11
E3720_WCC_gov_2019_11
E3720_WCC_gov_2011_10
E3720_WCC_gov_2011_10

Data

Downloaded 833 times by SimKennedy, woodbine, MikeRalphson, blablupcom

Download table (as CSV) Download SQLite database (65 KB) Use the API

Showing 10 of 213 rows

d                           f                      l
2018-02-08 23:31:53.251748  E3720_WCC_gov_2015_02
2018-02-08 23:32:27.337050  E3720_WCC_gov_2014_07  https://s3-eu-west-1.amazonaws.com/opendata/supplierpayments/all-july-2014.csv
2018-11-26 02:56:21.216661  E3720_WCC_gov_2016_09
2018-11-26 02:56:24.384195  E3720_WCC_gov_2017_09
2019-02-04 10:11:44.285298  E3720_WCC_gov_2018_07
2019-02-04 10:11:46.296672  E3720_WCC_gov_2011_04
2019-02-04 10:11:48.467975  E3720_WCC_gov_2012_04
2019-02-04 10:11:50.421670  E3720_WCC_gov_2013_04
2019-02-04 10:11:52.671064  E3720_WCC_gov_2014_04
2019-02-04 10:11:54.774276  E3720_WCC_gov_2015_04
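
For readers who download the SQLite database, the following is a minimal sketch of how the rows above might be queried and how the identifiers in column f could be split into a year and a month. It makes several assumptions that the page does not confirm: that the table is named data, that an empty l cell is stored as NULL, and that the last two parts of each identifier are a year and a month (the 2014_07 row pointing at all-july-2014.csv suggests so). The function parse_identifier and the pattern ID_PATTERN are hypothetical names introduced for illustration, and the sketch targets Python 3 rather than the Python 2.7 runtime used by the scraper. An in-memory database stands in for the real file so the example runs on its own.

```python
import re
import sqlite3

# Assumption: the download holds one table named "data" with columns d, f, l.
# An in-memory copy with three of the rows shown above stands in for the file.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE data (d TEXT, f TEXT, l TEXT)")
conn.executemany(
    "INSERT INTO data (d, f, l) VALUES (?, ?, ?)",
    [
        # Assumption: an empty l cell is stored as NULL (None in Python).
        ("2018-02-08 23:31:53.251748", "E3720_WCC_gov_2015_02", None),
        (
            "2018-02-08 23:32:27.337050",
            "E3720_WCC_gov_2014_07",
            "https://s3-eu-west-1.amazonaws.com/opendata/supplierpayments/all-july-2014.csv",
        ),
        ("2018-11-26 02:56:21.216661", "E3720_WCC_gov_2016_09", None),
    ],
)

# Hypothetical pattern: a prefix, then a four-digit year and a two-digit month.
ID_PATTERN = re.compile(r"^(?P<prefix>.+)_(?P<year>\d{4})_(?P<month>\d{2})$")


def parse_identifier(identifier):
    """Split an identifier such as E3720_WCC_gov_2014_07 into its parts."""
    match = ID_PATTERN.match(identifier)
    if match is None:
        raise ValueError("unexpected identifier format: %r" % identifier)
    return match.group("prefix"), int(match.group("year")), int(match.group("month"))


# Read every row, oldest timestamp first.
rows = conn.execute("SELECT d, f, l FROM data ORDER BY d").fetchall()

# Derive (prefix, year, month) for each row, and list rows that carry a link.
periods = [parse_identifier(f) for _, f, _ in rows]
with_link = [f for _, f, l in rows if l is not None]

for (d, f, l), (_, year, month) in zip(rows, periods):
    print(d, f, year, month, l or "(no link)")
```

To run this against the real download, replace ":memory:" with the path to the downloaded file and drop the CREATE TABLE and INSERT statements; the table name should be checked first, since it is assumed here.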

Statistics

Average successful run time: 9 minutes

Total run time: about 1 month

Total cpu time used: about 8 hours

Total disk space used: 103 KB

History

  • Auto ran revision 45d3ebf9 and completed successfully.
    98 records added, 98 records removed in the database
  • Auto ran revision 45d3ebf9 and completed successfully.
    98 records added, 98 records removed in the database
  • Auto ran revision 45d3ebf9 and completed successfully.
    98 records added, 98 records removed in the database
  • Auto ran revision 45d3ebf9 and completed successfully.
    98 records added, 98 records removed in the database
  • Auto ran revision 45d3ebf9 and completed successfully.
    98 records added, 98 records removed in the database
  • ...
  • Created on morph.io

Scraper code

Python

sp_E3720_WCC_gov / scraper.py