This is a scraper that runs on Morph. To get started see the documentation

Contributors woodbine blablupcom

Last run completed successfully .

Console output of last run

Injecting configuration and compiling...  -----> Python app detected  ! The latest version of Python 2 is python-2.7.14 (you are using python-2.7.9, which is unsupported).  ! We recommend upgrading by specifying the latest version (python-2.7.14).  Learn More: https://devcenter.heroku.com/articles/python-runtimes -----> Installing python-2.7.9 -----> Installing pip -----> Installing requirements with pip  Obtaining scraperwiki from git+http://github.com/openaustralia/scraperwiki-python.git@morph_defaults#egg=scraperwiki (from -r /tmp/build/requirements.txt (line 1))  Cloning http://github.com/openaustralia/scraperwiki-python.git (to revision morph_defaults) to /app/.heroku/src/scraperwiki  Collecting lxml==3.4.4 (from -r /tmp/build/requirements.txt (line 2))  Downloading https://files.pythonhosted.org/packages/63/c7/4f2a2a4ad6c6fa99b14be6b3c1cece9142e2d915aa7c43c908677afc8fa4/lxml-3.4.4.tar.gz (3.5MB)  Collecting cssselect==0.9.1 (from -r /tmp/build/requirements.txt (line 3))  Downloading https://files.pythonhosted.org/packages/aa/e5/9ee1460d485b94a6d55732eb7ad5b6c084caf73dd6f9cb0bb7d2a78fafe8/cssselect-0.9.1.tar.gz  Collecting beautifulsoup4 (from -r /tmp/build/requirements.txt (line 4))  Downloading https://files.pythonhosted.org/packages/a6/29/bcbd41a916ad3faf517780a0af7d0254e8d6722ff6414723eedba4334531/beautifulsoup4-4.6.0-py2-none-any.whl (86kB)  Collecting dumptruck>=0.1.2 (from scraperwiki->-r /tmp/build/requirements.txt (line 1))  Downloading https://files.pythonhosted.org/packages/15/27/3330a343de80d6849545b6c7723f8c9a08b4b104de964ac366e7e6b318df/dumptruck-0.1.6.tar.gz  Collecting requests (from scraperwiki->-r /tmp/build/requirements.txt (line 1))  Downloading https://files.pythonhosted.org/packages/65/47/7e02164a2a3db50ed6d8a6ab1d6d60b69c4c3fdf57a284257925dfc12bda/requests-2.19.1-py2.py3-none-any.whl (91kB)  Collecting idna<2.8,>=2.5 (from requests->scraperwiki->-r /tmp/build/requirements.txt (line 1))  Downloading https://files.pythonhosted.org/packages/4b/2a/0276479a4b3caeb8a8c1af2f8e4355746a97fab05a372e4a2c6a6b876165/idna-2.7-py2.py3-none-any.whl (58kB)  Collecting certifi>=2017.4.17 (from requests->scraperwiki->-r /tmp/build/requirements.txt (line 1))  Downloading https://files.pythonhosted.org/packages/7c/e6/92ad559b7192d846975fc916b65f667c7b8c3a32bea7372340bfe9a15fa5/certifi-2018.4.16-py2.py3-none-any.whl (150kB)  Collecting urllib3<1.24,>=1.21.1 (from requests->scraperwiki->-r /tmp/build/requirements.txt (line 1))  Downloading https://files.pythonhosted.org/packages/bd/c9/6fdd990019071a4a32a5e7cb78a1d92c53851ef4f56f62a3486e6a7d8ffb/urllib3-1.23-py2.py3-none-any.whl (133kB)  Collecting chardet<3.1.0,>=3.0.2 (from requests->scraperwiki->-r /tmp/build/requirements.txt (line 1))  Downloading https://files.pythonhosted.org/packages/bc/a9/01ffebfb562e4274b6487b4bb1ddec7ca55ec7510b22e4c51f14098443b8/chardet-3.0.4-py2.py3-none-any.whl (133kB)  Installing collected packages: dumptruck, idna, certifi, urllib3, chardet, requests, scraperwiki, lxml, cssselect, beautifulsoup4  Running setup.py install for dumptruck: started  Running setup.py install for dumptruck: finished with status 'done'  Running setup.py develop for scraperwiki  Running setup.py install for lxml: started  Running setup.py install for lxml: still running...  Running setup.py install for lxml: finished with status 'done'  Running setup.py install for cssselect: started  Running setup.py install for cssselect: finished with status 'done'  Successfully installed beautifulsoup4-4.6.0 certifi-2018.4.16 chardet-3.0.4 cssselect-0.9.1 dumptruck-0.1.6 idna-2.7 lxml-3.4.4 requests-2.19.1 scraperwiki urllib3-1.23   -----> Discovering process types  Procfile declares types -> scraper Injecting scraper and running... E3720_WCC_gov_2011_04 E3720_WCC_gov_2012_04 E3720_WCC_gov_2013_04 E3720_WCC_gov_2014_04 E3720_WCC_gov_2015_04 E3720_WCC_gov_2016_04 E3720_WCC_gov_2017_04 E3720_WCC_gov_2018_04 E3720_WCC_gov_2011_08 E3720_WCC_gov_2012_08 E3720_WCC_gov_2013_08 E3720_WCC_gov_2014_08 E3720_WCC_gov_2015_08 E3720_WCC_gov_2016_08 E3720_WCC_gov_2017_08 E3720_WCC_gov_2010_12 E3720_WCC_gov_2011_12 E3720_WCC_gov_2011_12 E3720_WCC_gov_2012_12 E3720_WCC_gov_2013_12 E3720_WCC_gov_2014_12 E3720_WCC_gov_2015_12 E3720_WCC_gov_2016_12 E3720_WCC_gov_2017_12 E3720_WCC_gov_2011_02 E3720_WCC_gov_2012_02 E3720_WCC_gov_2012_02 E3720_WCC_gov_2013_02 E3720_WCC_gov_2014_02 E3720_WCC_gov_2015_02 E3720_WCC_gov_2016_02 E3720_WCC_gov_2017_02 E3720_WCC_gov_2018_02 E3720_WCC_gov_2011_01 E3720_WCC_gov_2012_01 E3720_WCC_gov_2012_01 E3720_WCC_gov_2013_01 E3720_WCC_gov_2014_01 E3720_WCC_gov_2015_01 E3720_WCC_gov_2016_01 E3720_WCC_gov_2017_01 E3720_WCC_gov_2018_01 E3720_WCC_gov_2011_07 E3720_WCC_gov_2012_07 E3720_WCC_gov_2013_07 E3720_WCC_gov_2014_07 E3720_WCC_gov_2014_07 E3720_WCC_gov_2015_07 E3720_WCC_gov_2016_07 E3720_WCC_gov_2017_07 E3720_WCC_gov_2011_06 E3720_WCC_gov_2012_06 E3720_WCC_gov_2013_06 E3720_WCC_gov_2014_06 E3720_WCC_gov_2015_06 E3720_WCC_gov_2016_06 E3720_WCC_gov_2017_06 E3720_WCC_gov_2011_03 E3720_WCC_gov_2012_03 E3720_WCC_gov_2012_03 E3720_WCC_gov_2013_03 E3720_WCC_gov_2014_03 E3720_WCC_gov_2015_03 E3720_WCC_gov_2016_03 E3720_WCC_gov_2017_03 E3720_WCC_gov_2018_03 E3720_WCC_gov_2011_05 E3720_WCC_gov_2012_05 E3720_WCC_gov_2013_05 E3720_WCC_gov_2014_05 E3720_WCC_gov_2015_05 E3720_WCC_gov_2016_05 E3720_WCC_gov_2017_05 E3720_WCC_gov_2010_11 E3720_WCC_gov_2011_11 E3720_WCC_gov_2011_11 E3720_WCC_gov_2012_11 E3720_WCC_gov_2013_11 E3720_WCC_gov_2014_11 E3720_WCC_gov_2015_11 E3720_WCC_gov_2016_11 E3720_WCC_gov_2017_11 E3720_WCC_gov_2011_10 E3720_WCC_gov_2011_10 E3720_WCC_gov_2012_10 E3720_WCC_gov_2013_10 E3720_WCC_gov_2014_10 E3720_WCC_gov_2015_10 E3720_WCC_gov_2016_10 E3720_WCC_gov_2017_10 E3720_WCC_gov_2011_09 E3720_WCC_gov_2012_09 E3720_WCC_gov_2013_09 E3720_WCC_gov_2014_09 E3720_WCC_gov_2015_09 E3720_WCC_gov_2016_09 E3720_WCC_gov_2017_09

Data

Downloaded 575 times by SimKennedy woodbine MikeRalphson blablupcom

To download data sign in with GitHub

Download table (as CSV) Download SQLite database (28 KB) Use the API

rows 10 / 98

d f l
2018-02-08 23:31:53.251748
E3720_WCC_gov_2015_02
2018-02-08 23:32:27.337050
E3720_WCC_gov_2014_07
https://s3-eu-west-1.amazonaws.com/opendata/supplierpayments/all-july-2014.csv
2018-07-20 11:26:06.656382
E3720_WCC_gov_2011_04
2018-07-20 11:26:10.014719
E3720_WCC_gov_2012_04
2018-07-20 11:26:14.316527
E3720_WCC_gov_2013_04
2018-07-20 11:26:21.260939
E3720_WCC_gov_2014_04
2018-07-20 11:26:38.202371
E3720_WCC_gov_2015_04
2018-07-20 11:26:46.637975
E3720_WCC_gov_2016_04
2018-07-20 11:26:50.235590
E3720_WCC_gov_2017_04
2018-07-20 11:26:53.284774
E3720_WCC_gov_2018_04

Statistics

Average successful run time: 10 minutes

Total run time: about 1 month

Total cpu time used: about 3 hours

Total disk space used: 66.4 KB

History

  • Auto ran revision 45d3ebf9 and completed successfully .
    96 records added, 96 records removed in the database
    303 pages scraped
  • Auto ran revision 45d3ebf9 and completed successfully .
    96 records added, 96 records removed in the database
    303 pages scraped
  • Auto ran revision 45d3ebf9 and completed successfully .
    96 records added, 96 records removed in the database
  • Auto ran revision 45d3ebf9 and completed successfully .
    96 records added, 96 records removed in the database
    303 pages scraped
  • Auto ran revision 45d3ebf9 and completed successfully .
    96 records added, 96 records removed in the database
    303 pages scraped
  • ...
  • Created on morph.io

Show complete history

Scraper code

Python

sp_E3720_WCC_gov / scraper.py