This is a scraper that runs on Morph. To get started see the documentation

Contributors woodbine henare blablupcom

Last run failed with status code 1.

Console output of last run

Injecting configuration and compiling...  -----> Python app detected  ! The latest version of Python 2 is python-2.7.14 (you are using python-2.7.9, which is unsupported).  ! We recommend upgrading by specifying the latest version (python-2.7.14).  Learn More: https://devcenter.heroku.com/articles/python-runtimes -----> Installing python-2.7.9 -----> Installing pip -----> Installing requirements with pip  DEPRECATION: Python 2.7 reached the end of its life on January 1st, 2020. Please upgrade your Python as Python 2.7 is no longer maintained. pip 21.0 will drop support for Python 2.7 in January 2021. More details about Python 2 support in pip, can be found at https://pip.pypa.io/en/latest/development/release-process/#python-2-support  Collecting beautifulsoup4==4.2.0  Downloading beautifulsoup4-4.2.0.tar.gz (63 kB)  Obtaining scraperwiki from git+http://github.com/openaustralia/scraperwiki-python.git@morph_defaults#egg=scraperwiki (from -r /tmp/build/requirements.txt (line 4))  Cloning http://github.com/openaustralia/scraperwiki-python.git (to revision morph_defaults) to /app/.heroku/src/scraperwiki  Running command git clone -q http://github.com/openaustralia/scraperwiki-python.git /app/.heroku/src/scraperwiki  Running command git checkout -b morph_defaults --track origin/morph_defaults  Switched to a new branch 'morph_defaults'  Branch morph_defaults set up to track remote branch morph_defaults from origin.  Collecting dumptruck>=0.1.2  Downloading dumptruck-0.1.6.tar.gz (15 kB)  Collecting requests  Downloading requests-2.23.0-py2.py3-none-any.whl (58 kB)  Collecting urllib3!=1.25.0,!=1.25.1,<1.26,>=1.21.1  Downloading urllib3-1.25.9-py2.py3-none-any.whl (126 kB)  Collecting certifi>=2017.4.17  Downloading certifi-2020.4.5.1-py2.py3-none-any.whl (157 kB)  Collecting chardet<4,>=3.0.2  Downloading chardet-3.0.4-py2.py3-none-any.whl (133 kB)  Collecting idna<3,>=2.5  Downloading idna-2.9-py2.py3-none-any.whl (58 kB)  Building wheels for collected packages: beautifulsoup4, dumptruck  Building wheel for beautifulsoup4 (setup.py): started  Building wheel for beautifulsoup4 (setup.py): finished with status 'done'  Created wheel for beautifulsoup4: filename=beautifulsoup4-4.2.0-py2-none-any.whl size=71804 sha256=5574ad0dee53f6e9f97d74f3f862aa0ba44108f2a5e85c773065a32331271fec  Stored in directory: /tmp/pip-ephem-wheel-cache-4s5e1n/wheels/dc/69/36/2edde9ec10080447fc56d1a3d1235ddba3dd4b0dc9fff29134  Building wheel for dumptruck (setup.py): started  Building wheel for dumptruck (setup.py): finished with status 'done'  Created wheel for dumptruck: filename=dumptruck-0.1.6-py2-none-any.whl size=11842 sha256=61dbb7f2b6086da1a95c7a75e8a0978d1390f610ba67b67ddf7669a20147474f  Stored in directory: /tmp/pip-ephem-wheel-cache-4s5e1n/wheels/dc/75/e9/1e61c4080c73e7bda99614549591f83b53bcc2d682f26fce62  Successfully built beautifulsoup4 dumptruck  Installing collected packages: beautifulsoup4, dumptruck, urllib3, certifi, chardet, idna, requests, scraperwiki  Running setup.py develop for scraperwiki  Successfully installed beautifulsoup4-4.2.0 certifi-2020.4.5.1 chardet-3.0.4 dumptruck-0.1.6 idna-2.9 requests-2.23.0 scraperwiki urllib3-1.25.9 DEPRECATION: Python 2.7 reached the end of its life on January 1st, 2020. Please upgrade your Python as Python 2.7 is no longer maintained. pip 21.0 will drop support for Python 2.7 in January 2021. More details about Python 2 support in pip, can be found at https://pip.pypa.io/en/latest/development/release-process/#python-2-support    -----> Discovering process types  Procfile declares types -> scraper Injecting scraper and running... Traceback (most recent call last): File "scraper.py", line 107, in <module> url = link['href'] File "/app/.heroku/python/lib/python2.7/site-packages/bs4/element.py", line 892, in __getitem__ return self.attrs[key] KeyError: 'href'

Data

Downloaded 821 times by SimKennedy woodbine MikeRalphson

To download data sign in with GitHub

Download table (as CSV) Download SQLite database (54 KB) Use the API

rows 10 / 122

f d l
E3520_SCC_gov_2015_08
2015-11-05 07:30:44.393847
E3520_SCC_gov_2015_08
2015-11-05 07:30:51.944453
E3520_SCC_gov_2016_01
2016-04-04 01:26:46.860070
E3520_SCC_gov_2015_12
2016-04-04 01:26:57.732484
E3520_SCC_gov_2015_11
2016-04-04 01:27:04.470437
E3520_SCC_gov_2015_10
2016-04-04 01:27:09.283733
E3520_SCC_gov_2015_09
2016-04-04 01:27:18.096753
E3520_SCC_gov_2015_08
2016-04-04 01:27:22.096502
E3520_SCC_gov_2016_07
2016-12-04 15:33:29.190942
E3520_SCC_gov_2013_12
2017-05-22 05:01:11.189042

Statistics

Average successful run time: 3 minutes

Total run time: 3 days

Total cpu time used: 19 minutes

Total disk space used: 86.2 KB

History

  • Auto ran revision 75dd0b31 and failed .
    nothing changed in the database
  • Auto ran revision 75dd0b31 and failed .
    nothing changed in the database
  • Auto ran revision 75dd0b31 and failed .
    nothing changed in the database
  • Auto ran revision 75dd0b31 and failed .
    nothing changed in the database
  • Auto ran revision 75dd0b31 and failed .
    nothing changed in the database
  • ...
  • Created on morph.io

Show complete history

Scraper code

Python

sp_E3520_SCC_gov / scraper.py