This is a scraper that runs on Morph. To get started see the documentation

Contributors woodbine henare blablupcom

Last run failed with status code 1.

Console output of last run

Injecting configuration and compiling...  -----> Python app detected  ! The latest version of Python 2 is python-2.7.14 (you are using python-2.7.9, which is unsupported).  ! We recommend upgrading by specifying the latest version (python-2.7.14).  Learn More: https://devcenter.heroku.com/articles/python-runtimes -----> Installing python-2.7.9 -----> Installing pip -----> Installing requirements with pip  DEPRECATION: Python 2.7 will reach the end of its life on January 1st, 2020. Please upgrade your Python as Python 2.7 won't be maintained after that date. A future version of pip will drop support for Python 2.7. More details about Python 2 support in pip, can be found at https://pip.pypa.io/en/latest/development/release-process/#python-2-support  Collecting beautifulsoup4==4.2.0  Downloading https://files.pythonhosted.org/packages/b3/fe/6888f755c0b9f66d4cc1f9ee0077a613a09e211acca3242a7335ffe7ed06/beautifulsoup4-4.2.0.tar.gz (63kB)  Obtaining scraperwiki from git+http://github.com/openaustralia/scraperwiki-python.git@morph_defaults#egg=scraperwiki (from -r /tmp/build/requirements.txt (line 4))  Cloning http://github.com/openaustralia/scraperwiki-python.git (to revision morph_defaults) to /app/.heroku/src/scraperwiki  Running command git clone -q http://github.com/openaustralia/scraperwiki-python.git /app/.heroku/src/scraperwiki  Running command git checkout -b morph_defaults --track origin/morph_defaults  Switched to a new branch 'morph_defaults'  Branch morph_defaults set up to track remote branch morph_defaults from origin.  Collecting dumptruck>=0.1.2  Downloading https://files.pythonhosted.org/packages/15/27/3330a343de80d6849545b6c7723f8c9a08b4b104de964ac366e7e6b318df/dumptruck-0.1.6.tar.gz  Collecting requests  Downloading https://files.pythonhosted.org/packages/51/bd/23c926cd341ea6b7dd0b2a00aba99ae0f828be89d72b2190f27c11d4b7fb/requests-2.22.0-py2.py3-none-any.whl (57kB)  Collecting certifi>=2017.4.17  Downloading https://files.pythonhosted.org/packages/b9/63/df50cac98ea0d5b006c55a399c3bf1db9da7b5a24de7890bc9cfd5dd9e99/certifi-2019.11.28-py2.py3-none-any.whl (156kB)  Collecting urllib3!=1.25.0,!=1.25.1,<1.26,>=1.21.1  Downloading https://files.pythonhosted.org/packages/b4/40/a9837291310ee1ccc242ceb6ebfd9eb21539649f193a7c8c86ba15b98539/urllib3-1.25.7-py2.py3-none-any.whl (125kB)  Collecting idna<2.9,>=2.5  Downloading https://files.pythonhosted.org/packages/14/2c/cd551d81dbe15200be1cf41cd03869a46fe7226e7450af7a6545bfc474c9/idna-2.8-py2.py3-none-any.whl (58kB)  Collecting chardet<3.1.0,>=3.0.2  Downloading https://files.pythonhosted.org/packages/bc/a9/01ffebfb562e4274b6487b4bb1ddec7ca55ec7510b22e4c51f14098443b8/chardet-3.0.4-py2.py3-none-any.whl (133kB)  Building wheels for collected packages: beautifulsoup4, dumptruck  Building wheel for beautifulsoup4 (setup.py): started  Building wheel for beautifulsoup4 (setup.py): finished with status 'done'  Created wheel for beautifulsoup4: filename=beautifulsoup4-4.2.0-cp27-none-any.whl size=71806 sha256=d334e65a7ee20438b13db9363e4a8cd11d23006f8b311063ed823fe438043888  Stored in directory: /tmp/pip-ephem-wheel-cache-p9pwka/wheels/3b/fb/48/5009f96b53a4d41ebb53c3e5598204dd3032b8db0110526e88  Building wheel for dumptruck (setup.py): started  Building wheel for dumptruck (setup.py): finished with status 'done'  Created wheel for dumptruck: filename=dumptruck-0.1.6-cp27-none-any.whl size=11845 sha256=93f5751c1e3baef30432feb6f204fdcfd88908f98eba3b262994566f60c64cbc  Stored in directory: /tmp/pip-ephem-wheel-cache-p9pwka/wheels/57/df/83/32654ae89119876c7a7db66829bbdb646caa151589dbaf226e  Successfully built beautifulsoup4 dumptruck  Installing collected packages: beautifulsoup4, dumptruck, certifi, urllib3, idna, chardet, requests, scraperwiki  Running setup.py develop for scraperwiki  Successfully installed beautifulsoup4-4.2.0 certifi-2019.11.28 chardet-3.0.4 dumptruck-0.1.6 idna-2.8 requests-2.22.0 scraperwiki urllib3-1.25.7 DEPRECATION: Python 2.7 will reach the end of its life on January 1st, 2020. Please upgrade your Python as Python 2.7 won't be maintained after that date. A future version of pip will drop support for Python 2.7. More details about Python 2 support in pip, can be found at https://pip.pypa.io/en/latest/development/release-process/#python-2-support    -----> Discovering process types  Procfile declares types -> scraper Injecting scraper and running... Traceback (most recent call last): File "scraper.py", line 107, in <module> url = link['href'] File "/app/.heroku/python/lib/python2.7/site-packages/bs4/element.py", line 892, in __getitem__ return self.attrs[key] KeyError: 'href'

Data

Downloaded 821 times by SimKennedy woodbine MikeRalphson

To download data sign in with GitHub

Download table (as CSV) Download SQLite database (54 KB) Use the API

rows 10 / 122

f d l
E3520_SCC_gov_2015_08
2015-11-05 07:30:44.393847
E3520_SCC_gov_2015_08
2015-11-05 07:30:51.944453
E3520_SCC_gov_2016_01
2016-04-04 01:26:46.860070
E3520_SCC_gov_2015_12
2016-04-04 01:26:57.732484
E3520_SCC_gov_2015_11
2016-04-04 01:27:04.470437
E3520_SCC_gov_2015_10
2016-04-04 01:27:09.283733
E3520_SCC_gov_2015_09
2016-04-04 01:27:18.096753
E3520_SCC_gov_2015_08
2016-04-04 01:27:22.096502
E3520_SCC_gov_2016_07
2016-12-04 15:33:29.190942
E3520_SCC_gov_2013_12
2017-05-22 05:01:11.189042

Statistics

Average successful run time: 3 minutes

Total run time: 3 days

Total cpu time used: 18 minutes

Total disk space used: 86.2 KB

History

  • Auto ran revision 75dd0b31 and failed .
    nothing changed in the database
  • Auto ran revision 75dd0b31 and failed .
    nothing changed in the database
  • Auto ran revision 75dd0b31 and failed .
    nothing changed in the database
  • Auto ran revision 75dd0b31 and failed .
    nothing changed in the database
  • Auto ran revision 75dd0b31 and failed .
    nothing changed in the database
  • ...
  • Created on morph.io

Show complete history

Scraper code

Python

sp_E3520_SCC_gov / scraper.py