This is a scraper that runs on Morph. To get started, see the documentation.

Contributors: woodbine, blablupcom

Last run completed successfully.

Console output of last run

Injecting configuration and compiling...
-----> Python app detected
 !     The latest version of Python 2 is python-2.7.14 (you are using python-2.7.9, which is unsupported).
 !     We recommend upgrading by specifying the latest version (python-2.7.14).
       Learn More: https://devcenter.heroku.com/articles/python-runtimes
-----> Installing python-2.7.9
-----> Installing pip
-----> Installing requirements with pip
       DEPRECATION: Python 2.7 will reach the end of its life on January 1st, 2020. Please upgrade your Python as Python 2.7 won't be maintained after that date. A future version of pip will drop support for Python 2.7. More details about Python 2 support in pip, can be found at https://pip.pypa.io/en/latest/development/release-process/#python-2-support
       Collecting beautifulsoup4==4.2.0
         Downloading https://files.pythonhosted.org/packages/b3/fe/6888f755c0b9f66d4cc1f9ee0077a613a09e211acca3242a7335ffe7ed06/beautifulsoup4-4.2.0.tar.gz (63kB)
       Obtaining scraperwiki from git+http://github.com/openaustralia/scraperwiki-python.git@morph_defaults#egg=scraperwiki (from -r /tmp/build/requirements.txt (line 8))
         Cloning http://github.com/openaustralia/scraperwiki-python.git (to revision morph_defaults) to /app/.heroku/src/scraperwiki
         Running command git clone -q http://github.com/openaustralia/scraperwiki-python.git /app/.heroku/src/scraperwiki
         Running command git checkout -b morph_defaults --track origin/morph_defaults
         Switched to a new branch 'morph_defaults'
         Branch morph_defaults set up to track remote branch morph_defaults from origin.
       Collecting lxml==3.4.4
         Downloading https://files.pythonhosted.org/packages/63/c7/4f2a2a4ad6c6fa99b14be6b3c1cece9142e2d915aa7c43c908677afc8fa4/lxml-3.4.4.tar.gz (3.5MB)
       Collecting cssselect==0.9.1
         Downloading https://files.pythonhosted.org/packages/aa/e5/9ee1460d485b94a6d55732eb7ad5b6c084caf73dd6f9cb0bb7d2a78fafe8/cssselect-0.9.1.tar.gz
       Collecting dumptruck>=0.1.2
         Downloading https://files.pythonhosted.org/packages/15/27/3330a343de80d6849545b6c7723f8c9a08b4b104de964ac366e7e6b318df/dumptruck-0.1.6.tar.gz
       Collecting requests
         Downloading https://files.pythonhosted.org/packages/51/bd/23c926cd341ea6b7dd0b2a00aba99ae0f828be89d72b2190f27c11d4b7fb/requests-2.22.0-py2.py3-none-any.whl (57kB)
       Collecting certifi>=2017.4.17
         Downloading https://files.pythonhosted.org/packages/b9/63/df50cac98ea0d5b006c55a399c3bf1db9da7b5a24de7890bc9cfd5dd9e99/certifi-2019.11.28-py2.py3-none-any.whl (156kB)
       Collecting urllib3!=1.25.0,!=1.25.1,<1.26,>=1.21.1
         Downloading https://files.pythonhosted.org/packages/b4/40/a9837291310ee1ccc242ceb6ebfd9eb21539649f193a7c8c86ba15b98539/urllib3-1.25.7-py2.py3-none-any.whl (125kB)
       Collecting idna<2.9,>=2.5
         Downloading https://files.pythonhosted.org/packages/14/2c/cd551d81dbe15200be1cf41cd03869a46fe7226e7450af7a6545bfc474c9/idna-2.8-py2.py3-none-any.whl (58kB)
       Collecting chardet<3.1.0,>=3.0.2
         Downloading https://files.pythonhosted.org/packages/bc/a9/01ffebfb562e4274b6487b4bb1ddec7ca55ec7510b22e4c51f14098443b8/chardet-3.0.4-py2.py3-none-any.whl (133kB)
       Building wheels for collected packages: beautifulsoup4, lxml, cssselect, dumptruck
         Building wheel for beautifulsoup4 (setup.py): started
         Building wheel for beautifulsoup4 (setup.py): finished with status 'done'
         Created wheel for beautifulsoup4: filename=beautifulsoup4-4.2.0-cp27-none-any.whl size=71806 sha256=9f245a42a0bb61375139af08b5d947faff132f1037664cf1f2fc8c50f0cff7ed
         Stored in directory: /tmp/pip-ephem-wheel-cache-9nwg8X/wheels/3b/fb/48/5009f96b53a4d41ebb53c3e5598204dd3032b8db0110526e88
         Building wheel for lxml (setup.py): started
         Building wheel for lxml (setup.py): still running...
         Building wheel for lxml (setup.py): finished with status 'done'
         Created wheel for lxml: filename=lxml-3.4.4-cp27-cp27m-linux_x86_64.whl size=2989859 sha256=2ae2cabfbcda901bbe18edd9d6164cdc4c834a0476511b33ff0d8a303cb4d038
         Stored in directory: /tmp/pip-ephem-wheel-cache-9nwg8X/wheels/f6/df/7b/af9cace9baf95a6e4a2b5790e30da55fc780ddee598314d1ed
         Building wheel for cssselect (setup.py): started
         Building wheel for cssselect (setup.py): finished with status 'done'
         Created wheel for cssselect: filename=cssselect-0.9.1-cp27-none-any.whl size=26994 sha256=6953aa0dcfc7f29e11cfedbc6e1b5db01a6ed454a9ae78c77117a2408688402f
         Stored in directory: /tmp/pip-ephem-wheel-cache-9nwg8X/wheels/45/25/d7/5a3b06d22b1ffb616f868a74729a5a002bcc04d45109b4f223
         Building wheel for dumptruck (setup.py): started
         Building wheel for dumptruck (setup.py): finished with status 'done'
         Created wheel for dumptruck: filename=dumptruck-0.1.6-cp27-none-any.whl size=11845 sha256=8a1446b3e53221d85053c198d050d6acbe8111d72816a0f3a38054bd310f7d83
         Stored in directory: /tmp/pip-ephem-wheel-cache-9nwg8X/wheels/57/df/83/32654ae89119876c7a7db66829bbdb646caa151589dbaf226e
       Successfully built beautifulsoup4 lxml cssselect dumptruck
       Installing collected packages: beautifulsoup4, dumptruck, certifi, urllib3, idna, chardet, requests, scraperwiki, lxml, cssselect
         Running setup.py develop for scraperwiki
       Successfully installed beautifulsoup4-4.2.0 certifi-2019.11.28 chardet-3.0.4 cssselect-0.9.1 dumptruck-0.1.6 idna-2.8 lxml-3.4.4 requests-2.22.0 scraperwiki urllib3-1.25.7
       DEPRECATION: Python 2.7 will reach the end of its life on January 1st, 2020. Please upgrade your Python as Python 2.7 won't be maintained after that date. A future version of pip will drop support for Python 2.7. More details about Python 2 support in pip, can be found at https://pip.pypa.io/en/latest/development/release-process/#python-2-support
-----> Discovering process types
       Procfile declares types -> scraper
Injecting scraper and running...

Data

Downloaded 791 times by SimKennedy, woodbine, MikeRalphson

Download table (as CSV) Download SQLite database (17 KB) Use the API

rows 10 / 50

f d l
E3820_WSCC_gov_2016_03    2016-06-16 10:51:46.802345
E3820_WSCC_gov_2016_02    2016-06-16 10:51:48.024810
E3820_WSCC_gov_2016_01    2016-06-16 10:51:49.395262
E3820_WSCC_gov_2015_12    2016-06-16 10:51:50.946524
E3820_WSCC_gov_2015_11    2016-06-16 10:51:52.333700
E3820_WSCC_gov_2015_10    2016-06-16 10:51:53.519136
E3820_WSCC_gov_2015_09    2016-06-16 10:51:54.835552
E3820_WSCC_gov_2015_08    2016-06-16 10:51:56.757327
E3820_WSCC_gov_2015_07    2016-06-16 10:51:57.788285
E3820_WSCC_gov_2015_06    2016-06-16 10:51:58.945084
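
The rows above could be read back from the downloaded SQLite database along the following lines. This is only a sketch under stated assumptions: the table name `data`, the mapping of the identifier to column `l` and the timestamp to column `d`, and the helper names `latest_rows` and `demo_connection` are all hypothetical and not confirmed by this page. Column `f` is left empty because its values are not shown in the preview. An in-memory database stands in for the real file so the example is self-contained.

```python
import sqlite3
from datetime import datetime


def latest_rows(conn, limit=10):
    """Return (identifier, timestamp) pairs, most recent timestamp first.

    Assumes a table named `data` with text columns `l` and `d`.
    """
    cur = conn.execute(
        "SELECT l, d FROM data ORDER BY d DESC LIMIT ?", (limit,)
    )
    return [
        (label, datetime.strptime(stamp, "%Y-%m-%d %H:%M:%S.%f"))
        for label, stamp in cur
    ]


def demo_connection():
    """Build an in-memory stand-in holding two rows from the preview."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE data (f TEXT, d TEXT, l TEXT)")
    conn.executemany(
        "INSERT INTO data (f, d, l) VALUES (?, ?, ?)",
        [
            # f is unknown from the preview, so it is stored as NULL here.
            (None, "2016-06-16 10:51:46.802345", "E3820_WSCC_gov_2016_03"),
            (None, "2016-06-16 10:51:48.024810", "E3820_WSCC_gov_2016_02"),
        ],
    )
    return conn


if __name__ == "__main__":
    for label, stamp in latest_rows(demo_connection()):
        print(label, stamp.isoformat())
```

To run this against a real download, `demo_connection()` would be swapped for `sqlite3.connect(...)` pointing at the downloaded file, after checking the actual table and column names.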

Statistics

Average successful run time: 10 minutes

Total run time: about 1 month

Total cpu time used: 19 minutes

Total disk space used: 46.6 KB

History

  • Auto ran revision bbabae65 and completed successfully.
    nothing changed in the database
  • Auto ran revision bbabae65 and completed successfully.
    nothing changed in the database
  • Auto ran revision bbabae65 and completed successfully.
    nothing changed in the database
  • Auto ran revision bbabae65 and completed successfully.
    nothing changed in the database
  • Auto ran revision bbabae65 and completed successfully.
    nothing changed in the database
  • ...
  • Created on morph.io

Scraper code

Python

sp_E3820_WSCC_gov / scraper.py