mobeets / wikipedia-signpost

new posts on Wikipedia's The Signpost


Checks for new articles in Wikipedia's The Signpost.

This is a scraper that runs on Morph.

Contributors mobeets

Last run failed with status code 1.

Console output of last run

Injecting configuration and compiling...  -----> Python app detected  ! The latest version of Python 2 is python-2.7.14 (you are using python-2.7.9, which is unsupported).  ! We recommend upgrading by specifying the latest version (python-2.7.14).  Learn More: https://devcenter.heroku.com/articles/python-runtimes -----> Installing python-2.7.9 -----> Installing pip -----> Installing requirements with pip  DEPRECATION: Python 2.7 reached the end of its life on January 1st, 2020. Please upgrade your Python as Python 2.7 is no longer maintained. pip 21.0 will drop support for Python 2.7 in January 2021. More details about Python 2 support in pip can be found at https://pip.pypa.io/en/latest/development/release-process/#python-2-support pip 21.0 will remove support for this functionality.  Obtaining scraperwiki from git+http://github.com/openaustralia/scraperwiki-python.git@morph_defaults#egg=scraperwiki (from -r /tmp/build/requirements.txt (line 6))  Cloning http://github.com/openaustralia/scraperwiki-python.git (to revision morph_defaults) to /app/.heroku/src/scraperwiki  Running command git clone -q http://github.com/openaustralia/scraperwiki-python.git /app/.heroku/src/scraperwiki  Running command git checkout -b morph_defaults --track origin/morph_defaults  Switched to a new branch 'morph_defaults'  Branch morph_defaults set up to track remote branch morph_defaults from origin.  Collecting lxml==3.4.4  Downloading lxml-3.4.4.tar.gz (3.5 MB)  Collecting cssselect==0.9.1  Downloading cssselect-0.9.1.tar.gz (32 kB)  Collecting BeautifulSoup==3.2.1  Downloading BeautifulSoup-3.2.1.tar.gz (31 kB)  Collecting dumptruck>=0.1.2  Downloading dumptruck-0.1.6.tar.gz (15 kB)  Collecting requests  Downloading requests-2.27.1-py2.py3-none-any.whl (63 kB)  Collecting idna<3,>=2.5; python_version < "3"  Downloading idna-2.10-py2.py3-none-any.whl (58 kB)  Collecting certifi>=2017.4.17  Downloading certifi-2021.10.8-py2.py3-none-any.whl (149 kB)  Collecting chardet<5,>=3.0.2; python_version < "3"  Downloading chardet-4.0.0-py2.py3-none-any.whl (178 kB)  Collecting urllib3<1.27,>=1.21.1  Downloading urllib3-1.26.9-py2.py3-none-any.whl (138 kB)  Building wheels for collected packages: lxml, cssselect, BeautifulSoup, dumptruck  Building wheel for lxml (setup.py): started  Building wheel for lxml (setup.py): still running...  Building wheel for lxml (setup.py): finished with status 'done'  Created wheel for lxml: filename=lxml-3.4.4-cp27-cp27m-linux_x86_64.whl size=2989862 sha256=2af7d257d9e0ccdb5fa26e2d4de8027d738d864bde7e3c44ab6c358ce52f3df1  Stored in directory: /tmp/pip-ephem-wheel-cache-VP383v/wheels/d6/de/81/11ae6edd05c75aac677e67dd154c85da758ba6f3e8e80e962e  Building wheel for cssselect (setup.py): started  Building wheel for cssselect (setup.py): finished with status 'done'  Created wheel for cssselect: filename=cssselect-0.9.1-py2-none-any.whl size=26992 sha256=c6555991a61e1911ba2526474b2956519a0076cb1ef20a84d0fc569498f38077  Stored in directory: /tmp/pip-ephem-wheel-cache-VP383v/wheels/85/fe/00/b94036d8583cec9791d8cda24c184f2d2ac1397822f7f0e8d4  Building wheel for BeautifulSoup (setup.py): started  Building wheel for BeautifulSoup (setup.py): finished with status 'done'  Created wheel for BeautifulSoup: filename=BeautifulSoup-3.2.1-py2-none-any.whl size=31960 sha256=4352f10e1d88eb67e6e5bd2a03ee8503b9132ccc59156460b00aa60a7ef4dbe6  Stored in directory: /tmp/pip-ephem-wheel-cache-VP383v/wheels/4d/ca/f6/2638be1fa1df72e30c9f0264c6e4fd77b97eb0044aa8083e12  Building wheel for dumptruck (setup.py): started  Building wheel for dumptruck (setup.py): finished with status 'done'  Created wheel for dumptruck: filename=dumptruck-0.1.6-py2-none-any.whl size=11844 sha256=80e0f8255f437cf71a8fc959788f21909502af0c065bb57c65d1750c207dc9e6  Stored in directory: /tmp/pip-ephem-wheel-cache-VP383v/wheels/dc/75/e9/1e61c4080c73e7bda99614549591f83b53bcc2d682f26fce62  Successfully built lxml cssselect BeautifulSoup dumptruck  Installing collected packages: dumptruck, idna, certifi, chardet, urllib3, requests, scraperwiki, lxml, cssselect, BeautifulSoup  Running setup.py develop for scraperwiki  Successfully installed BeautifulSoup-3.2.1 certifi-2021.10.8 chardet-4.0.0 cssselect-0.9.1 dumptruck-0.1.6 idna-2.10 lxml-3.4.4 requests-2.27.1 scraperwiki urllib3-1.26.9 DEPRECATION: Python 2.7 reached the end of its life on January 1st, 2020. Please upgrade your Python as Python 2.7 is no longer maintained. pip 21.0 will drop support for Python 2.7 in January 2021. More details about Python 2 support in pip can be found at https://pip.pypa.io/en/latest/development/release-process/#python-2-support pip 21.0 will remove support for this functionality.    -----> Discovering process types  Procfile declares types -> scraper Injecting scraper and running... Traceback (most recent call last): File "scraper.py", line 16, in <module> html = urllib2.urlopen(URL).read() File "/app/.heroku/python/lib/python2.7/urllib2.py", line 154, in urlopen return opener.open(url, data, timeout) File "/app/.heroku/python/lib/python2.7/urllib2.py", line 431, in open response = self._open(req, data) File "/app/.heroku/python/lib/python2.7/urllib2.py", line 449, in _open '_open', req) File "/app/.heroku/python/lib/python2.7/urllib2.py", line 409, in _call_chain result = func(*args) File "/app/.heroku/python/lib/python2.7/urllib2.py", line 1240, in https_open context=self._context) File "/app/.heroku/python/lib/python2.7/urllib2.py", line 1197, in do_open raise URLError(err) urllib2.URLError: <urlopen error [SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed (_ssl.c:581)>

Data

Downloaded 4 times by MikeRalphson

Statistics

Average successful run time: 1 minute

Total run time: 3 days

Total cpu time used: 28 minutes

Total disk space used: 80.8 KB

History

  • Auto ran revision 13b8adbf and failed .
    nothing changed in the database
  • Auto ran revision 13b8adbf and failed .
    nothing changed in the database
  • Auto ran revision 13b8adbf and failed .
    nothing changed in the database
  • Auto ran revision 13b8adbf and failed .
    nothing changed in the database
  • Auto ran revision 13b8adbf and failed .
    nothing changed in the database
  • ...
  • Created on morph.io

Show complete history

Scraper code

Python

wikipedia-signpost / scraper.py