OKWiki scraper

Scraping the recent changes of the OKwiki (http://wiki.okfn.org)

Contributors mihi-tr

The scraper is running. It was queued automatically .

Console output of last run

Injecting configuration and compiling...  -----> Python app detected  ! The latest version of Python 2 is python-2.7.14 (you are using python-2.7.6, which is unsupported).  ! We recommend upgrading by specifying the latest version (python-2.7.14).  Learn More: https://devcenter.heroku.com/articles/python-runtimes -----> Installing python-2.7.6 -----> Installing pip -----> Installing requirements with pip  Collecting lxml (from -r /tmp/build/requirements.txt (line 1))  /app/.heroku/python/lib/python2.7/site-packages/pip/_vendor/urllib3/util/ssl_.py:369: SNIMissingWarning: An HTTPS request has been made, but the SNI (Server Name Indication) extension to TLS is not available on this platform. This may cause the server to present an incorrect TLS certificate, which can cause validation failures. You can upgrade to a newer version of Python to solve this. For more information, see https://urllib3.readthedocs.io/en/latest/advanced-usage.html#ssl-warnings  SNIMissingWarning  /app/.heroku/python/lib/python2.7/site-packages/pip/_vendor/urllib3/util/ssl_.py:160: InsecurePlatformWarning: A true SSLContext object is not available. This prevents urllib3 from configuring SSL appropriately and may cause certain SSL connections to fail. You can upgrade to a newer version of Python to solve this. For more information, see https://urllib3.readthedocs.io/en/latest/advanced-usage.html#ssl-warnings  InsecurePlatformWarning  /app/.heroku/python/lib/python2.7/site-packages/pip/_vendor/urllib3/util/ssl_.py:160: InsecurePlatformWarning: A true SSLContext object is not available. This prevents urllib3 from configuring SSL appropriately and may cause certain SSL connections to fail. You can upgrade to a newer version of Python to solve this. For more information, see https://urllib3.readthedocs.io/en/latest/advanced-usage.html#ssl-warnings  InsecurePlatformWarning  Downloading https://files.pythonhosted.org/packages/98/69/eb6eb6746ffbb5020794a8b8cfe62ad2cab6884dac93eb743c6dc6655991/lxml-4.2.4-cp27-cp27m-manylinux1_x86_64.whl (5.8MB)  Installing collected packages: lxml  Successfully installed lxml-4.2.4   ! Hello! It looks like your application is using an outdated version of Python.  ! This caused the security warning you saw above during the 'pip install' step.  ! We recommend 'python-3.6.2', which you can specify in a 'runtime.txt' file.  ! -- Much Love, Heroku.   -----> Discovering process types  Procfile declares types -> scraper Injecting scraper and running...

Statistics

Average successful run time: less than 20 seconds

Total run time: about 20 hours

Total cpu time used: 7 minutes

Total disk space used: 35.5 KB

History

  • Auto ran revision ebd374e8 and failed .
    nothing changed in the database
    21 pages scraped
  • Auto ran revision ebd374e8 and failed .
    nothing changed in the database
  • Auto ran revision ebd374e8 and failed .
    nothing changed in the database
  • Auto ran revision ebd374e8 and failed .
    nothing changed in the database
  • Auto ran revision ebd374e8 and failed .
    nothing changed in the database
  • ...
  • Created on morph.io

Show complete history

Scraper code

Python

okwiki-scraper / scraper.py