blablupcom / sp_E4210_WMBC_gov

Scrapes www.wigan.gov.uk

Information for local residents, visitors and local businesses


Contributors blablupcom

Last run failed with status code 1.

Console output of last run

Injecting configuration and compiling...  -----> Python app detected -----> Installing python-2.7.6  $ pip install -r requirements.txt  Obtaining scraperwiki from git+http://github.com/openaustralia/scraperwiki-python.git@morph_defaults#egg=scraperwiki (from -r requirements.txt (line 6))  Cloning http://github.com/openaustralia/scraperwiki-python.git (to morph_defaults) to ./.heroku/src/scraperwiki  Collecting beautifulsoup4==4.2.0 (from -r requirements.txt (line 7))  /app/.heroku/python/lib/python2.7/site-packages/pip-8.1.2-py2.7.egg/pip/_vendor/requests/packages/urllib3/util/ssl_.py:318: SNIMissingWarning: An HTTPS request has been made, but the SNI (Subject Name Indication) extension to TLS is not available on this platform. This may cause the server to present an incorrect TLS certificate, which can cause validation failures. You can upgrade to a newer version of Python to solve this. For more information, see https://urllib3.readthedocs.org/en/latest/security.html#snimissingwarning.  SNIMissingWarning  /app/.heroku/python/lib/python2.7/site-packages/pip-8.1.2-py2.7.egg/pip/_vendor/requests/packages/urllib3/util/ssl_.py:122: InsecurePlatformWarning: A true SSLContext object is not available. This prevents urllib3 from configuring SSL appropriately and may cause certain SSL connections to fail. You can upgrade to a newer version of Python to solve this. For more information, see https://urllib3.readthedocs.org/en/latest/security.html#insecureplatformwarning.  InsecurePlatformWarning  Downloading beautifulsoup4-4.2.0.tar.gz (63kB)  Collecting grequests==0.2.0 (from -r requirements.txt (line 8))  Downloading grequests-0.2.0.tar.gz  Collecting lxml==3.4.4 (from -r requirements.txt (line 9))  Downloading lxml-3.4.4.tar.gz (3.5MB)  Collecting cssselect==0.9.1 (from -r requirements.txt (line 10))  Downloading cssselect-0.9.1.tar.gz  Collecting dumptruck>=0.1.2 (from scraperwiki->-r requirements.txt (line 6))  Downloading dumptruck-0.1.6.tar.gz  Collecting requests (from scraperwiki->-r requirements.txt (line 6))  Downloading requests-2.12.3-py2.py3-none-any.whl (575kB)  Collecting gevent (from grequests==0.2.0->-r requirements.txt (line 8))  Downloading gevent-1.1.2-cp27-cp27m-manylinux1_x86_64.whl (1.3MB)  Collecting greenlet>=0.4.9 (from gevent->grequests==0.2.0->-r requirements.txt (line 8))  Downloading greenlet-0.4.10-cp27-cp27m-manylinux1_x86_64.whl (41kB)  Installing collected packages: dumptruck, requests, scraperwiki, beautifulsoup4, greenlet, gevent, grequests, lxml, cssselect  Running setup.py install for dumptruck: started  Running setup.py install for dumptruck: finished with status 'done'  Running setup.py develop for scraperwiki  Running setup.py install for beautifulsoup4: started  Running setup.py install for beautifulsoup4: finished with status 'done'  Running setup.py install for grequests: started  Running setup.py install for grequests: finished with status 'done'  Running setup.py install for lxml: started  Running setup.py install for lxml: still running...  Running setup.py install for lxml: finished with status 'done'  Running setup.py install for cssselect: started  Running setup.py install for cssselect: finished with status 'done'  Successfully installed beautifulsoup4-4.2.0 cssselect-0.9.1 dumptruck-0.1.6 gevent-1.1.2 greenlet-0.4.10 grequests-0.2.0 lxml-3.4.4 requests-2.12.3 scraperwiki   ! Hello! It looks like your application is using an outdated version of Python.  ! This caused the security warning you saw above during the 'pip install' step.  ! We recommend 'python-2.7.12', which you can specify in a 'runtime.txt' file.  ! -- Much Love, Heroku.   -----> Discovering process types  Procfile declares types -> scraper Injecting scraper and running... /app/.heroku/python/lib/python2.7/site-packages/requests/packages/urllib3/util/ssl_.py:334: SNIMissingWarning: An HTTPS request has been made, but the SNI (Subject Name Indication) extension to TLS is not available on this platform. This may cause the server to present an incorrect TLS certificate, which can cause validation failures. You can upgrade to a newer version of Python to solve this. For more information, see https://urllib3.readthedocs.io/en/latest/advanced-usage.html#ssl-warnings SNIMissingWarning /app/.heroku/python/lib/python2.7/site-packages/requests/packages/urllib3/util/ssl_.py:132: InsecurePlatformWarning: A true SSLContext object is not available. This prevents urllib3 from configuring SSL appropriately and may cause certain SSL connections to fail. You can upgrade to a newer version of Python to solve this. For more information, see https://urllib3.readthedocs.io/en/latest/advanced-usage.html#ssl-warnings InsecurePlatformWarning E4210_WMBC_gov_2016_Q0 /app/.heroku/python/lib/python2.7/site-packages/requests/packages/urllib3/util/ssl_.py:132: InsecurePlatformWarning: A true SSLContext object is not available. This prevents urllib3 from configuring SSL appropriately and may cause certain SSL connections to fail. You can upgrade to a newer version of Python to solve this. For more information, see https://urllib3.readthedocs.io/en/latest/advanced-usage.html#ssl-warnings InsecurePlatformWarning Error validating URL. E4210_WMBC_gov_2016_Y1 *Error: Invalid URL* https://www.wigan.gov.uk/Docs/PDF/Council/Data-Protection-FOI/Open-Data/Opendata2015-16/OpenData500-2015-16.csv /app/.heroku/python/lib/python2.7/site-packages/requests/packages/urllib3/util/ssl_.py:132: InsecurePlatformWarning: A true SSLContext object is not available. This prevents urllib3 from configuring SSL appropriately and may cause certain SSL connections to fail. You can upgrade to a newer version of Python to solve this. For more information, see https://urllib3.readthedocs.io/en/latest/advanced-usage.html#ssl-warnings InsecurePlatformWarning Error validating URL. E4210_WMBC_gov_2015_Y1 *Error: Invalid URL* https://www.wigan.gov.uk/Docs/PDF/Council/Data-Protection-FOI/Open-Data/Opendata2014-15/OpenData500-2014-15.csv /app/.heroku/python/lib/python2.7/site-packages/requests/packages/urllib3/util/ssl_.py:132: InsecurePlatformWarning: A true SSLContext object is not available. This prevents urllib3 from configuring SSL appropriately and may cause certain SSL connections to fail. You can upgrade to a newer version of Python to solve this. For more information, see https://urllib3.readthedocs.io/en/latest/advanced-usage.html#ssl-warnings InsecurePlatformWarning Error validating URL. E4210_WMBC_gov_2014_Y1 *Error: Invalid URL* https://www.wigan.gov.uk/Docs/PDF/Council/Data-Protection-FOI/Open-Data/Opendata2013-14/OpenData500-2013-14.csv /app/.heroku/python/lib/python2.7/site-packages/requests/packages/urllib3/util/ssl_.py:132: InsecurePlatformWarning: A true SSLContext object is not available. This prevents urllib3 from configuring SSL appropriately and may cause certain SSL connections to fail. You can upgrade to a newer version of Python to solve this. For more information, see https://urllib3.readthedocs.io/en/latest/advanced-usage.html#ssl-warnings InsecurePlatformWarning E4210_WMBC_gov_2013_Y1 /app/.heroku/python/lib/python2.7/site-packages/requests/packages/urllib3/util/ssl_.py:132: InsecurePlatformWarning: A true SSLContext object is not available. This prevents urllib3 from configuring SSL appropriately and may cause certain SSL connections to fail. You can upgrade to a newer version of Python to solve this. For more information, see https://urllib3.readthedocs.io/en/latest/advanced-usage.html#ssl-warnings InsecurePlatformWarning E4210_WMBC_gov_2012_Y1 Traceback (most recent call last): File "scraper.py", line 145, in <module> raise Exception("%d errors occurred during scrape." % errors) Exception: 3 errors occurred during scrape.

Statistics

Average successful run time: 2 minutes

Total run time: 43 minutes

Total cpu time used: less than 20 seconds

Total disk space used: 2.44 MB

History

  • Auto ran revision 4b0def10 and failed .
    3 records added, 3 records removed in the database
    7 pages scraped
  • Auto ran revision 4b0def10 and failed .
    nothing changed in the database
  • Auto ran revision 4b0def10 and completed successfully .
    6 records added, 6 records removed in the database
  • Auto ran revision 4b0def10 and completed successfully .
    nothing changed in the database
    7 pages scraped
  • Auto ran revision 4b0def10 and failed .
    nothing changed in the database
    7 pages scraped
  • Auto ran revision 4b0def10 and completed successfully .
    6 records added, 6 records removed in the database
  • Auto ran revision 4b0def10 and completed successfully .
    nothing changed in the database
  • Auto ran revision 4b0def10 and completed successfully .
    6 records added, 6 records removed in the database
    7 pages scraped
  • Auto ran revision 4b0def10 and completed successfully .
    6 records added, 6 records removed in the database
  • Auto ran revision 4b0def10 and completed successfully .
    6 records added, 6 records removed in the database
  • Auto ran revision 4b0def10 and completed successfully .
    6 records added, 6 records removed in the database
    7 pages scraped
  • Auto ran revision 4b0def10 and completed successfully .
    6 records added, 6 records removed in the database
    7 pages scraped
  • Auto ran revision 4b0def10 and completed successfully .
    6 records added, 6 records removed in the database
    7 pages scraped
  • Auto ran revision 4b0def10 and completed successfully .
    6 records added, 6 records removed in the database
    7 pages scraped
  • Auto ran revision 4b0def10 and completed successfully .
    6 records added in the database
    7 pages scraped
  • Manually ran revision 4b0def10 and completed successfully .
    nothing changed in the database
  • Manually ran revision e9791fc7 and completed successfully .
    6 records added in the database
    7 pages scraped
  • Created on morph.io

Scraper code

Python

sp_E4210_WMBC_gov / scraper.py