tdproactisnewsite

Contributors ErinClark

Last run failed with status code 1.

Console output of last run

Injecting configuration and compiling...  -----> Python app detected  ! The latest version of Python 2 is python-2.7.14 (you are using python-2.7.6, which is unsupported).  ! We recommend upgrading by specifying the latest version (python-2.7.14).  Learn More: https://devcenter.heroku.com/articles/python-runtimes -----> Installing python-2.7.6 -----> Installing pip -----> Installing requirements with pip  Obtaining scraperwiki from git+http://github.com/openaustralia/scraperwiki-python.git@morph_defaults#egg=scraperwiki (from -r /tmp/build/requirements.txt (line 1))  Cloning http://github.com/openaustralia/scraperwiki-python.git (to revision morph_defaults) to /app/.heroku/src/scraperwiki  Collecting lxml==3.4.4 (from -r /tmp/build/requirements.txt (line 3))  /app/.heroku/python/lib/python2.7/site-packages/pip/_vendor/urllib3/util/ssl_.py:339: SNIMissingWarning: An HTTPS request has been made, but the SNI (Subject Name Indication) extension to TLS is not available on this platform. This may cause the server to present an incorrect TLS certificate, which can cause validation failures. You can upgrade to a newer version of Python to solve this. For more information, see https://urllib3.readthedocs.io/en/latest/advanced-usage.html#ssl-warnings  SNIMissingWarning  /app/.heroku/python/lib/python2.7/site-packages/pip/_vendor/urllib3/util/ssl_.py:137: InsecurePlatformWarning: A true SSLContext object is not available. This prevents urllib3 from configuring SSL appropriately and may cause certain SSL connections to fail. You can upgrade to a newer version of Python to solve this. For more information, see https://urllib3.readthedocs.io/en/latest/advanced-usage.html#ssl-warnings  InsecurePlatformWarning  /app/.heroku/python/lib/python2.7/site-packages/pip/_vendor/urllib3/util/ssl_.py:137: InsecurePlatformWarning: A true SSLContext object is not available. This prevents urllib3 from configuring SSL appropriately and may cause certain SSL connections to fail. You can upgrade to a newer version of Python to solve this. For more information, see https://urllib3.readthedocs.io/en/latest/advanced-usage.html#ssl-warnings  InsecurePlatformWarning  Downloading https://files.pythonhosted.org/packages/63/c7/4f2a2a4ad6c6fa99b14be6b3c1cece9142e2d915aa7c43c908677afc8fa4/lxml-3.4.4.tar.gz (3.5MB)  Collecting cssselect==0.9.1 (from -r /tmp/build/requirements.txt (line 4))  Downloading https://files.pythonhosted.org/packages/aa/e5/9ee1460d485b94a6d55732eb7ad5b6c084caf73dd6f9cb0bb7d2a78fafe8/cssselect-0.9.1.tar.gz  Collecting beautifulsoup4 (from -r /tmp/build/requirements.txt (line 5))  Downloading https://files.pythonhosted.org/packages/a6/29/bcbd41a916ad3faf517780a0af7d0254e8d6722ff6414723eedba4334531/beautifulsoup4-4.6.0-py2-none-any.whl (86kB)  Collecting python-dateutil (from -r /tmp/build/requirements.txt (line 6))  Downloading https://files.pythonhosted.org/packages/0c/57/19f3a65bcf6d5be570ee8c35a5398496e10a0ddcbc95393b2d17f86aaaf8/python_dateutil-2.7.2-py2.py3-none-any.whl (212kB)  Collecting selenium (from -r /tmp/build/requirements.txt (line 7))  Downloading https://files.pythonhosted.org/packages/5e/1f/6c2204b9ae14eddab615c5e2ee4956c65ed533e0a9986c23eabd801ae849/selenium-3.11.0-py2.py3-none-any.whl (943kB)  Collecting splinter>=0.7.3 (from -r /tmp/build/requirements.txt (line 8))  Downloading https://files.pythonhosted.org/packages/8f/81/ee550fb949a897c95aa3837b673b3d39ffc066234d9d3af9e06901448242/splinter-0.7.7-py2-none-any.whl  Collecting dumptruck>=0.1.2 (from scraperwiki->-r /tmp/build/requirements.txt (line 1))  Downloading https://files.pythonhosted.org/packages/15/27/3330a343de80d6849545b6c7723f8c9a08b4b104de964ac366e7e6b318df/dumptruck-0.1.6.tar.gz  Collecting requests (from scraperwiki->-r /tmp/build/requirements.txt (line 1))  Downloading https://files.pythonhosted.org/packages/49/df/50aa1999ab9bde74656c2919d9c0c085fd2b3775fd3eca826012bef76d8c/requests-2.18.4-py2.py3-none-any.whl (88kB)  Collecting six>=1.5 (from python-dateutil->-r /tmp/build/requirements.txt (line 6))  Downloading https://files.pythonhosted.org/packages/67/4b/141a581104b1f6397bfa78ac9d43d8ad29a7ca43ea90a2d863fe3056e86a/six-1.11.0-py2.py3-none-any.whl  Collecting chardet<3.1.0,>=3.0.2 (from requests->scraperwiki->-r /tmp/build/requirements.txt (line 1))  Downloading https://files.pythonhosted.org/packages/bc/a9/01ffebfb562e4274b6487b4bb1ddec7ca55ec7510b22e4c51f14098443b8/chardet-3.0.4-py2.py3-none-any.whl (133kB)  Collecting certifi>=2017.4.17 (from requests->scraperwiki->-r /tmp/build/requirements.txt (line 1))  Downloading https://files.pythonhosted.org/packages/7c/e6/92ad559b7192d846975fc916b65f667c7b8c3a32bea7372340bfe9a15fa5/certifi-2018.4.16-py2.py3-none-any.whl (150kB)  Collecting urllib3<1.23,>=1.21.1 (from requests->scraperwiki->-r /tmp/build/requirements.txt (line 1))  Downloading https://files.pythonhosted.org/packages/63/cb/6965947c13a94236f6d4b8223e21beb4d576dc72e8130bd7880f600839b8/urllib3-1.22-py2.py3-none-any.whl (132kB)  Collecting idna<2.7,>=2.5 (from requests->scraperwiki->-r /tmp/build/requirements.txt (line 1))  Downloading https://files.pythonhosted.org/packages/27/cc/6dd9a3869f15c2edfab863b992838277279ce92663d334df9ecf5106f5c6/idna-2.6-py2.py3-none-any.whl (56kB)  Installing collected packages: dumptruck, chardet, certifi, urllib3, idna, requests, scraperwiki, lxml, cssselect, beautifulsoup4, six, python-dateutil, selenium, splinter  Running setup.py install for dumptruck: started  Running setup.py install for dumptruck: finished with status 'done'  Running setup.py develop for scraperwiki  Running setup.py install for lxml: started  Running setup.py install for lxml: still running...  Running setup.py install for lxml: finished with status 'done'  Running setup.py install for cssselect: started  Running setup.py install for cssselect: finished with status 'done'  Successfully installed beautifulsoup4-4.6.0 certifi-2018.4.16 chardet-3.0.4 cssselect-0.9.1 dumptruck-0.1.6 idna-2.6 lxml-3.4.4 python-dateutil-2.7.2 requests-2.18.4 scraperwiki selenium-3.11.0 six-1.11.0 splinter-0.7.7 urllib3-1.22   ! Hello! It looks like your application is using an outdated version of Python.  ! This caused the security warning you saw above during the 'pip install' step.  ! We recommend 'python-3.6.2', which you can specify in a 'runtime.txt' file.  ! -- Much Love, Heroku.   -----> Discovering process types  Procfile declares types -> scraper Injecting scraper and running... https://supplierlive.proactisp2p.com/Account/Login /app/.heroku/python/lib/python2.7/site-packages/selenium/webdriver/phantomjs/webdriver.py:49: UserWarning: Selenium support for PhantomJS has been deprecated, please use headless versions of Chrome or Firefox instead warnings.warn('Selenium support for PhantomJS has been deprecated, please use headless ' Traceback (most recent call last): File "scraper.py", line 60, in <module> soups = get_tender_soups(portal) File "scraper.py", line 22, in get_tender_soups last_page = int(soup1.find('a', {"class":"k-link k-pager-nav k-pager-last"})['data-page']) TypeError: 'NoneType' object has no attribute '__getitem__'

Data

Downloaded 5 times by MikeRalphson ErinClark

To download data sign in with GitHub

Download table (as CSV) Download SQLite database (196 KB) Use the API

rows 10 / 597

tender_url reference customer_name closing_date title todays_date time_remaining
GSC1000041REQ
Flintshire County Council
2/16/2016 11:59 PM
Framework Agreement for the Hire of Grounds Maintenance Equipment (Season 2016)
2016-02-16 16:48:26.918915
7 hours 11 minutes
SREQ1000074
Staffordshire County Council
2/17/2016 11:59 PM
PC457 Highway Channel Sweeping
2016-02-17 23:07:36.707878
52 minutes 9 seconds
ERFX1000003
Somerset County Council
2/19/2016 11:59 PM
Demand Responsive Transport via Somerset County Coucil's DPS
2016-02-19 22:42:01.094739
1 hour 17 minutes
RQST11310
Caerphilly County Borough Council
2/23/2016 11:59 PM
The Provision of the Maintenance and Support of VM Ware Licences for Caerphilly CBC
2016-02-23 19:47:26.770955
4 hours 12 minutes
WKS1000011REQ
Denbighshire County Council
2/25/2016 11:59 PM
Rhyl Harbour Maintenance Land-Based Dredging
2016-02-25 05:27:20.598579
18 hours 32 minutes
EREQ1000776
Torfaen County Borough Council
9/24/2019 11:59 PM
DPS for Home to School Transport
2016-02-25 05:27:20.598579
More than a year
REQD1001535
Bristol City Council
2/26/2016 11:59 PM
CON - Construction of New Build Council Homes - Ashcroft & West Parade
2016-02-26 19:57:02.133733
4 hours 2 minutes
ERFX1001823
The City of Cardiff Council
3/2/2016 11:59 PM
DPS for Provision of Passenger Transport (27)
2016-03-01 23:36:05.563793
1 day 0 hours
RFx127
Department for Education
3/2/2016 11:59 PM
High Potential Senior Leaders Residual Cohort Delivery (HPSL)
2016-03-01 23:36:05.563793
1 day 0 hours
RQST11344
Caerphilly County Borough Council
3/3/2016 11:59 PM
Supply of 1 x New Wheel Front Loader Caerphilly County Borough Council Waste Transfer Station
2016-03-03 20:28:47.131363
3 hours 30 minutes

Statistics

Average successful run time: 2 minutes

Total run time: 20 days

Total cpu time used: 26 minutes

Total disk space used: 221 KB

History

  • Auto ran revision 150d888d and failed .
    nothing changed in the database
  • Auto ran revision 150d888d and failed .
    nothing changed in the database
  • Auto ran revision 150d888d and failed .
    nothing changed in the database
    33 pages scraped
  • Auto ran revision 150d888d and failed .
    nothing changed in the database
  • Auto ran revision 150d888d and failed .
    nothing changed in the database
    34 pages scraped
  • ...
  • Created on morph.io

Show complete history

Scraper code

Python

td_proactis_newsite / scraper.py