woodbine / sp_t391_CR_tra

Scrapes www.crossrail.co.uk and 74f85f59f39b887b696f-ab656259048fb93837ecc0ecbcf0c557.r23.cf3.rackcdn.com

Crossrail is the new high frequency, high capacity railway for London and the South East. When the service opens Crossrail trains will travel from Maidenhead and Heathrow in the west to Shenfield and Abbey Wood in the east via new twin tunnels under central London. It will link Heathrow Airport, the West End, the City of London and Canary Wharf.


Contributors blablupcom

Last run completed successfully .

Console output of last run

Injecting configuration and compiling...  -----> Python app detected  ! The latest version of Python 2 is python-2.7.14 (you are using python-2.7.6, which is unsupported).  ! We recommend upgrading by specifying the latest version (python-2.7.14).  Learn More: https://devcenter.heroku.com/articles/python-runtimes -----> Installing python-2.7.6 -----> Installing pip -----> Installing requirements with pip  Obtaining scraperwiki from git+http://github.com/openaustralia/scraperwiki-python.git@morph_defaults#egg=scraperwiki (from -r /tmp/build/requirements.txt (line 1))  Cloning http://github.com/openaustralia/scraperwiki-python.git (to revision morph_defaults) to /app/.heroku/src/scraperwiki  Collecting lxml==3.4.4 (from -r /tmp/build/requirements.txt (line 2))  /app/.heroku/python/lib/python2.7/site-packages/pip/_vendor/urllib3/util/ssl_.py:339: SNIMissingWarning: An HTTPS request has been made, but the SNI (Subject Name Indication) extension to TLS is not available on this platform. This may cause the server to present an incorrect TLS certificate, which can cause validation failures. You can upgrade to a newer version of Python to solve this. For more information, see https://urllib3.readthedocs.io/en/latest/advanced-usage.html#ssl-warnings  SNIMissingWarning  /app/.heroku/python/lib/python2.7/site-packages/pip/_vendor/urllib3/util/ssl_.py:137: InsecurePlatformWarning: A true SSLContext object is not available. This prevents urllib3 from configuring SSL appropriately and may cause certain SSL connections to fail. You can upgrade to a newer version of Python to solve this. For more information, see https://urllib3.readthedocs.io/en/latest/advanced-usage.html#ssl-warnings  InsecurePlatformWarning  /app/.heroku/python/lib/python2.7/site-packages/pip/_vendor/urllib3/util/ssl_.py:137: InsecurePlatformWarning: A true SSLContext object is not available. This prevents urllib3 from configuring SSL appropriately and may cause certain SSL connections to fail. You can upgrade to a newer version of Python to solve this. For more information, see https://urllib3.readthedocs.io/en/latest/advanced-usage.html#ssl-warnings  InsecurePlatformWarning  Downloading https://files.pythonhosted.org/packages/63/c7/4f2a2a4ad6c6fa99b14be6b3c1cece9142e2d915aa7c43c908677afc8fa4/lxml-3.4.4.tar.gz (3.5MB)  Collecting cssselect==0.9.1 (from -r /tmp/build/requirements.txt (line 3))  Downloading https://files.pythonhosted.org/packages/aa/e5/9ee1460d485b94a6d55732eb7ad5b6c084caf73dd6f9cb0bb7d2a78fafe8/cssselect-0.9.1.tar.gz  Collecting beautifulsoup4 (from -r /tmp/build/requirements.txt (line 4))  Downloading https://files.pythonhosted.org/packages/a6/29/bcbd41a916ad3faf517780a0af7d0254e8d6722ff6414723eedba4334531/beautifulsoup4-4.6.0-py2-none-any.whl (86kB)  Collecting dumptruck>=0.1.2 (from scraperwiki->-r /tmp/build/requirements.txt (line 1))  Downloading https://files.pythonhosted.org/packages/15/27/3330a343de80d6849545b6c7723f8c9a08b4b104de964ac366e7e6b318df/dumptruck-0.1.6.tar.gz  Collecting requests (from scraperwiki->-r /tmp/build/requirements.txt (line 1))  Downloading https://files.pythonhosted.org/packages/49/df/50aa1999ab9bde74656c2919d9c0c085fd2b3775fd3eca826012bef76d8c/requests-2.18.4-py2.py3-none-any.whl (88kB)  Collecting chardet<3.1.0,>=3.0.2 (from requests->scraperwiki->-r /tmp/build/requirements.txt (line 1))  Downloading https://files.pythonhosted.org/packages/bc/a9/01ffebfb562e4274b6487b4bb1ddec7ca55ec7510b22e4c51f14098443b8/chardet-3.0.4-py2.py3-none-any.whl (133kB)  Collecting certifi>=2017.4.17 (from requests->scraperwiki->-r /tmp/build/requirements.txt (line 1))  Downloading https://files.pythonhosted.org/packages/7c/e6/92ad559b7192d846975fc916b65f667c7b8c3a32bea7372340bfe9a15fa5/certifi-2018.4.16-py2.py3-none-any.whl (150kB)  Collecting urllib3<1.23,>=1.21.1 (from requests->scraperwiki->-r /tmp/build/requirements.txt (line 1))  Downloading https://files.pythonhosted.org/packages/63/cb/6965947c13a94236f6d4b8223e21beb4d576dc72e8130bd7880f600839b8/urllib3-1.22-py2.py3-none-any.whl (132kB)  Collecting idna<2.7,>=2.5 (from requests->scraperwiki->-r /tmp/build/requirements.txt (line 1))  Downloading https://files.pythonhosted.org/packages/27/cc/6dd9a3869f15c2edfab863b992838277279ce92663d334df9ecf5106f5c6/idna-2.6-py2.py3-none-any.whl (56kB)  Installing collected packages: dumptruck, chardet, certifi, urllib3, idna, requests, scraperwiki, lxml, cssselect, beautifulsoup4  Running setup.py install for dumptruck: started  Running setup.py install for dumptruck: finished with status 'done'  Running setup.py develop for scraperwiki  Running setup.py install for lxml: started  Running setup.py install for lxml: still running...  Running setup.py install for lxml: finished with status 'done'  Running setup.py install for cssselect: started  Running setup.py install for cssselect: finished with status 'done'  Successfully installed beautifulsoup4-4.6.0 certifi-2018.4.16 chardet-3.0.4 cssselect-0.9.1 dumptruck-0.1.6 idna-2.6 lxml-3.4.4 requests-2.18.4 scraperwiki urllib3-1.22   ! Hello! It looks like your application is using an outdated version of Python.  ! This caused the security warning you saw above during the 'pip install' step.  ! We recommend 'python-3.6.2', which you can specify in a 'runtime.txt' file.  ! -- Much Love, Heroku.   -----> Discovering process types  Procfile declares types -> scraper Injecting scraper and running... t391_CR_tra_2010_Q0 t391_CR_tra_2010_Q0 t391_CR_tra_2010_Q0 t391_CR_tra_2010_Q0 t391_CR_tra_2010_Q0 t391_CR_tra_2011_Q0 t391_CR_tra_2011_Q0 t391_CR_tra_2011_Q0 t391_CR_tra_2011_Q0 t391_CR_tra_2011_Q0 t391_CR_tra_2011_Q0 t391_CR_tra_2011_Q0 t391_CR_tra_2011_Q0 t391_CR_tra_2011_Q0 t391_CR_tra_2011_Q0 t391_CR_tra_2011_Q0 t391_CR_tra_2011_Q0 t391_CR_tra_2011_Q0 t391_CR_tra_2012_Q0 t391_CR_tra_2012_Q0 t391_CR_tra_2012_Q0 t391_CR_tra_2012_Q0 t391_CR_tra_2012_Q0 t391_CR_tra_2012_Q0 t391_CR_tra_2012_Q0 t391_CR_tra_2012_Q0 t391_CR_tra_2012_Q0 t391_CR_tra_2012_Q0 t391_CR_tra_2012_Q0 t391_CR_tra_2012_Q0 t391_CR_tra_2012_Q0 t391_CR_tra_2013_Q0 t391_CR_tra_2013_Q0 t391_CR_tra_2013_Q0 t391_CR_tra_2013_Q0 t391_CR_tra_2013_Q0 t391_CR_tra_2013_Q0 t391_CR_tra_2013_Q0 t391_CR_tra_2013_Q0 t391_CR_tra_2013_Q0 t391_CR_tra_2013_Q0 t391_CR_tra_2013_Q0 t391_CR_tra_2013_Q0 t391_CR_tra_2013_Q0 t391_CR_tra_2014_Q0 t391_CR_tra_2014_Q0 t391_CR_tra_2014_Q0 t391_CR_tra_2014_Q0 t391_CR_tra_2014_Q0 t391_CR_tra_2014_Q0 t391_CR_tra_2014_Q0 t391_CR_tra_2014_Q0 t391_CR_tra_2014_Q0 t391_CR_tra_2014_Q0 t391_CR_tra_2014_Q0 t391_CR_tra_2014_Q0 t391_CR_tra_2014_Q0 t391_CR_tra_2015_Q0 t391_CR_tra_2015_Q0 t391_CR_tra_2015_Q0 t391_CR_tra_2015_Q0 t391_CR_tra_2015_Q0 t391_CR_tra_2015_Q0 t391_CR_tra_2015_Q0 t391_CR_tra_2015_Q0 t391_CR_tra_2015_Q0 t391_CR_tra_2015_Q0 t391_CR_tra_2015_Q0 t391_CR_tra_2015_Q0 t391_CR_tra_2015_Q0 t391_CR_tra_2016_Q0 t391_CR_tra_2016_Q0 t391_CR_tra_2016_Q0 t391_CR_tra_2016_Q0 t391_CR_tra_2016_Q0 t391_CR_tra_2016_Q0 t391_CR_tra_2016_Q0 t391_CR_tra_2016_Q0 t391_CR_tra_2016_Q0 t391_CR_tra_2016_Q0 t391_CR_tra_2016_Q0 t391_CR_tra_2016_Q0 t391_CR_tra_2016_Q0 t391_CR_tra_2017_Q0 t391_CR_tra_2017_Q0 t391_CR_tra_2017_Q0 t391_CR_tra_2017_Q0 t391_CR_tra_2017_Q0 t391_CR_tra_2017_Q0

Data

Downloaded 504 times by SimKennedy woodbine MikeRalphson

To download data sign in with GitHub

Download table (as CSV) Download SQLite database (46 KB) Use the API

rows 10 / 90

l f d
t391_CR_tra_2016_Q0
2016-06-15 03:10:52.102207
t391_CR_tra_2010_Q0
2018-04-24 04:39:44.775108
t391_CR_tra_2010_Q0
2018-04-24 04:39:45.282644
t391_CR_tra_2010_Q0
2018-04-24 04:39:45.481494
t391_CR_tra_2010_Q0
2018-04-24 04:39:46.039463
t391_CR_tra_2010_Q0
2018-04-24 04:39:46.641760
t391_CR_tra_2011_Q0
2018-04-24 04:39:47.243395
t391_CR_tra_2011_Q0
2018-04-24 04:39:49.194366
t391_CR_tra_2011_Q0
2018-04-24 04:39:49.891860
t391_CR_tra_2011_Q0
2018-04-24 04:39:50.450638

Statistics

Average successful run time: 3 minutes

Total run time: about 1 month

Total cpu time used: 20 minutes

Total disk space used: 70.7 KB

History

  • Auto ran revision c7ccc1e7 and completed successfully .
    89 records added, 89 records removed in the database
    98 pages scraped
  • Auto ran revision c7ccc1e7 and completed successfully .
    89 records added, 89 records removed in the database
    98 pages scraped
  • Auto ran revision c7ccc1e7 and completed successfully .
    89 records added, 89 records removed in the database
  • Auto ran revision c7ccc1e7 and completed successfully .
    89 records added, 89 records removed in the database
    98 pages scraped
  • Auto ran revision c7ccc1e7 and completed successfully .
    89 records added, 89 records removed in the database
    98 pages scraped
  • ...
  • Created on morph.io

Show complete history

Scraper code

Python

sp_t391_CR_tra / scraper.py