jeffreyliu / archivers-template

Generic template for Archivers projects


Contributors jeffreyliu

Last run completed successfully.

Console output of last run

Injecting configuration and compiling...

-----> Python app detected
-----> Installing python-2.7.9
-----> Noticed cffi. Bootstrapping libffi.
       $ pip install -r requirements.txt
       Collecting asn1crypto==0.22.0 (from -r /tmp/build/requirements.txt (line 1))
       Downloading asn1crypto-0.22.0-py2.py3-none-any.whl (97kB)
       Collecting certifi==2017.4.17 (from -r /tmp/build/requirements.txt (line 2))
       Downloading certifi-2017.4.17-py2.py3-none-any.whl (375kB)
       Collecting cffi==1.10.0 (from -r /tmp/build/requirements.txt (line 3))
       Downloading cffi-1.10.0-cp27-cp27m-manylinux1_x86_64.whl (394kB)
       Collecting chardet==3.0.3 (from -r /tmp/build/requirements.txt (line 4))
       Downloading chardet-3.0.3-py2.py3-none-any.whl (133kB)
       Collecting cryptography==1.9 (from -r /tmp/build/requirements.txt (line 5))
       Downloading cryptography-1.9.tar.gz (409kB)
       Collecting cssselect==0.9.1 (from -r /tmp/build/requirements.txt (line 6))
       Downloading cssselect-0.9.1.tar.gz
       Collecting DateTime==4.2 (from -r /tmp/build/requirements.txt (line 7))
       Downloading DateTime-4.2-py2.py3-none-any.whl (60kB)
       Collecting dumptruck==0.1.6 (from -r /tmp/build/requirements.txt (line 8))
       Downloading dumptruck-0.1.6.tar.gz
       Collecting enum34==1.1.6 (from -r /tmp/build/requirements.txt (line 9))
       Downloading enum34-1.1.6-py2-none-any.whl
       Collecting idna==2.5 (from -r /tmp/build/requirements.txt (line 10))
       Downloading idna-2.5-py2.py3-none-any.whl (55kB)
       Collecting ipaddress==1.0.18 (from -r /tmp/build/requirements.txt (line 11))
       Downloading ipaddress-1.0.18-py2-none-any.whl
       Collecting lxml==3.4.4 (from -r /tmp/build/requirements.txt (line 12))
       Downloading lxml-3.4.4.tar.gz (3.5MB)
       Collecting pycparser==2.17 (from -r /tmp/build/requirements.txt (line 13))
       Downloading pycparser-2.17.tar.gz (231kB)
       Collecting pyOpenSSL==17.0.0 (from -r /tmp/build/requirements.txt (line 14))
       Downloading pyOpenSSL-17.0.0-py2.py3-none-any.whl (51kB)
       Collecting pytz==2017.2 (from -r /tmp/build/requirements.txt (line 15))
       Downloading pytz-2017.2-py2.py3-none-any.whl (484kB)
       Collecting requests==2.17.3 (from -r /tmp/build/requirements.txt (line 16))
       Downloading requests-2.17.3-py2.py3-none-any.whl (87kB)
       Obtaining scraperwiki from git+http://github.com/openaustralia/scraperwiki-python.git@732dda1982a3b2073f6341a6a24f9df1bda77fa0#egg=scraperwiki (from -r /tmp/build/requirements.txt (line 17))
       Cloning http://github.com/openaustralia/scraperwiki-python.git (to 732dda1982a3b2073f6341a6a24f9df1bda77fa0) to /app/.heroku/src/scraperwiki
       Could not find a tag or branch '732dda1982a3b2073f6341a6a24f9df1bda77fa0', assuming commit.
       Collecting six==1.10.0 (from -r /tmp/build/requirements.txt (line 18))
       Downloading six-1.10.0-py2.py3-none-any.whl
       Collecting urllib3==1.21.1 (from -r /tmp/build/requirements.txt (line 19))
       Downloading urllib3-1.21.1-py2.py3-none-any.whl (131kB)
       Collecting zope.interface==4.4.1 (from -r /tmp/build/requirements.txt (line 20))
       Downloading zope.interface-4.4.1-cp27-cp27m-manylinux1_x86_64.whl (169kB)
       Installing collected packages: asn1crypto, certifi, pycparser, cffi, chardet, idna, six, enum34, ipaddress, cryptography, cssselect, pytz, zope.interface, DateTime, dumptruck, lxml, pyOpenSSL, urllib3, requests, scraperwiki
       Running setup.py install for pycparser: started
       Running setup.py install for pycparser: finished with status 'done'
       Running setup.py install for cryptography: started
       Running setup.py install for cryptography: finished with status 'done'
       Running setup.py install for cssselect: started
       Running setup.py install for cssselect: finished with status 'done'
       Running setup.py install for dumptruck: started
       Running setup.py install for dumptruck: finished with status 'done'
       Running setup.py install for lxml: started
       Running setup.py install for lxml: still running...
       Running setup.py install for lxml: finished with status 'done'
       Running setup.py develop for scraperwiki
       Successfully installed DateTime-4.2 asn1crypto-0.22.0 certifi-2017.4.17 cffi-1.10.0 chardet-3.0.3 cryptography-1.9 cssselect-0.9.1 dumptruck-0.1.6 enum34-1.1.6 idna-2.5 ipaddress-1.0.18 lxml-3.4.4 pyOpenSSL-17.0.0 pycparser-2.17 pytz-2017.2 requests-2.17.3 scraperwiki six-1.10.0 urllib3-1.21.1 zope.interface-4.4.1

-----> Discovering process types
       Procfile declares types -> scraper
Injecting scraper and running...

Data

Downloaded 0 times


rows 2 / 2

run_id url UUID timestamp body_content body_SHA256 headers
1
0000
2017-06-07 21:13:30.107072
<!doctype html> <html> <head> <title>Example Domain</title> <meta charset="utf-8" /> <meta http-equiv="Content-type" content="text/html; charset=utf-8" /> <meta name="viewport" content="width=device-width, initial-scale=1" /> <style type="text/css"> body { background-color: #f0f0f2; margin: 0; padding: 0; font-family: "Open Sans", "Helvetica Neue", Helvetica, Arial, sans-serif; } div { width: 600px; margin: 5em auto; padding: 50px; background-color: #fff; border-radius: 1em; } a:link, a:visited { color: #38488f; text-decoration: none; } @media (max-width: 700px) { body { background-color: #fff; } div { width: auto; margin: 0 auto; border-radius: 0; padding: 1em; } } </style> </head> <body> <div> <h1>Example Domain</h1> <p>This domain is established to be used for illustrative examples in documents. You may use this domain in examples without prior coordination or asking for permission.</p> <p><a href="http://www.iana.org/domains/example">More information...</a></p> </div> </body> </html>
3587cb776ce0e4e8237f215800b7dffba0f25865cb84550e87ea8bbac838c423
{"Accept-Ranges": "bytes", "Content-Encoding": "gzip", "Etag": "\"359670651+gzip\"", "Date": "Wed, 07 Jun 2017 21:13:30 GMT", "Expires": "Wed, 14 Jun 2017 21:13:30 GMT", "Content-Length": "606", "Server": "ECS (rhv/818F)", "Vary": "Accept-Encoding", "Cache-Control": "max-age=604800", "X-Cache": "HIT", "Content-Type": "text/html", "Last-Modified": "Fri, 09 Aug 2013 23:54:35 GMT"}
2
0000
2017-06-08 22:00:19.666165
<!doctype html> <html> <head> <title>Example Domain</title> <meta charset="utf-8" /> <meta http-equiv="Content-type" content="text/html; charset=utf-8" /> <meta name="viewport" content="width=device-width, initial-scale=1" /> <style type="text/css"> body { background-color: #f0f0f2; margin: 0; padding: 0; font-family: "Open Sans", "Helvetica Neue", Helvetica, Arial, sans-serif; } div { width: 600px; margin: 5em auto; padding: 50px; background-color: #fff; border-radius: 1em; } a:link, a:visited { color: #38488f; text-decoration: none; } @media (max-width: 700px) { body { background-color: #fff; } div { width: auto; margin: 0 auto; border-radius: 0; padding: 1em; } } </style> </head> <body> <div> <h1>Example Domain</h1> <p>This domain is established to be used for illustrative examples in documents. You may use this domain in examples without prior coordination or asking for permission.</p> <p><a href="http://www.iana.org/domains/example">More information...</a></p> </div> </body> </html>
3587cb776ce0e4e8237f215800b7dffba0f25865cb84550e87ea8bbac838c423
{"Expires": "Thu, 15 Jun 2017 22:00:19 GMT", "Content-Type": "text/html", "Content-Encoding": "gzip", "Date": "Thu, 08 Jun 2017 22:00:19 GMT", "Etag": "\"359670651+gzip\"", "Accept-Ranges": "bytes", "Vary": "Accept-Encoding", "Server": "ECS (rhv/818F)", "Last-Modified": "Fri, 09 Aug 2013 23:54:35 GMT", "Cache-Control": "max-age=604800", "Content-Length": "606", "X-Cache": "HIT"}
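
The two rows above carry the same body_SHA256 value, which suggests the archived page did not change between the two runs. The template's own code is not reproduced on this page, so the following is only a minimal sketch of how a row with these columns might be built and compared across runs. It uses the standard-library sqlite3 module in place of the scraperwiki library seen in the build log. The function names, the table name `data`, the example URL, and the choice to hash the UTF-8 encoded body text are all assumptions made for illustration.

```python
import hashlib
import json
import sqlite3
from datetime import datetime

# Column names copied from the table header above.
COLUMNS = ("run_id", "url", "UUID", "timestamp",
           "body_content", "body_SHA256", "headers")


def make_record(run_id, url, uuid, body, headers):
    """Build one row as a dict (hypothetical helper).

    Assumption: the hash is taken over the UTF-8 encoded body text.
    """
    return {
        "run_id": run_id,
        "url": url,
        "UUID": uuid,
        "timestamp": str(datetime.now()),
        "body_content": body,
        "body_SHA256": hashlib.sha256(body.encode("utf-8")).hexdigest(),
        "headers": json.dumps(dict(headers)),
    }


def save_record(conn, record):
    """Append the row to a table named `data` (assumed name)."""
    cols = ", ".join('"%s"' % c for c in COLUMNS)
    marks = ", ".join("?" for _ in COLUMNS)
    conn.execute("CREATE TABLE IF NOT EXISTS data (%s)" % cols)
    conn.execute("INSERT INTO data (%s) VALUES (%s)" % (cols, marks),
                 [record[c] for c in COLUMNS])
    conn.commit()


def body_unchanged(conn):
    """Return True if the two most recent runs stored the same body hash."""
    rows = conn.execute(
        'SELECT "body_SHA256" FROM data ORDER BY "run_id" DESC LIMIT 2'
    ).fetchall()
    return len(rows) == 2 and rows[0][0] == rows[1][0]


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    page = "<!doctype html> <html> ... </html>"  # placeholder body
    for run_id in (1, 2):
        save_record(conn, make_record(run_id, "http://example.com/", "0000",
                                      page, {"Content-Type": "text/html"}))
    print(body_unchanged(conn))
```

Storing the hash next to the full body lets a later run detect a change with a short string comparison instead of comparing whole documents.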

Statistics

Average successful run time: 3 minutes

Total run time: 5 minutes

Total cpu time used: less than 5 seconds

Total disk space used: 30.9 KB

History

  • Manually ran revision c014fad7 and completed successfully.
    1 record added, 1 record updated in the database
  • Manually ran revision c014fad7 and completed successfully.
    2 records added in the database
  • Created on morph.io

Scraper code

archivers-template