masterofpun / r_datasets

Data of all posts on r/datasets


Contributors masterofpun

Last run failed with status code 1.

Console output of last run

Injecting configuration and compiling...
-----> Python app detected
-----> Installing python-3.5.1
       $ pip install -r requirements.txt
       Collecting requests==2.10.0 (from -r /tmp/build/requirements.txt (line 8))
       Downloading requests-2.10.0-py2.py3-none-any.whl (506kB)
       Installing collected packages: requests
       Successfully installed requests-2.10.0

-----> Discovering process types
       Procfile declares types -> scraper
Injecting scraper and running...
('51k3y8',)
Traceback (most recent call last):
  File "/app/.heroku/python/lib/python3.5/site-packages/requests/packages/urllib3/connection.py", line 142, in _new_conn
    (self.host, self.port), self.timeout, **extra_kw)
  File "/app/.heroku/python/lib/python3.5/site-packages/requests/packages/urllib3/util/connection.py", line 91, in create_connection
    raise err
  File "/app/.heroku/python/lib/python3.5/site-packages/requests/packages/urllib3/util/connection.py", line 81, in create_connection
    sock.connect(sa)
ConnectionRefusedError: [Errno 111] Connection refused

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/app/.heroku/python/lib/python3.5/site-packages/requests/packages/urllib3/connectionpool.py", line 578, in urlopen
    chunked=chunked)
  File "/app/.heroku/python/lib/python3.5/site-packages/requests/packages/urllib3/connectionpool.py", line 351, in _make_request
    self._validate_conn(conn)
  File "/app/.heroku/python/lib/python3.5/site-packages/requests/packages/urllib3/connectionpool.py", line 814, in _validate_conn
    conn.connect()
  File "/app/.heroku/python/lib/python3.5/site-packages/requests/packages/urllib3/connection.py", line 254, in connect
    conn = self._new_conn()
  File "/app/.heroku/python/lib/python3.5/site-packages/requests/packages/urllib3/connection.py", line 151, in _new_conn
    self, "Failed to establish a new connection: %s" % e)
requests.packages.urllib3.exceptions.NewConnectionError: <requests.packages.urllib3.connection.VerifiedHTTPSConnection object at 0x7f6c230dda90>: Failed to establish a new connection: [Errno 111] Connection refused

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/app/.heroku/python/lib/python3.5/site-packages/requests/adapters.py", line 403, in send
    timeout=timeout
  File "/app/.heroku/python/lib/python3.5/site-packages/requests/packages/urllib3/connectionpool.py", line 623, in urlopen
    _stacktrace=sys.exc_info()[2])
  File "/app/.heroku/python/lib/python3.5/site-packages/requests/packages/urllib3/util/retry.py", line 281, in increment
    raise MaxRetryError(_pool, url, error or ResponseError(cause))
requests.packages.urllib3.exceptions.MaxRetryError: HTTPSConnectionPool(host='www.reddit.com', port=443): Max retries exceeded with url: /r/datasets/new/.json?before=t3_51k3y8 (Caused by NewConnectionError('<requests.packages.urllib3.connection.VerifiedHTTPSConnection object at 0x7f6c230dda90>: Failed to establish a new connection: [Errno 111] Connection refused',))

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "scraper.py", line 38, in <module>
    newData = json.loads(req.get(reddit_url+_id, headers=headers).text)['data']['children']
  File "/app/.heroku/python/lib/python3.5/site-packages/requests/sessions.py", line 487, in get
    return self.request('GET', url, **kwargs)
  File "/app/.heroku/python/lib/python3.5/site-packages/requests/sessions.py", line 475, in request
    resp = self.send(prep, **send_kwargs)
  File "/app/.heroku/python/lib/python3.5/site-packages/requests/sessions.py", line 585, in send
    r = adapter.send(request, **kwargs)
  File "/app/.heroku/python/lib/python3.5/site-packages/requests/adapters.py", line 467, in send
    raise ConnectionError(e, request=request)
requests.exceptions.ConnectionError: HTTPSConnectionPool(host='www.reddit.com', port=443): Max retries exceeded with url: /r/datasets/new/.json?before=t3_51k3y8 (Caused by NewConnectionError('<requests.packages.urllib3.connection.VerifiedHTTPSConnection object at 0x7f6c230dda90>: Failed to establish a new connection: [Errno 111] Connection refused',))
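
The run above died on an unhandled connection error at line 38 of scraper.py. One possible way to make that request more tolerant is to retry it with a growing delay. The following is only a sketch under assumptions, not the scraper's actual code: `fetch_children`, `get_text`, and the retry parameters are hypothetical names introduced here for illustration.

```python
import json
import time


def fetch_children(get_text, url, retries=3, delay=1.0, sleep=time.sleep):
    """Return the list at ['data']['children'], retrying on connection errors.

    get_text is a hypothetical callable that takes a URL and returns the
    response body as a string (for example, a thin wrapper around a
    requests session's get method).
    """
    for attempt in range(1, retries + 1):
        try:
            return json.loads(get_text(url))['data']['children']
        except OSError:
            # requests.exceptions.ConnectionError derives from OSError,
            # as does the built-in ConnectionRefusedError seen in the log.
            if attempt == retries:
                raise  # give up after the last attempt
            # Exponential backoff: delay, 2*delay, 4*delay, ...
            sleep(delay * 2 ** (attempt - 1))
```

With the names visible in the traceback, a call might look like `fetch_children(lambda u: req.get(u, headers=headers).text, reddit_url + _id)`. Retrying does not help if the remote host keeps refusing connections, so the final error is still re-raised.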

Data

Downloaded 14 times by masterofpun, franc00018, whenamanlies, nhrs, marcschonwandt, talperetz, guyscriven


Download table (as CSV) Download SQLite database (2.21 MB) Use the API

rows 10 / 4079

| created_utc | author | domain | num_comments | score | title | id | is_self | url | selftext |
|---|---|---|---|---|---|---|---|---|---|
| 1255017488 | antitheftdevice | nyc.gov | 0 | 5 | New York City Data Mine - 170 datasets from 30 agencies | 9s3dg | false | | |
| 1255017624 | antitheftdevice | ratedata.gaincapital.com | 0 | 3 | Forex Historic Rate Data from 2000 to present | 9s3em | false | | |
| 1255044577 | antitheftdevice | free-zipcodes.com | 1 | 4 | Zip Code Database | 9s7vp | false | | |
| 1255092512 | antitheftdevice | cs.cmu.edu | 0 | 7 | Enron Email Dataset | 9sej9 | false | | |
| 1255353085 | antitheftdevice | archive.ics.uci.edu | 0 | 4 | Poker Hand Data Set | 9t7yp | false | | |
| 1270446815 | draicone | grouplens.org | 0 | 6 | MovieLens Data Sets | bmiqi | false | | |
| 1270448213 | draicone | blog.orite.com.au | 0 | 3 | Australian postcodes with geocoding | bmiyt | false | | |
| 1270449054 | draicone | musicbrainz.org | 0 | 17 | MusicBrainz database - artists, releases, tracks, labels, incl. relationships | bmj4p | false | | |
| 1270449675 | draicone | guardian.co.uk | 0 | 5 | The Guardian's data store - large collection of datasets on topics like banks, UK elections, government borrowing, primary school league tables... | bmj8u | false | | |
| 1270450644 | voltagex | data.gov | 0 | 20 | US Data.gov datasets | bmjfd | false | | |
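
As a rough illustration of working with these columns, the sketch below loads two of the rows shown in this section into an in-memory SQLite database, sorts them by score, and converts `created_utc` from Unix seconds to an ISO 8601 UTC timestamp. The table name `data`, the integer storage of `is_self`, and the helper `top_posts` are assumptions made for this example; the downloaded database may be laid out differently.

```python
import sqlite3
from datetime import datetime, timezone

# Two rows copied from the data preview; url and selftext are omitted here.
rows = [
    (1255017488, 'antitheftdevice', 'nyc.gov', 0, 5,
     'New York City Data Mine - 170 datasets from 30 agencies', '9s3dg', 0),
    (1270450644, 'voltagex', 'data.gov', 0, 20,
     'US Data.gov datasets', 'bmjfd', 0),
]

# In-memory stand-in for the downloaded database file.
# The table name "data" is an assumption, not confirmed by the page.
conn = sqlite3.connect(':memory:')
conn.execute(
    'CREATE TABLE data (created_utc INTEGER, author TEXT, domain TEXT, '
    'num_comments INTEGER, score INTEGER, title TEXT, id TEXT, is_self INTEGER)'
)
conn.executemany('INSERT INTO data VALUES (?, ?, ?, ?, ?, ?, ?, ?)', rows)


def top_posts(conn, limit=5):
    """Return (ISO timestamp, score, title) tuples, highest score first."""
    cur = conn.execute(
        'SELECT created_utc, score, title FROM data '
        'ORDER BY score DESC LIMIT ?', (limit,))
    return [
        (datetime.fromtimestamp(ts, tz=timezone.utc).isoformat(), score, title)
        for ts, score, title in cur
    ]


for stamp, score, title in top_posts(conn):
    print(stamp, score, title)
```

To try this against the real download, `sqlite3.connect(':memory:')` would be replaced with the path to the downloaded file and the `CREATE TABLE` and `INSERT` statements dropped.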

Statistics

Average successful run time: less than a minute

Total run time: 9 days

Total cpu time used: 3 minutes

Total disk space used: 2.24 MB

History

  • Auto ran revision 8419e722 and failed.
    nothing changed in the database
  • Auto ran revision 8419e722 and completed successfully.
    nothing changed in the database
  • Auto ran revision 8419e722 and completed successfully.
    nothing changed in the database
  • Auto ran revision 8419e722 and completed successfully.
    nothing changed in the database
  • Auto ran revision 8419e722 and completed successfully.
    nothing changed in the database
  • ...
  • Created on morph.io


Scraper code

r_datasets