soit-sk / slovakia_post_codes

Scraper for slovak mail service post codes.

Scrapes www.posta.sk

Slovenská pošta


This is a scraper that runs on Morph. To get started see the documentation

This scraper mines streets, towns and their post codes from the Slovak mail service.

Contributors mnagy hanecak j91321 katkad

Last run completed successfully .

Console output of last run

Injecting configuration and compiling...  -----> Python app detected  ! The latest version of Python 2 is python-2.7.14 (you are using python-2.7.12, which is unsupported).  ! We recommend upgrading by specifying the latest version (python-2.7.14).  Learn More: https://devcenter.heroku.com/articles/python-runtimes -----> Installing python-2.7.12 -----> Installing pip -----> Installing requirements with pip  Obtaining scraperwiki from git+http://github.com/openaustralia/scraperwiki-python.git@morph_defaults#egg=scraperwiki (from -r /tmp/build/requirements.txt (line 2))  Cloning http://github.com/openaustralia/scraperwiki-python.git (to revision morph_defaults) to /app/.heroku/src/scraperwiki  Switched to a new branch 'morph_defaults'  Branch morph_defaults set up to track remote branch morph_defaults from origin.  Collecting xlrd==0.9.2 (from -r /tmp/build/requirements.txt (line 3))  Downloading https://files.pythonhosted.org/packages/31/69/cc9aa897cf25e8ace34e36144d95cfbe4ce0837b41b1ed980b77997f5b95/xlrd-0.9.2.tar.gz (167kB)  Collecting dumptruck>=0.1.2 (from scraperwiki->-r /tmp/build/requirements.txt (line 2))  Downloading https://files.pythonhosted.org/packages/15/27/3330a343de80d6849545b6c7723f8c9a08b4b104de964ac366e7e6b318df/dumptruck-0.1.6.tar.gz  Collecting requests (from scraperwiki->-r /tmp/build/requirements.txt (line 2))  Downloading https://files.pythonhosted.org/packages/ff/17/5cbb026005115301a8fb2f9b0e3e8d32313142fe8b617070e7baad20554f/requests-2.20.1-py2.py3-none-any.whl (57kB)  Collecting idna<2.8,>=2.5 (from requests->scraperwiki->-r /tmp/build/requirements.txt (line 2))  Downloading https://files.pythonhosted.org/packages/4b/2a/0276479a4b3caeb8a8c1af2f8e4355746a97fab05a372e4a2c6a6b876165/idna-2.7-py2.py3-none-any.whl (58kB)  Collecting certifi>=2017.4.17 (from requests->scraperwiki->-r /tmp/build/requirements.txt (line 2))  Downloading https://files.pythonhosted.org/packages/56/9d/1d02dd80bc4cd955f98980f28c5ee2200e1209292d5f9e9cc8d030d18655/certifi-2018.10.15-py2.py3-none-any.whl (146kB)  Collecting urllib3<1.25,>=1.21.1 (from requests->scraperwiki->-r /tmp/build/requirements.txt (line 2))  Downloading https://files.pythonhosted.org/packages/62/00/ee1d7de624db8ba7090d1226aebefab96a2c71cd5cfa7629d6ad3f61b79e/urllib3-1.24.1-py2.py3-none-any.whl (118kB)  Collecting chardet<3.1.0,>=3.0.2 (from requests->scraperwiki->-r /tmp/build/requirements.txt (line 2))  Downloading https://files.pythonhosted.org/packages/bc/a9/01ffebfb562e4274b6487b4bb1ddec7ca55ec7510b22e4c51f14098443b8/chardet-3.0.4-py2.py3-none-any.whl (133kB)  Installing collected packages: dumptruck, idna, certifi, urllib3, chardet, requests, scraperwiki, xlrd  Running setup.py install for dumptruck: started  Running setup.py install for dumptruck: finished with status 'done'  Running setup.py develop for scraperwiki  Running setup.py install for xlrd: started  Running setup.py install for xlrd: finished with status 'done'  Successfully installed certifi-2018.10.15 chardet-3.0.4 dumptruck-0.1.6 idna-2.7 requests-2.20.1 scraperwiki urllib3-1.24.1 xlrd-0.9.2   -----> Discovering process types  Procfile declares types -> scraper Injecting scraper and running...

Data

Downloaded 7 times by lubosvanta usamec melezhik

To download data sign in with GitHub

Download table (as CSV) Download SQLite database (851 KB) Use the API

rows 10 / 3962

okres kraj psc obec
Košice III
KI
Dargovských Hrdinov
Košice IV
KI
Nad Jazerom
Liptovský Mikuláš
ZI
031 01
Andice
Žilina
ZI
010 04
Bánová
Liptovský Mikuláš
ZI
031 01
Beňušovce
Ružomberok
ZI
034 03
Biely Potok
Liptovský Mikuláš
ZI
031 01
Bodice
Komárno
NI
947 03
Bohatá
Žilina
ZI
010 14
Brodno
Žilina
ZI
010 03
Budatín

To download data sign in with GitHub

Download table (as CSV) Download SQLite database (851 KB) Use the API

rows 10 / 6572

psc ulica obec
040 01
Bankov - baňa
Košice
040 01
Čermeľská
Košice
040 01
Dominikánske nám.
Košice
040 01
Festivalové nám.
Košice
040 01
Hájnícká
Košice
040 01
Heringeš
Košice
040 01
Horný Bankov
Košice
040 01
Južné nábr.
Košice
040 01
Kasárenské nám.
Košice
040 01
Kavečany
Košice

Statistics

Average successful run time: 1 minute

Total run time: 3 days

Total cpu time used: about 6 hours

Total disk space used: 877 KB

History

  • Auto ran revision e1d2e8a7 and completed successfully .
    9651 records added, 9651 records removed in the database
    2 pages scraped
  • Auto ran revision e1d2e8a7 and completed successfully .
    9651 records added, 9651 records removed in the database
    2 pages scraped
  • Auto ran revision e1d2e8a7 and completed successfully .
    9651 records added, 9651 records removed in the database
  • Auto ran revision e1d2e8a7 and completed successfully .
    9651 records added, 9651 records removed in the database
    2 pages scraped
  • Auto ran revision e1d2e8a7 and completed successfully .
    9651 records added, 9651 records removed in the database
    2 pages scraped
  • ...
  • Created on morph.io

Show complete history

Scraper code

Python

slovakia_post_codes / scraper.py