everypolitician-scrapers / mexico-diputados

Scrapes github.com and sitl.diputados.gob.mx

Build software better, together.


This is a scraper that runs on Morph. To get started see the documentation

Contributors tmtmtmtm ondenman

Last run completed successfully .

Console output of last run

Injecting configuration and compiling...  -----> Ruby app detected -----> Compiling Ruby -----> Using Ruby version: ruby-2.3.3 -----> Installing dependencies using bundler 1.15.2  Running: bundle install --without development:test --path vendor/bundle --binstubs vendor/bundle/bin -j4 --deployment  Fetching gem metadata from https://rubygems.org/.........  Fetching version metadata from https://rubygems.org/.  Fetching https://github.com/everypolitician/scraped.git  Fetching https://github.com/everypolitician/scraped_page_archive.git  Fetching https://github.com/everypolitician/scraper_test.git  Fetching https://github.com/openaustralia/scraperwiki-ruby.git  Fetching https://github.com/everypolitician/table_unspanner.git  Fetching rake 12.0.0  Fetching public_suffix 2.0.5  Fetching ast 2.3.0  Installing rake 12.0.0  Installing public_suffix 2.0.5  Installing ast 2.3.0  Using bundler 1.15.2  Fetching coderay 1.1.1  Fetching safe_yaml 1.0.4  Fetching unf_ext 0.0.7.2  Installing coderay 1.1.1  Installing safe_yaml 1.0.4  Installing unf_ext 0.0.7.2 with native extensions  Fetching field_serializer 0.3.0  Installing field_serializer 0.3.0  Fetching git 1.3.0  Fetching hashdiff 0.3.2  Installing git 1.3.0  Installing hashdiff 0.3.2  Fetching httpclient 2.6.0.1  Fetching method_source 0.8.2  Installing method_source 0.8.2  Fetching mime-types-data 3.2016.0521  Installing mime-types-data 3.2016.0521  Installing httpclient 2.6.0.1  Fetching mini_portile2 2.1.0  Fetching minitest 5.10.1  Installing mini_portile2 2.1.0  Installing minitest 5.10.1  Fetching vcr 3.0.3  Fetching netrc 0.11.0  Installing netrc 0.11.0  Fetching open-uri-cached 0.0.5  Installing vcr 3.0.3  Installing open-uri-cached 0.0.5  Fetching powerpack 0.1.1  Installing powerpack 0.1.1  Fetching slop 3.6.0  Fetching require_all 1.4.0  Installing slop 3.6.0  Installing require_all 1.4.0  Fetching ruby-progressbar 1.8.1  Fetching unicode-display_width 1.2.1  Installing unicode-display_width 1.2.1  Installing ruby-progressbar 1.8.1  Fetching sqlite3 1.3.10  Fetching parser 2.4.0.0  Installing sqlite3 1.3.10 with native extensions  Installing parser 2.4.0.0  Fetching addressable 2.5.1  Installing addressable 2.5.1  Fetching rainbow 2.2.2  Installing rainbow 2.2.2 with native extensions  Fetching crack 0.4.3  Installing crack 0.4.3  Fetching mime-types 3.1  Installing mime-types 3.1  Fetching nokogiri 1.7.1  Installing nokogiri 1.7.1 with native extensions  Fetching minispec-metadata 2.0.0  Installing minispec-metadata 2.0.0  Fetching minitest-around 0.4.0  Installing minitest-around 0.4.0  Fetching pry 0.10.4  Installing pry 0.10.4  Fetching unf 0.1.4  Installing unf 0.1.4  Fetching webmock 2.0.3  Installing webmock 2.0.3  Fetching sqlite_magic 0.0.3  Fetching minitest-vcr 1.4.0  Installing sqlite_magic 0.0.3  Installing minitest-vcr 1.4.0  Fetching rubocop 0.48.1  Fetching domain_name 0.5.20161129  Installing domain_name 0.5.20161129  Installing rubocop 0.48.1  Fetching vcr-archive 0.3.0  Installing vcr-archive 0.3.0  Using scraper_test 0.1.0 from https://github.com/everypolitician/scraper_test.git (at master@a8a6c79)  Using scraperwiki 3.0.1 from https://github.com/openaustralia/scraperwiki-ruby.git (at morph_defaults@fc50176)  Fetching http-cookie 1.0.3  Using scraped_page_archive 0.5.0 from https://github.com/everypolitician/scraped_page_archive.git (at master@28f93d7)  Installing http-cookie 1.0.3  Fetching rest-client 2.0.0  Using scraped 0.6.0 from https://github.com/everypolitician/scraped.git (at master@23a9620)  Using table_unspanner 0.1.0 from https://github.com/everypolitician/table_unspanner.git (at master@a70a98a)  Installing rest-client 2.0.0  Bundle complete! 16 Gemfile dependencies, 47 gems now installed.  Gems in the groups development and test were not installed.  Bundled gems are installed into ./vendor/bundle.  Post-install message from webmock:  WebMock 2.0 has some breaking changes. Please check the CHANGELOG: https://goo.gl/piDGLu  Bundle completed (45.36s)  Cleaning up the bundler cache. -----> Detecting rake tasks   -----> Discovering process types  Procfile declares types -> scraper Injecting scraper and running... 50 50 50 100 100 150 150 150 150 200 250 200 250 200 250 200 250 300 350 400 300 350 400 300 350 400 300 350 400 450 500

Data

Downloaded 748 times by everypolitician octopusinvitro

To download data sign in with GitHub

Download table (as CSV) Download SQLite database (208 KB) Use the API

rows 10 / 500

id sort_name party party_id area_id area term name image email source
430
Aceves y del Olmo Carlos Humberto
Partido Revolucionario Institucional
PRI
ocd-division/country:mx/entidad:ciudad-de-méxico/circunscripción:4
Ciudad de México 4
62
Carlos Humberto Aceves y del Olmo
carlos.aceves@congreso.gob.mx
23
Aguayo López Miguel Ángel
Partido Revolucionario Institucional
PRI
ocd-division/country:mx/entidad:colima/distrito:1
Colima 1
62
Miguel Ángel Aguayo López
miguel.aguayo@congreso.gob.mx
114
Alcalá Padilla Leobardo
Partido Revolucionario Institucional
PRI
ocd-division/country:mx/entidad:jalisco/distrito:8
Jalisco 8
62
Leobardo Alcalá Padilla
leobardo.alcala@congreso.gob.mx
394
Aldana Prieto Luis Ricardo
Partido Revolucionario Institucional
PRI
ocd-division/country:mx/entidad:veracruz/circunscripción:3
Veracruz 3
62
Luis Ricardo Aldana Prieto
luis.aldana@congreso.gob.mx
216
Allende Cano Ana Isabel
Partido Revolucionario Institucional
PRI
ocd-division/country:mx/entidad:puebla/distrito:8
Puebla 8
62
Ana Isabel Allende Cano
ana.allende@congreso.gob.mx
892
Alonso Álvarez Celestino Manuel
Partido Revolucionario Institucional
PRI
ocd-division/country:mx/entidad:oaxaca/circunscripción:3
Oaxaca 3
62
Celestino Manuel Alonso Álvarez
celestino.alonso@congreso.gob.mx
136
Alvarado Sánchez Brenda María Izontli
Partido Revolucionario Institucional
PRI
ocd-division/country:mx/entidad:méxico/distrito:11
México 11
62
Brenda María Izontli Alvarado Sánchez
brenda.alvarado@congreso.gob.mx
477
Anaya Gudiño Alfredo
Partido Revolucionario Institucional
PRI
ocd-division/country:mx/entidad:michoacán/circunscripción:5
Michoacán 5
62
Alfredo Anaya Gudiño
alfredo.anaya@congreso.gob.mx
361
Araujo de la Torre Elsa Patricia
Partido Revolucionario Institucional
PRI
ocd-division/country:mx/entidad:tamaulipas/circunscripción:2
Tamaulipas 2
62
Elsa Patricia Araujo de la Torre
patricia.araujo@congreso.gob.mx
110
Arellano Guzmán Salvador
Partido Revolucionario Institucional
PRI
ocd-division/country:mx/entidad:jalisco/distrito:4
Jalisco 4
62
Salvador Arellano Guzmán
salvador.arellano@congreso.gob.mx

Statistics

Average successful run time: about 4 hours

Total run time: 4 months

Total cpu time used: about 1 month

Total disk space used: 230 MB

History

  • Auto ran revision 509d03e7 and completed successfully .
    475 records added in the database
    2006 pages scraped
  • Auto ran revision 509d03e7 and failed .
    475 records removed in the database
    110 pages scraped
  • Auto ran revision 509d03e7 and completed successfully .
    348 records added in the database
    2006 pages scraped
  • Auto ran revision 509d03e7 and failed .
    348 records removed in the database
    614 pages scraped
  • Auto ran revision 509d03e7 and completed successfully .
    nothing changed in the database
    2006 pages scraped
  • ...
  • Created on morph.io

Show complete history

Scraper code

Ruby

mexico-diputados / scraper.rb