everypolitician-scrapers / mexico-diputados

Scrapes github.com and sitl.diputados.gob.mx

Build software better, together.


This is a scraper that runs on Morph. To get started see the documentation

Contributors tmtmtmtm ondenman

Last run completed successfully .

Console output of last run

Injecting configuration and compiling...  -----> Ruby app detected -----> Compiling Ruby -----> Using Ruby version: ruby-2.3.3 -----> Installing dependencies using bundler 1.15.2  Running: bundle install --without development:test --path vendor/bundle --binstubs vendor/bundle/bin -j4 --deployment  Fetching gem metadata from https://rubygems.org/.........  Fetching version metadata from https://rubygems.org/.  Fetching https://github.com/everypolitician/scraped.git  Fetching https://github.com/everypolitician/scraped_page_archive.git  Fetching https://github.com/everypolitician/scraper_test.git  Fetching https://github.com/openaustralia/scraperwiki-ruby.git  Fetching https://github.com/everypolitician/table_unspanner.git  Fetching rake 12.0.0  Fetching public_suffix 2.0.5  Fetching ast 2.3.0  Installing ast 2.3.0  Installing rake 12.0.0  Installing public_suffix 2.0.5  Using bundler 1.15.2  Fetching coderay 1.1.1  Installing coderay 1.1.1  Fetching safe_yaml 1.0.4  Installing safe_yaml 1.0.4  Fetching unf_ext 0.0.7.2  Fetching field_serializer 0.3.0  Fetching git 1.3.0  Installing unf_ext 0.0.7.2 with native extensions  Installing git 1.3.0  Installing field_serializer 0.3.0  Fetching hashdiff 0.3.2  Fetching httpclient 2.6.0.1  Installing hashdiff 0.3.2  Fetching method_source 0.8.2  Installing method_source 0.8.2  Fetching mime-types-data 3.2016.0521  Installing httpclient 2.6.0.1  Installing mime-types-data 3.2016.0521  Fetching mini_portile2 2.1.0  Installing mini_portile2 2.1.0  Fetching minitest 5.10.1  Installing minitest 5.10.1  Fetching vcr 3.0.3  Fetching netrc 0.11.0  Installing netrc 0.11.0  Fetching open-uri-cached 0.0.5  Installing vcr 3.0.3  Installing open-uri-cached 0.0.5  Fetching powerpack 0.1.1  Installing powerpack 0.1.1  Fetching slop 3.6.0  Fetching require_all 1.4.0  Installing slop 3.6.0  Installing require_all 1.4.0  Fetching ruby-progressbar 1.8.1  Installing ruby-progressbar 1.8.1  Fetching unicode-display_width 1.2.1  Fetching sqlite3 1.3.10  Installing unicode-display_width 1.2.1  Fetching parser 2.4.0.0  Installing sqlite3 1.3.10 with native extensions  Installing parser 2.4.0.0  Fetching addressable 2.5.1  Installing addressable 2.5.1  Fetching rainbow 2.2.2  Installing rainbow 2.2.2 with native extensions  Fetching crack 0.4.3  Installing crack 0.4.3  Fetching mime-types 3.1  Installing mime-types 3.1  Fetching nokogiri 1.7.1  Installing nokogiri 1.7.1 with native extensions  Fetching minispec-metadata 2.0.0  Installing minispec-metadata 2.0.0  Fetching minitest-around 0.4.0  Installing minitest-around 0.4.0  Fetching pry 0.10.4  Installing pry 0.10.4  Fetching unf 0.1.4  Installing unf 0.1.4  Fetching webmock 2.0.3  Installing webmock 2.0.3  Fetching sqlite_magic 0.0.3  Installing sqlite_magic 0.0.3  Fetching minitest-vcr 1.4.0  Fetching domain_name 0.5.20161129  Installing minitest-vcr 1.4.0  Fetching vcr-archive 0.3.0  Installing domain_name 0.5.20161129  Using scraper_test 0.1.0 from https://github.com/everypolitician/scraper_test.git (at master@a8a6c79)  Installing vcr-archive 0.3.0  Fetching rubocop 0.48.1  Installing rubocop 0.48.1  Using scraperwiki 3.0.1 from https://github.com/openaustralia/scraperwiki-ruby.git (at morph_defaults@fc50176)  Fetching http-cookie 1.0.3  Using scraped_page_archive 0.5.0 from https://github.com/everypolitician/scraped_page_archive.git (at master@28f93d7)  Using scraped 0.6.0 from https://github.com/everypolitician/scraped.git (at master@23a9620)  Using table_unspanner 0.1.0 from https://github.com/everypolitician/table_unspanner.git (at master@a70a98a)  Installing http-cookie 1.0.3  Fetching rest-client 2.0.0  Installing rest-client 2.0.0  Bundle complete! 16 Gemfile dependencies, 47 gems now installed.  Gems in the groups development and test were not installed.  Bundled gems are installed into ./vendor/bundle.  Post-install message from webmock:  WebMock 2.0 has some breaking changes. Please check the CHANGELOG: https://goo.gl/piDGLu  Bundle completed (27.52s)  Cleaning up the bundler cache. -----> Detecting rake tasks   -----> Discovering process types  Procfile declares types -> scraper Injecting scraper and running... 50 50 50 50 100 100 100 100 150 150 150 150 200 250 200 250 200 250 200 250 300 350 400 300 350 400 300 350 400 300 350 400 450 500

Data

Downloaded 598 times by everypolitician octopusinvitro

To download data sign in with GitHub

Download table (as CSV) Download SQLite database (208 KB) Use the API

rows 10 / 500

id sort_name party party_id area_id area term name image email source
430
Aceves y del Olmo Carlos Humberto
Partido Revolucionario Institucional
PRI
ocd-division/country:mx/entidad:ciudad-de-méxico/circunscripción:4
Ciudad de México 4
62
Carlos Humberto Aceves y del Olmo
carlos.aceves@congreso.gob.mx
23
Aguayo López Miguel Ángel
Partido Revolucionario Institucional
PRI
ocd-division/country:mx/entidad:colima/distrito:1
Colima 1
62
Miguel Ángel Aguayo López
miguel.aguayo@congreso.gob.mx
114
Alcalá Padilla Leobardo
Partido Revolucionario Institucional
PRI
ocd-division/country:mx/entidad:jalisco/distrito:8
Jalisco 8
62
Leobardo Alcalá Padilla
leobardo.alcala@congreso.gob.mx
394
Aldana Prieto Luis Ricardo
Partido Revolucionario Institucional
PRI
ocd-division/country:mx/entidad:veracruz/circunscripción:3
Veracruz 3
62
Luis Ricardo Aldana Prieto
luis.aldana@congreso.gob.mx
216
Allende Cano Ana Isabel
Partido Revolucionario Institucional
PRI
ocd-division/country:mx/entidad:puebla/distrito:8
Puebla 8
62
Ana Isabel Allende Cano
ana.allende@congreso.gob.mx
892
Alonso Álvarez Celestino Manuel
Partido Revolucionario Institucional
PRI
ocd-division/country:mx/entidad:oaxaca/circunscripción:3
Oaxaca 3
62
Celestino Manuel Alonso Álvarez
celestino.alonso@congreso.gob.mx
136
Alvarado Sánchez Brenda María Izontli
Partido Revolucionario Institucional
PRI
ocd-division/country:mx/entidad:méxico/distrito:11
México 11
62
Brenda María Izontli Alvarado Sánchez
brenda.alvarado@congreso.gob.mx
477
Anaya Gudiño Alfredo
Partido Revolucionario Institucional
PRI
ocd-division/country:mx/entidad:michoacán/circunscripción:5
Michoacán 5
62
Alfredo Anaya Gudiño
alfredo.anaya@congreso.gob.mx
361
Araujo de la Torre Elsa Patricia
Partido Revolucionario Institucional
PRI
ocd-division/country:mx/entidad:tamaulipas/circunscripción:2
Tamaulipas 2
62
Elsa Patricia Araujo de la Torre
patricia.araujo@congreso.gob.mx
110
Arellano Guzmán Salvador
Partido Revolucionario Institucional
PRI
ocd-division/country:mx/entidad:jalisco/distrito:4
Jalisco 4
62
Salvador Arellano Guzmán
salvador.arellano@congreso.gob.mx

Statistics

Average successful run time: about 4 hours

Total run time: 2 months

Total cpu time used: about 1 month

Total disk space used: 191 MB

History

  • Auto ran revision 509d03e7 and completed successfully .
    nothing changed in the database
    2006 pages scraped
  • Auto ran revision 509d03e7 and completed successfully .
    nothing changed in the database
    2006 pages scraped
  • Auto ran revision 509d03e7 and completed successfully .
    3 records added in the database
    2006 pages scraped
  • Auto ran revision 509d03e7 and failed .
    281 records added in the database
    1995 pages scraped
  • Auto ran revision 509d03e7 and failed .
    nothing changed in the database
  • ...
  • Created on morph.io

Show complete history

Scraper code

Ruby

mexico-diputados / scraper.rb