tmtmtmtm / poland-sejm-wikipedia

Scrapes pl.wikipedia.org

Wikipedia, wolna encyklopedia


This is a scraper that runs on Morph. To get started see the documentation

Contributors tmtmtmtm ondenman

Last run failed with status code 1.

Console output of last run

Injecting configuration and compiling...  -----> Ruby app detected -----> Compiling Ruby -----> Using Ruby version: ruby-2.0.0 -----> Installing dependencies using bundler 1.15.2  Running: bundle install --without development:test --path vendor/bundle --binstubs vendor/bundle/bin -j4 --deployment  Fetching gem metadata from https://rubygems.org/........  Fetching version metadata from https://rubygems.org/..  Fetching dependency metadata from https://rubygems.org/.  Fetching https://github.com/openaustralia/scraperwiki-ruby.git  Rubygems 2.0.14.1 is not threadsafe, so your gems will be installed one at a time. Upgrade to Rubygems 2.1.0 or higher to enable parallel gem installation.  Using bundler 1.15.2  Fetching coderay 1.1.0  Installing coderay 1.1.0  Fetching colorize 0.7.7  Installing colorize 0.7.7  Fetching unf_ext 0.0.7.1  Installing unf_ext 0.0.7.1 with native extensions  Fetching excon 0.45.4  Installing excon 0.45.4  Fetching execjs 2.5.2  Installing execjs 2.5.2  Fetching multipart-post 2.0.0  Installing multipart-post 2.0.0  Fetching fuzzy_match 2.1.0  Installing fuzzy_match 2.1.0  Fetching hashie 3.4.2  Installing hashie 3.4.2  Fetching httpclient 2.6.0.1  Installing httpclient 2.6.0.1  Fetching method_source 0.8.2  Installing method_source 0.8.2  Fetching mini_portile 0.6.2  Installing mini_portile 0.6.2  Fetching open-uri-cached 0.0.5  Installing open-uri-cached 0.0.5  Fetching slop 3.6.0  Installing slop 3.6.0  Fetching sqlite3 1.3.10  Installing sqlite3 1.3.10 with native extensions  Fetching unf 0.1.4  Installing unf 0.1.4  Fetching faraday 0.9.1  Installing faraday 0.9.1  Fetching nokogiri 1.6.6.2  Installing nokogiri 1.6.6.2 with native extensions  Fetching pry 0.10.1  Installing pry 0.10.1  Fetching sqlite_magic 0.0.3  Installing sqlite_magic 0.0.3  Fetching domain_name 0.5.24  Installing domain_name 0.5.24  Fetching faraday_middleware 0.10.0  Installing faraday_middleware 0.10.0  Using scraperwiki 3.0.1 from https://github.com/openaustralia/scraperwiki-ruby.git (at morph_defaults@fc50176)  Fetching http-cookie 1.0.2  Installing http-cookie 1.0.2  Fetching wikidata-client 0.0.7  Installing wikidata-client 0.0.7  Fetching faraday-cookie_jar 0.0.6  Installing faraday-cookie_jar 0.0.6  Fetching mediawiki_api 0.4.1  Installing mediawiki_api 0.4.1  Bundle complete! 9 Gemfile dependencies, 27 gems now installed.  Gems in the groups development and test were not installed.  Bundled gems are installed into ./vendor/bundle.  Bundle completed (24.88s)  Cleaning up the bundler cache. -----> Installing node-v6.11.1-linux-x64 -----> Detecting rake tasks   -----> Discovering process types  Procfile declares types -> scraper Injecting scraper and running... 8 scraper.rb:66:in `block (2 levels) in current_members': undefined method `attr' for nil:NilClass (NoMethodError) from /app/vendor/bundle/ruby/2.0.0/gems/nokogiri-1.6.6.2/lib/nokogiri/xml/node_set.rb:187:in `block in each' from /app/vendor/bundle/ruby/2.0.0/gems/nokogiri-1.6.6.2/lib/nokogiri/xml/node_set.rb:186:in `upto' from /app/vendor/bundle/ruby/2.0.0/gems/nokogiri-1.6.6.2/lib/nokogiri/xml/node_set.rb:186:in `each' from scraper.rb:63:in `block in current_members' from /app/vendor/bundle/ruby/2.0.0/gems/nokogiri-1.6.6.2/lib/nokogiri/xml/node_set.rb:187:in `block in each' from /app/vendor/bundle/ruby/2.0.0/gems/nokogiri-1.6.6.2/lib/nokogiri/xml/node_set.rb:186:in `upto' from /app/vendor/bundle/ruby/2.0.0/gems/nokogiri-1.6.6.2/lib/nokogiri/xml/node_set.rb:186:in `each' from scraper.rb:55:in `current_members' from scraper.rb:32:in `scrape_term' from scraper.rb:139:in `block in <main>' from scraper.rb:137:in `reverse_each' from scraper.rb:137:in `<main>'

Statistics

Average successful run time: 2 minutes

Total run time: 5 days

Total cpu time used: about 4 hours

Total disk space used: 1.15 MB

History

  • Auto ran revision b4188009 and failed .
    nothing changed in the database
    1 page scraped
  • Auto ran revision b4188009 and failed .
    nothing changed in the database
  • Auto ran revision b4188009 and failed .
    3898 records removed in the database
    1 page scraped
  • Auto ran revision b4188009 and completed successfully .
    2 records updated in the database
    8 pages scraped
  • Auto ran revision b4188009 and completed successfully .
    nothing changed in the database
    8 pages scraped
  • ...
  • Created on morph.io

Show complete history

Scraper code

Ruby

poland-sejm-wikipedia / scraper.rb