jadearama / address_scraper

productionhub address scraper


This scraper crawls productionhub.com for production companies. For each company, it follows the company's link, then extracts the address and parses it into [line1, line2, city, state, zip]. It also removes text like “inc” and “llc” and any trailing punctuation from the company name. Companies whose address fits on a single line (an incomplete address) are skipped.
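The name cleanup and address parsing described above could look roughly like the sketch below. It is not the code in scraper.py. The function names, the regular expressions, and the assumption that the last address line has the form "City, ST 12345" are all hypothetical and chosen only for illustration. It also assumes that “inc” and “llc” are stripped only when they trail the name.

```python
import re

# Hypothetical pattern: a trailing "inc" or "llc", optionally followed by a period.
# The original scraper's exact suffix list is not shown on this page.
_SUFFIX_RE = re.compile(r"[\s,]+(?:inc|llc)\.?$", re.IGNORECASE)

# Assumed shape of the last address line: "City, ST 12345" or "City, ST 12345-6789".
_CITY_STATE_ZIP_RE = re.compile(
    r"^(?P<city>.+?),\s*(?P<state>[A-Z]{2})\s+(?P<zip>\d{5}(?:-\d{4})?)$"
)


def clean_company_name(name):
    """Strip a trailing 'inc'/'llc' marker, then any trailing punctuation."""
    name = _SUFFIX_RE.sub("", name.strip())
    return name.rstrip(" .,;:-")


def parse_address(lines):
    """Return [line1, line2, city, state, zip], or None for an incomplete address."""
    lines = [ln.strip() for ln in lines if ln.strip()]
    if len(lines) < 2:
        return None  # one-line address: treated as incomplete and skipped
    match = _CITY_STATE_ZIP_RE.match(lines[-1])
    if match is None:
        return None  # last line does not look like "City, ST ZIP"
    street = lines[:-1]
    line1 = street[0]
    line2 = ", ".join(street[1:])  # empty string when there is no second line
    return [line1, line2, match["city"], match["state"], match["zip"]]
```

For example, `parse_address(["76 Warren St.", "Concord, NH 03301"])` yields a five-element list with an empty second address line, while a single-line input returns `None` so the caller can skip that company.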

Forked from ScraperWiki

Contributors jadearama

This scraper has not yet been run

Data

Downloaded 1 time by MikeRalphson



rows 10 / 791

Address line 1 | Address line 2 | Production Co | Zip | City | Number | State
257 W. 52nd St, 2nd Fl | | Zollo Productions | 10019 | New York | 791 | NY
76 Warren St. | | 1L Media | 03301 | Concord | 1 | NH
50 W. 23rd St. | | 24fps Productions | 10010 | New York | 2 | NY
2443 130th Pl SE | | 3 Loaves Productions | 98030 | Seattle | 3 | WA
381 Broadway | | 360 Sound and Vision Entertainment | 10013 | New York | 4 | NY
PO Box 413 | | 371 Productions | 53211 | Milwaukee | 5 | WI
4040 Vineland Ave, Ste 105 | | 44 Blue Productions | 91604 | Studio City | 6 | CA
10426 Regent St | | 4word Thought Entertainment | 90034 | Los Angeles | 7 | CA
750 Ponce De Leon Place, Suite 2 | | 7th Wave Pictures | 30306 | Atlanta | 8 | GA
S Lucile St | | 9Elephants Productions | 98118 | Seattle | 9 | WA

Statistics

Total run time: less than 5 seconds

Total cpu time used: less than 5 seconds

Total disk space used: 28.6 KB

History

Scraper code

Python

address_scraper / scraper.py