554512 Scrapers for two websites

Completed Posted Mar 1, 2012 Paid on delivery
Completed Paid on delivery

Two php scripts which scrape Television Series information the two websites below:

[url removed, login to view]

fields: (title, year, imdb website, description, categories (drama, comedy, etc..), actors, url of image. (The list has 334 titles)

[url removed, login to view]

You must traverse the tree, enter each series page to extract additional information.

fields to scrape: title, wikipedia page of base series e.g. /30_rock, , official website, imdb website, actors (the list has 217 series)

Each script must output to a separate csv file.

A third php script must merge the duplicate data: if two entries have the same imdb url, the wikipedia entry must be completed with the data from the imdb scrape.

Use php and curl with appropriate agents. No user interface, the scripts must be executed on command line, separately

Data Entry Odd Jobs PHP SQL Web Scraping

Project ID: #2300463

About the project

1 proposal Remote project Active Jul 11, 2012

Awarded to:

topman2009

Hi, I will do a good [login to view URL] check PMB. Thanks

$75 USD in 2 days
(0 Reviews)
0.0