We need to extract specific data from a major retailer website.
We will provide store category URLs and then you build a system that will visit each product page within each given category URL and extract: Product Title, UPC code, Product Price and Product Page URL.
All of the data can be found in the page source of each product page.
The output will be CSV
Based on experience this could yield hundreds of thousands of results and may require the use of your own proxy systems and multi-threading capabilities.
We are not particularly interested in building a software application. We are interested in the output as described. So the project deliverable will be the properly formatted CSV output using whatever software or system you may have at your disposal.
74 freelancers are bidding on average $456 for this job
I am ready to get started right away.... Can we discuss the project details? My distinction, payment after your complete satisfaction with the resulted task.
-Could you provide few URLs to let us test the scraping first. -Plus how many URLs are there. We will need to see at what interval or attempts of scraping proxy is [url removed, login to view] for URLs