An HTML parsing program to collect data from URLs

Completed Posted Feb 16, 2008 Paid on delivery

I want a program with the following requirements. I hope this spec is detailed enough. I've also attached an example to make sure you understand everything. Please tell me in your bid what programming language you are using.

The program takes a file ([url removed, login to view]) containing a list of URLs (a sample [url removed, login to view] is in the attachments).
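As a rough illustration of the input step, here is a minimal Python sketch. It assumes the list file holds one URL per line, which the attached sample would need to confirm; the function name `read_urls` is hypothetical.

```python
from pathlib import Path


def read_urls(list_path):
    """Return the non-empty lines of the URL list file, in order."""
    # Assumption: the file is plain text with one URL per line.
    text = Path(list_path).read_text(encoding="utf-8")
    # Strip whitespace and skip blank lines so stray newlines do not become URLs.
    return [line.strip() for line in text.splitlines() if line.strip()]
```

The returned list can then be looped over to fetch and parse each page.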

For each HTML page, collect the following information:

Visitor

Home

Visitor total score

Home total score

Game Weather

Played Indoor or outdoor

Temp(F)

Humidity

Wind(mph)

The first four items can be found in the section above 'Scoring Plays'

The rest of the items are in the section 'Game Day Weather'.

If the game was played indoors, all weather-related items must be blank. If a page does not have a certain weather item, leave that item blank.

A retractable roof is considered indoor.
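The blanking rules above could be sketched in Python roughly as follows. This is only an illustration: it assumes indoor games can be recognised by the words "indoor" or "retractable" in the scraped text, and the function name, the dictionary keys, and the output label `Indoor` are hypothetical choices that should be checked against the attached example output.

```python
WEATHER_FIELDS = ("Weather", "Temp", "Humidy", "Wind")


def normalize_weather(played, raw):
    """Apply the indoor/blank rules to scraped weather values.

    played: text scraped for 'Played Indoor or outdoor' (may be empty).
    raw: dict of whatever weather items were found on the page.
    """
    # Assumption: indoor games are recognisable by "indoor" or "retractable"
    # appearing in the scraped text; real pages may word this differently.
    label = played.strip().lower()
    is_indoor = "indoor" in label or "retractable" in label
    row = {"IndoorOutdoor": "Indoor" if is_indoor else played.strip()}
    for field in WEATHER_FIELDS:
        # Indoor games get blanks; missing or empty items also become blank.
        row[field] = "" if is_indoor else (raw.get(field) or "").strip()
    return row
```

Keeping the rule in one function makes it easy to adjust if testing turns up pages that describe the venue differently.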

Although most HTML files share the same or a similar format, there may be variations, so spend some time testing your program if you see blank items.

Write the items to a pipe-delimited file, one game per row. The column names are:

url|date|vistorCode|homeCode|Vistor|Home|VistorScore|HomeScore|Weather|IndoorOutdoor|Temp|Humidy|Wind

The first 4 columns come from the URL itself; the rest are the items collected from the page. The format of the date column is yyyy-mm-dd.
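One possible way to produce this output in Python is sketched below, using the standard `csv` module with `|` as the delimiter. The column names are copied from the spec, spelling included; the helper names `write_rows` and `format_date` are hypothetical, and how the date and team codes are pulled out of each URL is left out because the URL layout is not shown here.

```python
import csv
from datetime import date

# Column names exactly as given in the spec.
COLUMNS = ["url", "date", "vistorCode", "homeCode", "Vistor", "Home",
           "VistorScore", "HomeScore", "Weather", "IndoorOutdoor",
           "Temp", "Humidy", "Wind"]


def format_date(game_date):
    """Render a datetime.date as yyyy-mm-dd."""
    return game_date.isoformat()


def write_rows(out_path, rows):
    """Write a header line plus one pipe-delimited line per game."""
    with open(out_path, "w", newline="", encoding="utf-8") as handle:
        # restval="" leaves any column missing from a row blank.
        writer = csv.DictWriter(handle, fieldnames=COLUMNS, delimiter="|",
                                restval="", lineterminator="\n")
        writer.writeheader()
        for row in rows:
            writer.writerow(row)
```

Each row is a dictionary keyed by column name. Note that the `csv` module will wrap a value in quotes if it happens to contain a `|` character.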

Save the pipe-delimited file as [url removed, login to view] in the same folder as the program. You also need to save all the HTML files to a folder called htmls in the same folder as the program.
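Saving the downloaded pages could look something like the following Python sketch. It assumes the last path segment of each URL makes a usable and unique file name, which may not hold for the real URLs; the function names are hypothetical.

```python
from pathlib import Path, PurePosixPath
from urllib.parse import urlparse


def html_path_for(url, base_dir):
    """Return the path under base_dir/htmls where this URL's page is stored."""
    # Assumption: the last path segment of the URL is a usable, unique name.
    name = PurePosixPath(urlparse(url).path).name or "index.html"
    return Path(base_dir) / "htmls" / name


def save_html(url, html_bytes, base_dir):
    """Write the raw page bytes into the htmls folder, creating it if needed."""
    target = html_path_for(url, base_dir)
    target.parent.mkdir(parents=True, exist_ok=True)
    target.write_bytes(html_bytes)
    return target
```

In a script, `base_dir` could be `Path(__file__).resolve().parent` so that the htmls folder sits next to the program, and the page bytes could come from `urllib.request.urlopen(url).read()`.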

Attached are a sample [url removed, login to view] file and a full example of the input/output.

Thanks.

## Deliverables

1) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done.

2) Deliverables must be in ready-to-run condition, as follows (depending on the nature of the deliverables):

a) For web sites or other server-side deliverables intended to only ever exist in one place in the Buyer's environment--Deliverables must be installed by the Seller in ready-to-run condition in the Buyer's environment.

b) For all others including desktop software or software the buyer intends to distribute: A software installation package that will install the software in ready-to-run condition on the platform(s) specified in this bid request.

3) All deliverables will be considered "work made for hire" under U.S. Copyright law. Buyer will receive exclusive and complete copyrights to all work purchased. (No GPL, GNU, 3rd party components, etc. unless all copyright ramifications are explained AND AGREED TO by the buyer on the site per the coder's Seller Legal Agreement).

## Platform

Windows

Java, Perl, PHP, or anything else, as long as you tell me how to run the program

Java Perl PHP Python

Project ID: #3725564

About the project

32 proposals Remote project Active Feb 18, 2008

Awarded to:

neoinc

See private message.

$25.5 USD in 10 days
(16 Reviews)
2.6

32 freelancers are bidding on average $33 for this job

kindcodersl

See private message.

$39.95 USD in 10 days
(256 Reviews)
6.8
tzo

See private message.

$42.5 USD in 10 days
(254 Reviews)
6.5
bahe

See private message.

$34 USD in 10 days
(145 Reviews)
6.5
niculescud

See private message.

$42.5 USD in 10 days
(75 Reviews)
6.0
torianvw

See private message.

$33.15 USD in 10 days
(35 Reviews)
5.7
ovidiuv

See private message.

$25.5 USD in 10 days
(228 Reviews)
5.5
AncaU

See private message.

$42.5 USD in 10 days
(107 Reviews)
5.2
jogomon2

See private message.

$42.5 USD in 10 days
(30 Reviews)
5.2
CherylFernandes

See private message.

$24.65 USD in 10 days
(115 Reviews)
5.0
amiytarik

See private message.

$42.5 USD in 10 days
(16 Reviews)
5.0
mradityagoyal

See private message.

$42.5 USD in 10 days
(39 Reviews)
4.8
aldenmlvw

See private message.

$42.5 USD in 10 days
(34 Reviews)
4.8
djken

See private message.

$12.75 USD in 10 days
(37 Reviews)
4.5
medoos

See private message.

$28.05 USD in 10 days
(21 Reviews)
4.5
klin

See private message.

$17 USD in 10 days
(72 Reviews)
4.5
egycodersvw

See private message.

$29.75 USD in 10 days
(41 Reviews)
4.4
notvalidalv

See private message.

$17 USD in 10 days
(28 Reviews)
4.4
dpune

See private message.

$29.75 USD in 10 days
(23 Reviews)
4.1
etachevavw

See private message.

$42.5 USD in 10 days
(30 Reviews)
3.9
cphp

See private message.

$17 USD in 10 days
(6 Reviews)
3.3