Basic Web Scrape

Completed Posted Mar 15, 2012 Paid on delivery

I have a webpage that I want to scrape data from. I have already written the code to capture the HTML, so all I need is someone to implement the regex (or similar) to parse the page and populate an array with the data. Attached is the HTML you will be parsing. The structure does not change, but the number of entries will.

INPUT:

(See attached.) I will pass this in a variable called $html at the top of the script. For now you can just save the file and use file_get_contents() to grab the text. Later I will add some cURL code to fetch this data from a form submission.
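As a minimal sketch of the interim loading step described above: the helper name load_html() and the temporary file are illustrative assumptions, not part of the brief. The temporary file only stands in for wherever the attached sample is saved.

```php
<?php
// Hypothetical helper: read the saved sample page into a string.
// Throws instead of silently returning false when the file is unreadable.
function load_html(string $path): string
{
    $html = @file_get_contents($path);
    if ($html === false) {
        throw new RuntimeException("Could not read HTML from {$path}");
    }
    return $html;
}

// Demo: a temporary file stands in for the attached sample.
$path = tempnam(sys_get_temp_dir(), 'scrape');
file_put_contents($path, '<html><body>sample</body></html>');
$html = load_html($path);   // the brief expects the page in $html
unlink($path);
echo strlen($html), " bytes loaded\n";
```

Once the cURL step exists, only the assignment to $html would need to change. The parsing code can keep reading from the same variable.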

OUTPUT:

A multidimensional array keyed by the "Account Number", with all sub-array keys set to the name of the field. Here is a complete example showing the fields I need, based on the first two records in the sample data.

$output =
Array
(
    [0162740110030] => Array
        (
            [Precinct Number] => 1
            [Sale Date (Sale Nbr)] => 04/03/2012 (1)
            [Cause Number] => 2008-71303
            [District Court] => 189
            [Case Style] => HARRIS COUNTY, ET AL VS WILL BAILEY, ET AL
            [Legal Description] => LT 30 BLK 11 HIGHLAND HEIGHTS
            [Physical Address] => 6633 TUSKEGEE ST 77091
            [Adjudged Value] => $6,000
            [Estimated Minimum Bid] => $6,000.00
            [Status] =>
        )

    [0833680000007] => Array
        (
            [Precinct Number] => 8
            [Sale Date (Sale Nbr)] => 04/03/2012 (1)
            [Cause Number] => 2010-20154
            [District Court] => 55
            [Case Style] => LA PORTE INDEPENDENT SCHOOL DISTRICT VS KELTON [url removed, login to view], ET AL
            [Legal Description] => LT 7 BLK 1A LAPORTE TERRACE
            [Physical Address] => 714 N 13TH ST 77571
            [Adjudged Value] => $62,063
            [Estimated Minimum Bid] => $4,295.86
            [Status] =>
        )

)
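The array shape above could be assembled from flat label/value pairs roughly as follows. This is only a sketch: add_record() is a hypothetical helper, and the field values are copied from the example. One detail worth noting is that PHP keeps a numeric string key as a string when it has a leading zero, but casts a key such as "833680000007" to an integer, so account numbers are best treated as strings when read back.

```php
<?php
// Hypothetical helper: move "Account Number" out of a flat field list
// and use it as the key of the outer array.
function add_record(array &$output, array $fields): void
{
    if (!isset($fields['Account Number']) || $fields['Account Number'] === '') {
        return; // skip chunks that carry no account number
    }
    $account = (string) $fields['Account Number'];
    unset($fields['Account Number']);
    $output[$account] = $fields;
}

$output = [];
add_record($output, [
    'Account Number'  => '0162740110030',
    'Precinct Number' => '1',
    'Status'          => '',
]);
add_record($output, [
    'Account Number'  => '0833680000007',
    'Precinct Number' => '8',
    'Status'          => '',
]);

foreach ($output as $account => $fields) {
    // Cast back to string in case PHP stored the key as an integer.
    echo (string) $account, ' => precinct ', $fields['Precinct Number'], "\n";
}
```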

HINTS/TIPS:

1. You don't have to use this if you have another method you prefer, but this class makes parsing HTML tables pretty quick and easy: [url removed, login to view]

2. Parse in chunks. Each listing begins with <td class="repTblCell">, each piece of data is in a <td class="repText">, and within that, the label for the array key is inside <span class="repTblPrompt">.

3. Note: The amount you bid is the amount you will be paid (upon successful completion). There are no tips or bonuses. I will thoroughly test the script and let you know of any problems BEFORE releasing escrow so that everything works 100% to specifications above before payment is made.
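Hint 2 above could be sketched along these lines. Note the assumptions: this uses PHP's built-in DOMDocument and DOMXPath rather than regex or the class linked in hint 1. The markup in $sample is an invented stand-in for the attachment, and only the three class names come from the brief. The sketch also assumes each label ends with a colon and that the listing cell appears before its data cells in document order. parse_listings() is a hypothetical name.

```php
<?php
// Sketch only: walks listing and data cells in document order.
function parse_listings(string $html): array
{
    $doc = new DOMDocument();
    libxml_use_internal_errors(true);   // tolerate sloppy real-world HTML
    $doc->loadHTML($html);
    libxml_clear_errors();

    $xpath = new DOMXPath($doc);
    // Each repTblCell starts a listing; the repText cells after it fill it.
    $cells = $xpath->query('//td[@class="repTblCell" or @class="repText"]');

    $output = [];
    $record = null;
    // Store the finished record under its account number, if it has one.
    $flush = function () use (&$output, &$record) {
        if ($record !== null && ($record['Account Number'] ?? '') !== '') {
            $account = (string) $record['Account Number'];
            unset($record['Account Number']);
            $output[$account] = $record;
        }
        $record = null;
    };

    foreach ($cells as $cell) {
        if ($cell->getAttribute('class') === 'repTblCell') {
            $flush();
            $record = [];
            continue;
        }
        if ($record === null) {
            continue; // data cell outside any listing
        }
        $prompt = $xpath->query('.//span[@class="repTblPrompt"]', $cell)->item(0);
        if ($prompt === null || trim($prompt->textContent) === '') {
            continue; // no usable label in this cell
        }
        $rawLabel = $prompt->textContent;
        $text = $cell->textContent;
        // The value is whatever follows the label text inside the cell.
        $value = substr($text, strpos($text, $rawLabel) + strlen($rawLabel));
        $label = rtrim(trim($rawLabel), ':');
        $record[$label] = trim(preg_replace('/\s+/', ' ', $value));
    }
    $flush();
    return $output;
}

// Invented sample markup (assumption), using the class names from hint 2.
$sample = <<<'HTML'
<table>
<tr><td class="repTblCell"><table>
<tr><td class="repText"><span class="repTblPrompt">Account Number:</span> 0162740110030</td></tr>
<tr><td class="repText"><span class="repTblPrompt">Precinct Number:</span> 1</td></tr>
<tr><td class="repText"><span class="repTblPrompt">Adjudged Value:</span> $6,000</td></tr>
<tr><td class="repText"><span class="repTblPrompt">Status:</span></td></tr>
</table></td></tr>
<tr><td class="repTblCell"><table>
<tr><td class="repText"><span class="repTblPrompt">Account Number:</span> 0833680000007</td></tr>
<tr><td class="repText"><span class="repTblPrompt">Precinct Number:</span> 8</td></tr>
</table></td></tr>
</table>
HTML;

$result = parse_listings($sample);
print_r($result);
```

Because the cells are visited in document order, the same loop should work whether the repText cells are nested inside the repTblCell or simply follow it. The real attachment would need to be checked to confirm which layout applies.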

HOW TO WIN THE PROJECT:

1. Show solid experience with scraping/data mining.

2. Experience on this website and positive feedback.

3. Low price and quick turnaround.

4. Bid early...I will likely NOT wait until the end of the bidding period if I find a developer that seems like a good fit.

Data Mining PHP Web Scraping

Project ID: #1505704

About the project

15 proposals Remote project Active Mar 15, 2012

Awarded to:

inspire007

dear sir...i already developed your project...the demo link is given in PM...thank you

$30 USD in 1 day
(118 Reviews)
6.5

15 freelancers are bidding on average $107 for this job

srinichal

I look forward to delivering the script.

$150 USD in 5 days
(109 Reviews)
7.3
mantislin

Hi sir, Please check PM.

$75 USD in 1 day
(189 Reviews)
6.9
waelfree

Hi, I can do that ISA

$80 USD in 1 day
(72 Reviews)
6.5
inzaghi2006

Hi, I write many scripts that use regex. Please contact me. Thanks

$70 USD in 2 days
(128 Reviews)
6.4
ansi2

Proposal details will follow. Thanks, 2ansi

$50 USD in 0 days
(87 Reviews)
6.3
phpXpertbd

I specialize in similar projects. Please check PM for more details.

$130 USD in 3 days
(30 Reviews)
6.2
tonykim100

Hello, I am ready to start now. Thanks.

$80 USD in 1 day
(113 Reviews)
6.1
Cueball61

What you ask for looks fairly simple to do. I have been using various scraping libraries for quite some time now, and have recently settled on the library you mentioned (very easy to use, I must say!). I can do this ...

$120 USD in 3 days
(12 Reviews)
5.3
procoder898

Hi, I am expert at Data Mining/Web Scraping and can surely satisfy you. Please check your inbox,

$69 USD in 2 days
(26 Reviews)
5.2
ViliusSutkus

Hello. Check the PM.

$50 USD in 1 day
(15 Reviews)
4.7
atozinfosoft

Hello Respected Client, I have read your requirements and we are very experienced in this area. Please check the Message Board for more details. Thanks

$300 USD in 6 days
(0 Reviews)
0.0
darioa

php scraper with knowledge of regex

$100 USD in 1 day
(0 Reviews)
0.0
falazar

Hi, I am a very experienced programmer, and will have no problem completing this project quickly. I deal with web scraping every day, from small issues to some very large projects. A couple of recent projects I h...

$100 USD in 2 days
(0 Reviews)
0.0