Write script to extract data from published HTML tables into a CSV or JSON standard output
$30-250 USD
Paid on delivery
The website [login to view URL] publishes entrants for sailing races into a set of tables divided by class. A script is needed to process the HTML and extract the data into CSV or JSON format.
The input will be the "regatta ID" and a format modifier (-csv or -json)
Example:
% rnget 12672 -json >> [login to view URL]
An example event is:
[login to view URL]
under the link "Current Registrants"
The specific frame in question is called by:
[login to view URL]
Where the variable to be input is the "regatta_id", in this example 12672
For JSON, the desired schema would be:
{
"Entrant #": "<#>"
"Skipper": "<skipper name>"
"Club": "<club name>"
"Sail#": "<sail number>"
"Boat Name": "<boat name>"
"Fleet / Division": "<fleet/division name>"
"Make/Model": "<make of boat/model of boat>"
"Rating": "<rating #>"
"Date Entered": "<date entry received>"
}
For CSV, the desired output would be:
Entrant #,Skipper,Club,Sail#,Boat Name,Fleet / Division,Make/Model,Rating,Date Entered
Raw HTML looks like this for each entrant:
<tr valign="top">
<td bgcolor="#ffffff" width="15" valign="top"><font face="Verdana, Arial, Helvetica, sans-serif" size="1">1.</font></td>
<td bgcolor="#ffffff" valign="top"><font face="Verdana, Arial, Helvetica, sans-serif" size="1"> Doug Johnstone <br /><br /> </font></td>
<td bgcolor="#ffffff" valign="top"><font face="Verdana, Arial, Helvetica, sans-serif" size="1"> SBYC </font></td>
<td bgcolor="#ffffff" valign="top"><font face="Verdana, Arial, Helvetica, sans-serif" size="1"> 56491 </font></td>
<td bgcolor="#ffffff" valign="top"><font face="Verdana, Arial, Helvetica, sans-serif" size="1"> LeReve </font></td>
<td bgcolor="#ffffff" valign="top"><font face="Verdana, Arial, Helvetica, sans-serif" size="1"> Cruising Class / </font></td>
<td bgcolor="#ffffff" valign="top"><font face="Verdana, Arial, Helvetica, sans-serif" size="1"> Jeanneau / Jeanneau 36 ip </font></td>
<td bgcolor="#ffffff" valign="top"><font face="Verdana, Arial, Helvetica, sans-serif" size="1"> 108 </font></td>
<td bgcolor="#ffffff" valign="top"><font face="Verdana, Arial, Helvetica, sans-serif" size="1"> 07/07/2016 </font></td>
</tr>
This can be written in a common Linux scripting language such as Ruby or Python, or some other efficient scripting language.
Project ID: #14278024
About the project
Awarded to:
Hi, I am Ruby on Rails expert (over 6 years of experience).I can assure you quality of my work. Relevant Skills and Experience My Skills: 1. Ruby 2. Ruby on Rails 3. Git, Bitbucket 4. MySQL , PostgreSQL 5. HTML5 , H More
50 freelancers are bidding on average $117 for this job
I can complete the task you need in 1 day or less. I will be using "beautifulsoup4" which is a python HTML DOM manipulator that can work with python 2 and 3. Update: I've attempted the program and managed to complet More
HI SIR AWARD ME RIGHT NOW I M 100% CONFIDENT FOR THIS PROJECT!!! Stay tuned, I'm is still working on this proposal.
hi i am an expert webscraper. i can easily extract data and save it in the format you require. this is pretty straight forward. should be done in less than 2 days.
Hi I am very experienced in web scraping using Python I already read through your requirements, and I am sure I can complete this in 1 day Please message me Thanks Relevant Skills and Experience Python, Web scraping More
Hello, I am an experienced python coder with expertise in data processing. I would be happy to assist. Regards, Erik Relevant Skills and Experience I have extensive experience with data processing applicatio More
Hi I have a lot of experience in all things web and can write a PHP script that can parse the requested HTML into both JSON & CSV. You just need to be able to host the script. Can be done today. Relevant Skills and Ex More
Hello, I can do this quickly in python. Just let me know. Relevant Skills and Experience python, web scraping Proposed Milestones $50 USD - milestone
Hello, Hope you are doing well. I have more than 5 years of experience in web development. please initiate message board so we will discuss it. Thank you