Find Jobs
Hire Freelancers

Contact Scraper from Email Signature

$100-1000 USD

Closed
Posted almost 14 years ago

$100-1000 USD

Paid on delivery
We need a program developed capable of the following: 1) Take as input an? English-language HTML or plaintext email.? 2) Detect the email signature in the email 3) Parse the email signature for all available data, including: - Name - Position - Company - Email address - Phone Numbers (cell, office, fax) - Personal/Company website address - Skype, AIM, Google Chat, Twitter or other Social contact methods 4) Output a VCard with all available fields filled in and with entire email signature as Note section of the VCard ## Deliverables The only input into the program will be an Plaintext and/orHTML email, with complete header and body information. ? The email will have been forwarded from another source, sowill contain a New section at the top and a Forwarded section below [login to view URL] states that email programs add some sort of separator, which differsfrom program to program ? Examples: 1. Google Mail: "---------- Forwarded message ----------" 2. Outlook: 1. "? _____ " in plaintext 2. "<hr size=3D2 width=3D"100%" align=3Dcenter tabindex=3D-1>" in HTML format (content of <HR> tag may differ, but <HR> tag generally separates New from Forwarded part 3. Yahoo: 1. "--- On Sat, 6/12/10, John Nyaradi <john@[login to view URL]> wrote:" in plaintext 2. "--- On <b>Sat, 6/12/10, John Nyaradi <i><john@[login to view URL]></i></b> wrote:" in HTML ? The program will attempt to identify a single signatureblock in the Forwarded section of the email, below the Forward separator. Inthe case of multiple signature blocks, the signature block closest to the beginningof the email will be the one used. ? A signature block traditionally consists of some/all of thefollowing information: * Full Name * Position * Company * Address * Phone numbers * * Office/Work * Direct * Cell/Mobile * Fax * Email Address * Personal/Company website address * Other contact information: * * Facebook Profile Link * Twitter URL or Name * Google Chat Name * Skype Name ? Some other methods to consider in order to identify thesignature block: * Signature blocks are usually separated from the rest of the document by one or more line breaks and possibly some sort of horizontal spacer (-, _, =, *, etc). * Signature blocks also may immediately follow a Complimentary Close - "Thanks," "Best Regards" "Sincerely" "Best" etc <!-- --> * Signature blocks usually have more than one of the previously mentioned elements in close proximity, with a few (Name, Phone, Email) almost always standard. * The name and/or email in the signature block may match the From/To/Subject/Sent information in the begining of the forwarded part of the email. Once again, the format in which this is written changes according to email program, but generally contains 4 elements - From, Sent, To and Subject * * From: Lena Dander [mailto:lena@[login to view URL]] * Sent: Saturday, June 12, 2010 11:59 * To: Mike Roesh * Subject: my address * Signature blocks may have uncommon separators in the block - ie, if two phone numbers are on one line, they may be separated by "|" or "-" ? Once the signature block is identified, the informationshould be parsed and output in XML format with all of the available data thathas been identified. We can discuss the actual format of the XML elementslater, but they will be mostly based on the elements of the signature block discussedearlier. ? If no signature block is found, the program will attempt toidentify the Full Name and Email of the sender in the forwarded email via theFrom/To/Sent/Subject lines in the body of the email. The program should outputan XML with those details. ? If neither tasks can be accomplished, the program shouldreturn an event stating so. ? Given that intelligent parsing is never 100% accurate, wewill work with the chosen developer to set success targets for correctlyidentifying, parsing and outputting the signature block information from alarge sample of emails (several hundred or thousand emails).
Project ID: 3498245

About the project

19 proposals
Remote project
Active 14 yrs ago

Looking to make some money?

Benefits of bidding on Freelancer

Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
19 freelancers are bidding on average $547 USD for this job
User Avatar
See private message.
$637.50 USD in 14 days
4.9 (391 reviews)
7.5
7.5
User Avatar
See private message.
$850 USD in 14 days
4.8 (153 reviews)
7.5
7.5
User Avatar
See private message.
$340 USD in 14 days
5.0 (238 reviews)
6.9
6.9
User Avatar
See private message.
$233.75 USD in 14 days
5.0 (62 reviews)
5.9
5.9
User Avatar
See private message.
$253.30 USD in 14 days
4.9 (18 reviews)
5.4
5.4
User Avatar
See private message.
$212.50 USD in 14 days
5.0 (58 reviews)
5.3
5.3
User Avatar
See private message.
$616.25 USD in 14 days
4.9 (69 reviews)
5.0
5.0
User Avatar
See private message.
$484.50 USD in 14 days
5.0 (68 reviews)
5.0
5.0
User Avatar
See private message.
$850 USD in 14 days
5.0 (6 reviews)
3.7
3.7
User Avatar
See private message.
$850 USD in 14 days
5.0 (17 reviews)
3.6
3.6
User Avatar
See private message.
$722.50 USD in 14 days
5.0 (6 reviews)
3.3
3.3
User Avatar
See private message.
$850 USD in 14 days
3.0 (17 reviews)
5.2
5.2
User Avatar
See private message.
$425 USD in 14 days
5.0 (4 reviews)
2.3
2.3
User Avatar
See private message.
$807.50 USD in 14 days
5.0 (1 review)
1.8
1.8
User Avatar
See private message.
$170 USD in 14 days
5.0 (5 reviews)
1.9
1.9
User Avatar
See private message.
$314.50 USD in 14 days
4.9 (4 reviews)
1.7
1.7
User Avatar
See private message.
$850 USD in 14 days
5.0 (3 reviews)
1.9
1.9
User Avatar
See private message.
$850 USD in 14 days
0.0 (2 reviews)
2.2
2.2
User Avatar
See private message.
$85 USD in 14 days
0.0 (0 reviews)
0.0
0.0

About the client

Flag of UNITED STATES
Cincinnati, United States
0.0
0
Member since Jun 12, 2010

Client Verification

Thanks! We’ve emailed you a link to claim your free credit.
Something went wrong while sending your email. Please try again.
Registered Users Total Jobs Posted
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Loading preview
Permission granted for Geolocation.
Your login session has expired and you have been logged out. Please log in again.