
Development of WebCrawler for job offers

$750-1500 USD

Cancelled
Posted almost 15 years ago

Paid on delivery
The outcome of the development project has to be a WebCrawler for websites (mainly company sites) containing job offers, which extracts the content and stores it in a database.

Extraction of content
The content of the websites containing the job offers (job description) has to be extracted in text format and exported to a MySQL database in the format "onlinedate, offlinedate, id, category, url, jobtitle, jobdescription, e-mail (if existing), phone (if existing), contactperson (if existing)".

Validity of jobs
The software also has to verify whether jobs have been taken offline by a company. Such jobs then have to be marked offline in the database with the date on which the job was taken offline (the date of the crawling process in which the software identified that the job is no longer online).

Definition / Configuration of URLs
The configuration of the software must allow the URLs of the websites to be crawled, including sub-URLs, to be defined. The URLs are mainly company web pages, and the software has to identify on which pages job offers are published. Please do not bid on the project if you are not familiar with the technology of web crawlers.

Configuration of Keywords
Besides the URLs, keywords can also be defined; a website will only be extracted if predefined keywords appear on the site. Keywords can be grouped into a category so that a job which has been found can be assigned to a certain job category (e.g. consulting, banking, …).

Progress and statistics
A small progress and statistics module of the software always has to show the progress of a crawling process as well as the result (websites crawled, new jobs found, jobs updated, jobs taken offline). The data has to be provided for each new crawling process.

Error Messages
An error log has to be written so that problems which occurred during a crawling process can be identified.
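The database format, the offline-marking rule and the keyword categories described above could be sketched roughly as follows. This is only an illustration under several assumptions: it uses Python's built-in sqlite3 module as a stand-in for MySQL so that it runs without a server, the column types are guesses, and the names `SCHEMA`, `CATEGORIES`, `categorize` and `record_crawl` as well as the keyword lists are hypothetical and not part of the posting.

```python
import sqlite3
from datetime import date

# Column names follow the format listed in the posting; the types are assumptions.
SCHEMA = """
CREATE TABLE IF NOT EXISTS jobs (
    id             INTEGER PRIMARY KEY AUTOINCREMENT,
    onlinedate     TEXT NOT NULL,
    offlinedate    TEXT,
    category       TEXT,
    url            TEXT NOT NULL UNIQUE,
    jobtitle       TEXT NOT NULL,
    jobdescription TEXT NOT NULL,
    email          TEXT,
    phone          TEXT,
    contactperson  TEXT
)
"""

# Hypothetical keyword groups; the real ones would come from configuration.
CATEGORIES = {
    "consulting": ["consultant", "consulting"],
    "banking": ["bank", "credit"],
}


def categorize(text):
    """Return the first category with a keyword in the text, else None."""
    lowered = text.lower()
    for category, keywords in CATEGORIES.items():
        if any(keyword in lowered for keyword in keywords):
            return category
    return None


def record_crawl(conn, found, crawl_date):
    """Store the result of one crawl and return its statistics.

    found maps url -> (jobtitle, jobdescription) for every job page seen.
    """
    stats = {"new": 0, "updated": 0, "offline": 0}
    day = crawl_date.isoformat()
    for url, (title, description) in found.items():
        category = categorize(title + " " + description)
        if category is None:
            continue  # no predefined keyword on the page, so it is not extracted
        row = conn.execute("SELECT id FROM jobs WHERE url = ?", (url,)).fetchone()
        if row is None:
            conn.execute(
                "INSERT INTO jobs (onlinedate, category, url, jobtitle, jobdescription)"
                " VALUES (?, ?, ?, ?, ?)",
                (day, category, url, title, description),
            )
            stats["new"] += 1
        else:
            # Simplification: every job seen again counts as updated and is
            # treated as online again.
            conn.execute(
                "UPDATE jobs SET category = ?, jobtitle = ?, jobdescription = ?,"
                " offlinedate = NULL WHERE id = ?",
                (category, title, description, row[0]),
            )
            stats["updated"] += 1
    # Jobs still marked online but missing from this crawl are taken offline,
    # using the date of this crawl as the offline date.
    online = conn.execute("SELECT id, url FROM jobs WHERE offlinedate IS NULL").fetchall()
    for job_id, url in online:
        if url not in found:
            conn.execute("UPDATE jobs SET offlinedate = ? WHERE id = ?", (day, job_id))
            stats["offline"] += 1
    conn.commit()
    return stats
```

To try it, open `sqlite3.connect(":memory:")`, run `conn.execute(SCHEMA)` once, and call `record_crawl` once per crawling process. A MySQL version would need a separate driver and, with the common drivers, `%s` placeholders instead of `?`.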
Performance
The WebCrawler must ensure a fast crawling process. PLEASE ONLY bid on the project if you are a very experienced developer and are familiar with crawling technologies.
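One common way to approach the speed requirement is to fetch pages concurrently while counting results and logging failures. The sketch below is an assumption about how that might look, not something the posting specifies: `crawl_all`, the `fetch` callback and the `max_workers` value are hypothetical names and settings, and the actual HTTP download is left to whatever function is passed in as `fetch`.

```python
import logging
from concurrent.futures import ThreadPoolExecutor, as_completed

log = logging.getLogger("crawler")


def crawl_all(urls, fetch, max_workers=8):
    """Fetch all urls concurrently.

    fetch(url) is expected to return the page text or raise on failure.
    Returns (pages, stats), where pages maps url -> text.
    """
    pages = {}
    stats = {"crawled": 0, "errors": 0}
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = {pool.submit(fetch, url): url for url in urls}
        for future in as_completed(futures):
            url = futures[future]
            try:
                pages[url] = future.result()
                stats["crawled"] += 1
            except Exception as exc:
                # Failures go to the error log and do not stop the crawl.
                stats["errors"] += 1
                log.error("failed to crawl %s: %s", url, exc)
    return pages, stats
```

Threads suit this kind of work because most of the time is spent waiting on the network. A production crawler would also need per-site rate limiting and timeouts, which are omitted here.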
Project ID: 437220

About the project

3 proposals
Remote project
Active 15 yrs ago

3 freelancers are bidding on average $1,317 USD for this job
for a bit high budget,i can help you
$1,500 USD in 50 days
5.0 (8 reviews)
6.2
Hello, please check PMB
$1,500 USD in 30 days
5.0 (2 reviews)
2.9
Hi, please check PMB.
$950 USD in 21 days
5.0 (1 review)
2.2

About the client

Hamburg, Germany
4.8
4
Member since May 19, 2007

Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)