Find Jobs
Hire Freelancers

C# Spreadsheets and PDF Scraping

$250-750 USD

Completed
Posted over 7 years ago

$250-750 USD

Paid on delivery
This project is to write code in C# to scrape a variety of spreadsheets and pdf files from a list of sources across the internet. You will be provided with a code framework that provides methods to pull the data and save it. You will need to define the data you are scraping as C# objects (with entity framework "code first" attributes) and implementing the scraping logic for each file. A sample of the pattern we wish for you to follow is included. There are simple wrappers provided so you can give a URL or local file and get the relevant Excel\PDF\Html library loaded with the data for you so you only need to write the scrape logic and objects not the infrastructure. The libraries being used are: ClosedXml\OpenXml (Excel 2007+) NPOI (Excel 2003 and earlier) iTextSharp (PDF) HtmlAgilityPack (Html - probably not required for this project) The spreadsheets and pdf files we want to scrape can be found by looking at the below links: NOTE 1 - In many instances there are identical files for different time periods, so you can reuse the same scrape (so it isn't as many as it might look like at first) NOTE 2 - Sometimes there are zip files that contain multiple spreadsheets\pdf files which also need to be scraped. NOTE 3 - Sometimes there are PDF and Excel files representing the same data. In these cases you only need to scrape one of them (probably the spreadsheet as it is easier to do) [login to view URL] [login to view URL] [login to view URL] [login to view URL] [login to view URL] [login to view URL] [login to view URL] [login to view URL] [login to view URL] [login to view URL] [login to view URL] [login to view URL] [login to view URL] [login to view URL]
Project ID: 12489172

About the project

23 proposals
Remote project
Active 7 yrs ago

Looking to make some money?

Benefits of bidding on Freelancer

Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
Awarded to:
User Avatar
Hi! Mr. Goodier = ) I have gone through all the links that are under project description. Basically, I counted the number of files on each link, there are around 450 files, considering there are some "master files" that would contain data in outer distribuited files for years/products and so on. There are links that have many files, like one from the Italian government has 110 files. Others like the one from the USA or Brazil, have open links that could lead to more file links, so here I would expect you check to see if you will need files from those. On last project, average fee per file was $5 Usd, If I use that as reference then my bid goes way overbudget with the. So I have adjusted to match the one you specified here. Finally, I notice you mention the ITextSharp and HtmlAgilityPack. I have worked on both, with a note on iTextSharp which I don't find really great processing PDf as with previous experience, but anyways there are other libraries that I could use for doing the job. Many thanks on inviting me to bid on project. Looking forward! Best regards, Sergio
$549 USD in 30 days
4.7 (90 reviews)
6.8
6.8
23 freelancers are bidding on average $530 USD for this job
User Avatar
Hello Hi Hello Hi I read your requirement and interested in your project. I am good at C++, C#, Web and Mobile devlopement. I have some developer friends. So I can carry out your task fully. If you hire me, I can do your project as soon as possible. We can discuss detail on chat. I am always waiting you on chat room!. Best regards. Thank you. See you Again.
$555 USD in 10 days
4.9 (103 reviews)
6.7
6.7
User Avatar
hi there i can work on this scrapper logic however i have some questions. Please initiate discussion. Thankyou
$722 USD in 10 days
5.0 (79 reviews)
6.8
6.8
User Avatar
Hi, I am expert in making scrappers like this. I have plenty of experience using itextsharp, npoi, openxml. Please send me the code framework and other details. Thanks Barun
$250 USD in 10 days
4.9 (82 reviews)
6.5
6.5
User Avatar
A proposal has not yet been provided
$444 USD in 15 days
4.9 (35 reviews)
6.0
6.0
User Avatar
Dear sir, I am scraping expert, I have did too many scraping projects, please check my reviews then you will know. Can you tell me more details? then I will provide example data/script for you. Thanks, Kimi
$522 USD in 6 days
5.0 (8 reviews)
4.7
4.7
User Avatar
Hi sir I am Hasan Jack and I have more than 5 years of experience in C# Development. As because of my prior experience I really feel I can be best choice for this job. Looking forward to hear from you. Regards, Hasan Jack
$500 USD in 10 days
4.7 (6 reviews)
4.3
4.3
User Avatar
I am really interested to do this job and get started right away. Can we discuss the project details? Payment after you're completely satisfied nothing advance.
$666 USD in 10 days
5.0 (3 reviews)
4.2
4.2
User Avatar
Dear Sir, I'm writing in response to your task post. As a highly competent software specialist with more than nine years of experience , I would bring a high quality and service focused mindset to this job. Based on my experience in: - Managing and designing projects. - Developing and debugging in many different languages like C++, C#, VB.Net, VB6... - Many algorithms, design patterns, and a knack in problem solving. - Delivering with high quality based on careful testing. If I'm chosen I offer high quality software following known coding standards, and conventions. Milestones can be set once details are provided. Sincerely,
$666 USD in 12 days
5.0 (8 reviews)
4.2
4.2
User Avatar
Hello, My name is Mohd Rafi, I have 13 years of experience as an Architect/Tech Lead/Developer in .Net Technologies. I have carefully gone through your job post and it looks like a perfect fit for my skills set. I have developed a large number of Web enterprise applications. I have good knowledge of XML, Excel and have good experience in Web Scraping. It's been almost a decade that I've been working in C#, .Net, ASP.NET, WCF, Web Scraping, HTML, SQL, CSS, Javascript and Jquery. I have worked on many existing and new multi-tier applications from front-end to back-end.I have good knowledge of web handlers, web api, web services and have very good hands on the UI and on server side code. I have good working experience of Design patterns and architectural patterns. Programming is my passion. I can work alone or within a team. If you are ready, I can show some of the code that I have written to give you an idea how clean and easy to maintain code do I write. I have worked for several product companies. From last 3 years I was working with DocStar on their product 'Eclipse' which is a document management software. I would appreciate, if we can have 10 minutes meeting, I can show you how fruitful and right choice I will be for your project. One more thing I would like to tell you that I'm ready for a POC/test. I want to show you my work and skills. You can contact me on skype: mohammadrafionline Thanks
$750 USD in 20 days
5.0 (3 reviews)
4.2
4.2
User Avatar
Hello, We have accomplished 90% of the project which is similar of your requirement. All we need 10% customization as per your requirement set and specifications. I want to discuss in personal chat in order to explore your needs, which will yield a clear picture of implementation phase. Prior undertaking project, I want to show demo of the work done previously. Apart from demo, I will be sharing following documentation which will turn your project into Quality and Successful delivery: - Technical Project Proposal - Designs - Flow chart for this Project - Execution plan
$773 USD in 20 days
5.0 (4 reviews)
3.8
3.8
User Avatar
Hello, I am shahid from kashmir.   Over the last 7 years, I have worked for several clients. Joined Freelancer with over 7 years of experience in , Data entry, Linkedin Lead generation , Google Research Expert,Web scraping.  Python and Scrapping expert with 5 years of experience.  Linkedin API developer. ·          Using Python, Wordpress ,C Programming , C++Programming, Linux,PHP,MYSQL ,Java ,Javascript, ,Website Design ,Graphic Design,CSS,Research,Wordpress ,Magento ,Matlab and Mathematica ,Leads ,Web Search ,Machine Learning ,HTML5 ,Linkedin ,Landing Pages ,Web Services ,Internet Research ,Angular.js ,,Data mining,,Web scrapping,Find contacts, Data Processing, Data Entry, Excel, Leads, Web Search, Data Mining, Linked in, Microsoft Office, Email handling.  I am a highly skilled web researcher,data entry provider seeking an opportunity to leverage my expertise and demonstrate my high level of technical an administrative skills.  I have successfully completed more than 100 projects ranging from, Wordpress , C Programming ,  C++ Programming, Linux,PHP,MYSQL ,Java ,Javascript ,Website Design ,Graphic Design,CSS,Research,Wordpress ,Magento ,Leads ,Web Search ,Machine Learning ,HTML5 ,Linkedin ,Landing Pages ,Web Services web research, data entry, Internet Research, Linkedin Lead Generation, Google docs & Excel Spreadsheet creation/editing.  You can test the quality of my leads and also i provide leads at best price in the market.  Regards:
$250 USD in 0 day
5.0 (1 review)
2.0
2.0
User Avatar
If the infrastructure is written and you only need the scrapping algorithm , I may finish this in 5 to 7 days ,but I have to see your Code Framework first before committing to this . Also , I noticed that your source Links contain PDF & XLSX for the same data , If this is the case you don't need ITextSharp Lib .
$666 USD in 10 days
5.0 (2 reviews)
1.6
1.6
User Avatar
Dear Hiring Manager, Thank you for this wonderful opportunity. Today Your job posting has caught my attention because I’m keenly considering your job post “C# Spreadsheets and PDF Scraping”. I have 6+ years experience in C# Programing. I believe my abilities would be perfect for your venture. I can finish this job within the necessary time frame. I am professional software developer. I will be serving you with all my hard work and skills. I would be really grateful if you could give me this opportunity. Thank you for your time and consideration. Sincerely, Rakesh Verma
$300 USD in 5 days
0.0 (0 reviews)
0.0
0.0
User Avatar
I have good knowledge on PDF scraping and also I've worked on iTextsharp. Good knowledge in spreadsheet scraping. I have good technical knowledge.
$444 USD in 10 days
0.0 (0 reviews)
0.0
0.0
User Avatar
Hello sir, This is Himanshu. I have scrapped sites of different counties to get information of courts.I have used HAP and Selinium to scrap data form live site. I have one question about PDF scrap that shold we scrap that records in XML Format??
$277 USD in 10 days
0.0 (0 reviews)
0.0
0.0
User Avatar
Hello, Its a pleasure to let you know that I've Completed and Delivered similar project before. All I need to work upon customization part, if we can proceed towards more discussion. I have gone through your project description and confident to accomplish your project. I am an individual developer and you will be working directly with me if we proceed work on this project. My key skills are - 1)Ruby 2)Ruby On Rail 3)Angular JS 4)Node JS 5)PHP with Codeigniter and Laravel Framework. Let's initiate our chat so we can proceed towards conclusion of this project scope and give it a start as soon as possible. Thank & Regards Prashant Shinde
$833 USD in 18 days
0.0 (0 reviews)
0.0
0.0
User Avatar
I am well experienced in analysing the issues and providing the solution. Has 12+ years of experience in C++, C# dlls and application development. If you hire me, you will get satisfied result in time. I am here to make long term relationship. Lets discuss.
$555 USD in 10 days
0.0 (0 reviews)
0.0
0.0

About the client

Flag of UNITED KINGDOM
London, United Kingdom
5.0
2
Payment method verified
Member since Dec 2, 2016

Client Verification

Thanks! We’ve emailed you a link to claim your free credit.
Something went wrong while sending your email. Please try again.
Registered Users Total Jobs Posted
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Loading preview
Permission granted for Geolocation.
Your login session has expired and you have been logged out. Please log in again.