Develop a file conversion algorithm

₹75000-150000 INR

Closed

Posted

over 7 years ago

₹75000-150000 INR

Paid on delivery

Need to write an algorithm to convert PDF files to Excel after conversion and [login to view URL] exist in ceratin fixed format(s) and need to be converted to pre-defined [login to view URL] volume of files means ML could improve results.

Algorithm

Machine Learning (ML)

OCR

PDF

Python

Project ID: 11074183

About the project

3 proposals

Remote project

Active 8 yrs ago

Looking to make some money?

Email address

Benefits of bidding on Freelancer

Set your budget and timeframe

Get paid for your work

Outline your proposal

It's free to sign up and bid on jobs

3 freelancers are bidding on average ₹141,667 INR for this job

@theeren

This sounds really interesting. I'd love to work on it. It would be very helpful if you could provide sample pdf files such that I can better evaluate the technology stack that is required. Python would be the main backend driver. I would potentially utilize a distributed task queue such as Celery with a RabbitMQ messaging queue to enable simple scaling of the process. I would include unit-test to support high quality code. We can define milestones as we move forward.

₹150,000 INR in 30 days

5.0

(3 reviews)

2.8

@kernvollig

Hi, my name is Paul and I'm a python developer of cloud apps backend and solution architecture. It so happens that I developed a script that converts PDF files to excel sheets and also that extracts BI columnar-oriented data. This script does not use OCR but I have a little experience with PyTesseract as well and that could come in handy in this case. Also I'm a machine learning enthusiast and have fiddled with that for fun in the late days, as well as I have a friend that got his Master degree in this matter and gave me some tips and advices. Hope we can work it out. Cheers.

₹125,000 INR in 30 days