Build data extract and transformation tool

$250-750 USD

Completed
Posted over 8 years ago

Paid on delivery
We need a data extraction (E) and transformation (T) tool, the importer, that converts information stored in various file types (such as .csv, .xls, .xlsx, .mdb, .accdb, etc.) and structures (wide, long, dyadic, etc.) into a standardized .json format. The format is based on an already established data schema, and the tool is to be built off an existing prototype. Information on the JSON data schema can be found here: [login to view URL]

The data extraction (E) logic determines how to generate the flow of events from the data within each source file. The data transformation (T) logic determines how to serialize the data in that flow into separate JSON objects. The JSON output of the E+T tool will eventually be uploaded into DataGator, a cloud-based data aggregation tool that is currently under development. The full-fledged importer needs to be robust enough to identify different variants of standard data fields and records, including non-standard characters within the data as well as within the file names, and to reformat and package them into the final .json format that DataGator recognizes.

We are looking for developers with expertise in Java and data processing. Familiarity with Jackson/Apache POI would also be advantageous. This project may become a longer-term arrangement for the development of additional auxiliary components for the DataGator project. The final iteration of this tool will be included in a public rollout of the DataGator system.

To help candidate developers understand our requirements, we have created a prototype of the importer tool. The prototype includes a working example that converts a single .csv file to JSON output. It can be found in our public GitHub repository: [login to view URL]

The work to be completed is to extend this prototype both horizontally (i.e. extract data from other file types such as .xls, .xlsx, .mdb, .txt) and vertically (i.e. support data filtering, merging, and hierarchical extraction, such as selecting multiple worksheets within a single Excel file, one sheet in multiple Excel files, multiple sheets within multiple Excel files, etc.). Multiple modules need to be constructed, based on file types and additional functionality.

NOTE: This project is only for Module 1 (XLSX Extractor). All additional modules will be completed under follow-on projects, dependent on successful completion of Module 1.

Module 1: Build E+T functionality for .xlsx file
Module 2: Build E+T functionality for .xls file
Module 3: Build E+T functionality for .mdb file
Module 4: Build E+T functionality for .txt file
Module 5: Allow for processing of multiple files
Module 6: Create a Java GUI for client-side operation

Note: The logic for multi-sheet .xls/.xlsx/.mdb files can follow one of three alternative options: (i) extract data from one of the sheets; (ii) extract data from multiple sheets and combine them into a single JSON object; (iii) extract data from multiple sheets and output them as individual JSON objects.

We expect Module 1 (XLSX Extractor) to be completed within 3-4 days of the project being awarded, and the freelancer to be qualified to complete Modules 2 through 6.

We have provided several sample files that will need to be converted to JSON format and eventually uploaded. However, these sample files are not an exhaustive list of the characters or file structures the tool may encounter. The ideal freelancer will be willing and able to identify potential issues that may arise as other users begin to use the tool and load different file structures into it.

We are looking for freelancers who are passionate about this project and interested in a longer-term working relationship. Please reach out with any questions that you may have.
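As a rough illustration of the three multi-sheet options (i) to (iii) described in the note above, here is a minimal sketch in plain Java. Everything in it is an assumption made for illustration: the class `MultiSheetSketch`, the `Mode` enum, and the `extract` method are hypothetical names, the in-memory `Map` stands in for a workbook that a real Module 1 would read with Apache POI, and the array-of-rows output is a placeholder rather than the DataGator JSON schema. The hand-rolled string quoting is also only for the sketch; a real tool would more likely serialize with Jackson.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical sketch of the three multi-sheet extraction options. */
public class MultiSheetSketch {

    /** Hypothetical names for options (i), (ii) and (iii). */
    enum Mode { SINGLE_SHEET, MERGE_SHEETS, SPLIT_SHEETS }

    /** Quotes a cell value; only backslash and double quote are escaped (sketch only). */
    static String quote(String s) {
        return "\"" + s.replace("\\", "\\\\").replace("\"", "\\\"") + "\"";
    }

    /** Serializes the rows of one or more sheets as one JSON array of row arrays. */
    static String toJson(List<List<List<String>>> sheets) {
        StringBuilder sb = new StringBuilder("[");
        boolean firstRow = true;
        for (List<List<String>> rows : sheets) {
            for (List<String> row : rows) {
                if (!firstRow) sb.append(",");
                firstRow = false;
                sb.append("[");
                for (int i = 0; i < row.size(); i++) {
                    if (i > 0) sb.append(",");
                    sb.append(quote(row.get(i)));
                }
                sb.append("]");
            }
        }
        return sb.append("]").toString();
    }

    /**
     * Returns one JSON string per output object.
     * workbook maps sheet name to rows; selected lists the sheets to extract.
     */
    static List<String> extract(Map<String, List<List<String>>> workbook,
                                List<String> selected, Mode mode) {
        List<String> out = new ArrayList<>();
        switch (mode) {
            case SINGLE_SHEET: { // (i) only the first selected sheet
                out.add(toJson(Collections.singletonList(workbook.get(selected.get(0)))));
                break;
            }
            case MERGE_SHEETS: { // (ii) all selected sheets in a single JSON object
                List<List<List<String>>> all = new ArrayList<>();
                for (String name : selected) all.add(workbook.get(name));
                out.add(toJson(all));
                break;
            }
            case SPLIT_SHEETS: { // (iii) one JSON object per selected sheet
                for (String name : selected) {
                    out.add(toJson(Collections.singletonList(workbook.get(name))));
                }
                break;
            }
        }
        return out;
    }

    public static void main(String[] args) {
        // Made-up sample data standing in for two worksheets.
        Map<String, List<List<String>>> workbook = new LinkedHashMap<>();
        workbook.put("Sheet1", Arrays.asList(Arrays.asList("id", "name"), Arrays.asList("1", "Ann")));
        workbook.put("Sheet2", Arrays.asList(Arrays.asList("2", "Bob")));
        List<String> sheets = Arrays.asList("Sheet1", "Sheet2");
        for (Mode mode : Mode.values()) {
            System.out.println(mode + " -> " + extract(workbook, sheets, mode));
        }
    }
}
```

Keeping the sheet selection and the output mode as separate parameters means the same selection can later be reused for the multi-file cases (Module 5) without changing how each mode serializes its result.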
Project ID: 8713967

About the project

10 proposals
Remote project
Active 9 yrs ago

Awarded to:
I have 8 years of experience in Java and J2EE and am ready to work on this project. The bid value is for Module 1 only, meaning a simple API that handles the three options you mentioned.
$250 USD in 4 days
4.9 (95 reviews)
6.7
10 freelancers are bidding on average $488 USD for this job
Hello, I am very interested in your project. I am an expert in C#, Java, scraping, databases, math, and algorithms. I understand your requirements exactly and am well experienced in this kind of work. Let's go ahead together; I would like to work with you on a continuing basis. Thanks
$750 USD in 7 days
4.9 (443 reviews)
8.5
A proposal has not yet been provided
$555 USD in 10 days
4.8 (47 reviews)
6.2
Hi sir, I am a scraping expert and have done many similar projects; please check my feedback and you will see. Can you tell me more details? Then I will provide demo data for you. Thanks, Kimi
$491 USD in 6 days
5.0 (61 reviews)
6.1
A proposal has not yet been provided
$500 USD in 15 days
4.8 (17 reviews)
4.2
I am an IITK graduate and a software professional with 9 years of experience. My team consists of top-notch developers with experience across a range of technologies, who have worked at leading tech organizations such as Amazon, Cisco, and Oracle. We have been involved in similar projects in the past, and our track record has been excellent.
$526 USD in 10 days
3.9 (22 reviews)
5.4
Hi, thank you for this wonderful opportunity. Your job post caught my attention because I am highly experienced in the required skills. You will find accuracy, quality, and professionalism in my work, and I would be glad to offer my services for your job. I have extensive knowledge of Apache, data processing, Java, etc. Please get in touch with me to discuss this further. I am mostly online on Yahoo Messenger, Google Talk, and Skype, where we can discuss the details if you are interested. I hope you will give me this chance. With thanks, Imtiaz Ikram Irfan
$333 USD in 10 days
0.0 (0 reviews)
0.0
I am interested in this job because I also want better feedback from buyers; my previous history is an example of my best work. I am able to provide the best service for you. If you give me a chance, I will work hard on your website. Please give me this job. I'm ready to start. Looking forward to hearing from you soon. With warm regards, Jashedul Islam Shaun
$333 USD in 10 days
0.0 (0 reviews)
0.0

About the client

United States
5.0
55
Payment method verified
Member since Jan 18, 2009