I need a script (in any language) to organize data from different files into one giant file. I have tables in xls and pdf files. I am not sure about the feasibility of extracting data from pdfs using a script. If this cannot be automated, then I will pay for the manual input by the provider. The detailed instructions and the data set are listed below as attachments. Please check them before bidding.
I am attaching a sample of the output file. I realized that quarter is not needed. Ignore the quarter field in the above docx.
I realize a small difference in how things are coded for the 1999-2002 pdfs, instead of Metropolitan County and Non-metropolitan County, it has Suburban Counties and Rural Counties. Let have two other fields called isSuburbanCounty and isRuralCounty to account for these years. Also, you can put missing values for pdfs from 1996 to 1998 for all the "isXXXX" fields, because they are not there.