Custom text connector for Apatar
$30-5000 USD
Paid on delivery
Apatar is a popular open-source ETL platform developed in Java. The platform is based on the application of customized plugins or “connectors?? meant to extract the data to be transformed from a specific type of source (e.g. text files, excel files, DBs).
A plugin or connector for the extraction of data from text files exists. However, the existing connector has several limitations. I would like to have a connector developed that overcomes specific limitations in terms of type of source (e.g. text file extension) file that can be read, interpretation of the data read, etc.
The new connector should:
* Be able to deal with custom separators for the fields in records (e.g. comma, colon, semicolon, tab, blank, pipe and almost any alphanumeric character)
* Be able to ignore any number of lines at the beginning and/or end of the source file
* Be able to interpret the fields in any record indicated (line of the file) as the name of the fields (columns)
* Be able to deal with mismatch between the number of fields according to the header and the number of fields in a record by either cropping, ignoring or concatenating fields
The code for the existing connector could (but does not have to) be used as a starting point for the development. Therefore, familiarity with Apatar would be an advantage.
Project ID: #3120239