Parallel Data Processing Comparison – CloverETL vs. Talend vs. Pentaho


On Oct. 21 OpenSys released a new version of its ETL tool, CloverETL Designer version 2.8.1. It's mainly bugfix version but also brings a new component, ParallelReader, that makes delimited data file (CSV) processing faster than ever before.[Continue reading]

ParallelReader Component: Performance Boost in Data Processing

In October release 2.8.1 of Clover we introduced a new component which definitely should attract your attention – the Parallel Reader. The name itself already suggests the goal of the component – improve reading speed by going parallel. The component is very similar to Universal Data Reader in function – it reads delimited flat files like CSV, tab delimited, etc. - much hasn't changed here. But the real difference comes from under the hood.[Continue reading]

Hidden Features: Using Mutable Delimiter for Data Parsing

CloverETL provides a very useful feature: mutable delimiter. When you parse a delimited file (eg. CSV) you can specify different delimiter for each field. This isn't surprising for daily CloverETL users however for users of other ETL tools it can be. It might not be very well known that in CloverETL you can even define more delimiters for one field (so called "mutable delimiter") and CloverETL chooses the right one. It reveals new ways of file processing with irregular structure in CloverETL. I believe this functionality isn't provided by any other ETL tool on the market. If I am wrong you can leave me a message in comments. I'm always happy to find "hidden features" of other ETL tools.[Continue reading]