ETL Basics

On sales2008-2011.csv data set a number of ETL operations are performed. Besides showing what ETL features are, the goal of this workflow is to move from a series of contracts with different customers in different countries to a one-row summary description for each one of the customers. The one-row description includes: the customer unique ID; the total amount of money payed by the customer to the company; the countries the customer has been active in; the date of the first contract (this is always useful to estimate the customer loyalty); and the number of days between the first and the last purchase, that is the number of days the customer has been with the company. At the end, each one-row customer summary information will be joined together with each contract data row from the original file and write the resulting table to a CSV file in a "data" folder located in the workflow folder.

ETL Basics

 

Resources

EXAMPLES Server: 02_ETL_Data_Manipulation/00_Basic_Examples/02_ETL_Basics02_ETL_Data_Manipulation/00_Basic_Examples/02_ETL_Basics*
Download a zip-archive

 

 


* Find more about the Examples Server here.
The link will open the workflow directly in KNIME Analytics Platform (requirements: Windows; KNIME Analytics Platform must be installed with the Installer version 3.2.0 or higher). In other cases, please use the link to a zip-archive or open the provided path manually