Databases

Binning in Databases

This workflow demonstrates the different database binner nodes that allow you to create new binning columns for numerical columns. An example would be the conversion of a numerical column that contains the age in years into a categorical column with values such as children and adults. The demonstrated nodes are the Database Auto-Binner node that automatically creates the bin boundaries e.g.

Sampling in Databases

This workflow creates an SQL statement that allows you to extract a sample of data from a database. The node also supports stratified sampling, which is the preferred way to sample from populations with varying subpopulation sizes.

Sampling in Databases

 

MSAccess meets H2

This workflow blends data between two different relational DBMS in KNIME Analytics Platform. It focuses on two common relational database systems, i.e. MS Access and H2. Why these two? Both are relatively easy to use, and used mainly in individual departments or small-to-medium business sectors.
The data is a standard baseball encyclopedia and it is available in text form, in .csv file, and in relational database format. Data are available at http://www.seanlahman.com/baseball-archive/statistics/.

Database Jam Session

This workflow jams together data from not 1, not 2, but from 6 databases! That is: MySQL, MongoDB, MS SQL Server, MariaDB, Oracle, PostgreSQL. The use case is a Next Best Offer, modelling the likelihood of a customer to buy a second product. This workflow is a variation of the workflow we build together at courses on KNIME Analytics Platform.

Teradata Aster meets KNIME Table

Today's challenge is to blend the data between a Teradata Aster database and a KNIME table in the KNIME Analytics Platform. Why these two? Teradata Aster is a database system in use at many companies around the world, and KNIME tables are an easy way to store and access models built in other KNIME workflows. The data is from a collection of open-source heart disease data sets available in .txt format. They are available at http://archive.ics.uci.edu/ml/machine-learning-databases/heart-disease/ .

Subscribe to Databases