KNIME general.

Exploring a Chemistry Ontology with KNIME

Mon, 03/23/2020 - 10:00 admin

Author: Martyna Pawletta (KNIME)

We are often asked if it’s possible to work with ontologies in KNIME Analytics Platform.

Exploring Chemistry Ontologies with KNIME

With “work with ontologies” people can mean many different things but let's focus today on one particular ontology and basic tasks including reading and querying ontologies to create an interactive tool at the end. For this purpose, today, we dive into the world of chemistry to use the ChEBI ontology (Chemical Entities of Biological Interest). 

Time Series Analysis with Components

Mon, 03/09/2020 - 16:00 admin

Authors: Daniele Tonini, Maarit Widmann, Corey Weisinger

"Considering the plethora of articles, applications, web tutorials and challenges on the data science subject that we’re seeing in the last 3-5 years, it can be pretty surprising to find only a few of them dedicated to time series analysis and forecasting. We’re living in the golden era of data analytics, with plenty of data and algorithms of any kind... but topics like deep learning, artificial intelligence and NLP are attracting basically all of the attention of the practitioners, while the concept of Time Series forecasting is often neglected.

Data Visualizaton 101: Five Easy Plots to Get to Know Your Data

Thu, 03/05/2020 - 16:00 admin

Here are five different methods of sharing your data analysis with key stakeholders

Author: Paolo Tamagnini (KNIME). As first published in DevPro Journal.

There are many different scenarios when building a data science workflow. No matter how complex the data analysis, every data scientist needs to deal with an important final step: communicating their findings to the different stakeholders — decision-makers, managers, or clients. This final step is vital because if the findings cannot be understood, trusted or valued, then the entire analysis will be discarded and forgotten.

From a Single Decision Tree to a Random Forest

Thu, 02/27/2020 - 10:00 admin

Authors: Kathrin Melcher, Rosaria Silipo (KNIME). As first published in Dataversity.

Decision trees represent a set of very popular supervised classification algorithms. They are very popular for a few reasons: They perform quite well on classification problems, the decisional path is relatively easy to interpret, and the algorithm to build (train) them is fast and simple.

Tuning the Performance and Scalability of KNIME Workflows

Mon, 02/24/2020 - 10:00 admin

Authors: Iris Adä and Phil Winters (KNIME)

Tuning Performance and Scalability of KNIME Workflows

Want a workflow that uses available in-DB capabilities and moves to a production Spark setup? At the same time it should use special Google services before comparing a KNIME Random Forest to an H2O Random Forest and then automatically choose the correct model to create data that are automatically added to your favourite CRM - so that the new score is placed back into the CRM? No problem in KNIME.

Three New Techniques for Data Dimensionality Reduction in Machine Learning

Thu, 02/20/2020 - 10:00 admin

Authors: Maarit Widmann and Rosaria Silipo (KNIME). As first published in The New Stack.

The full big data explosion has convinced us that more is better. While it is of course true that a large amount of training data helps the machine learning model to learn more rules and better generalize to new data, it is also true that an indiscriminate addition of low-quality data and input features might introduce too much noise and, at the same time, considerably slow down the training algorithm.

Subscribe to KNIME Blog: general