This workflow is part of a number of other workflows that address a data mining scenario at the intersection of active learning, text mining, stream mining and service-oriented knowledge discovery architectures.
This workflow, in particular, allows to create a subset of the training set based on the most uncertain predicted classes.
It first read the entire training set. Then, it processes the questions and it predicts the class for each one of those. The loop body allows to compute the differences between the three top probabilities for each predicted class of each question. Finally, a subset of the entire training set is created based on the most uncertain predicted class and saved as new table.
EXAMPLES Server: 50_Applications/33_Emil_the_TeacherBot/03_AL_Training_Subset_Uncertain_Classes50_Applications/33_Emil_the_TeacherBot/03_AL_Training_Subset_Uncertain_Classes*
Download a zip-archive
* Find more about the Examples Server here.
The link will open the workflow directly in KNIME Analytics Platform (requirements: Windows; KNIME Analytics Platform must be installed with the Installer version 3.2.0 or higher). In other cases, please use the link to a zip-archive or open the provided path manually