Fun with Tags

Automated access to disease information is an important goal of information extraction and text mining efforts. Here, we want to create a model that learns disease names in a set of documents from biomedical literature. We will automatically extract literature from PubMed and use these documents to train our model on an initial set of disease names (the dictionary). We score the resulting model and check if we can extract new information by comparing the detected disease names to our initial set. Subsequently, we interactively inspect the diseases that co-ooccur in the same documents by a network approach and look into genetic information associated with these diseases.

Fun with Tags

 

Resources

EXAMPLES Server: 08_Other_Analytics_Types/02_Chemistry_and_Life_Sciences/03_Fun_with_Tags08_Other_Analytics_Types/02_Chemistry_and_Life_Sciences/03_Fun_with_Tags*
Download a zip-archive

Blog:

 

 


* Find more about the Examples Server here.
The link will open the workflow directly in KNIME Analytics Platform (requirements: Windows; KNIME Analytics Platform must be installed with the Installer version 3.2.0 or higher). In other cases, please use the link to a zip-archive or open the provided path manually