Disease Tagging in Biomedical Literature

Reduce time spent sifting through medical literature with automatic disease tagging.

knime_icons_rz View workflow on KNIME Hub

The Challenge

Biomedical literature is a hive of valuable information on research topics like diseases, drug/treatment attributes, medical decisions, health effects, population data and epidemiology, and more. With advances in technology, there is a rapid growth in the amount of this literature - making it impossible for researchers and practitioners alone to exhaust all of this valuable information.

knime_icons_rz Our Solution

With KNIME Software, mining knowledge from text such as disease-related information can be automated. An analytics expert creates a workflow in KNIME Analytics Platform, which contains a model that learns disease names from a set of documents in the biomedical literature. The trained model is then deployed to the KNIME WebPortal via KNIME Server. Here, with the predetermined interaction points, researchers can interactively inspect the diseases that co-occur in the same documents and explore genetic information associated with these diseases.

Why KNIME Software

StanfordNLP nodes in the KNIME Textprocessing Extension (within KNIME Analytics Platform) facilitate building and evaluating the model. This extension also offers nodes for analyzing the results - for example Term Co-Occurrence Counter to investigate co-occurring diseases. Networking Mining nodes make it possible to visualize and analyze results and KNIME Server makes these interactive results accessible to researchers and domain experts.

Explore KNIME

knime_icons_rz Download

This Innovation Note is available for sharing as a PDF.

Download now

knime_icons_rz KNIME for Life Sciences

Access, transform, and interact with large amounts of life science data.

Learn more

Contact us

For information on KNIME Software and what it can do for you.

Contact us