09 Sep 2019longoka

Author: Kenneth Longo

The cheminformatics world is replete with software tools and file formats for the design, manipulation and management of small molecules and libraries thereof. Those tools and formats are often specialized in analyzing small molecules of ~500 daltons, give or take a few, or those molecules that can reasonably be drawn and understood using classic ball-and-stick or molecular coordinate frameworks. Perhaps not coincidentally, this neatly envelops the needs of small molecule drug discovery, where it is not uncommon to find both public and privately-held repositories of hundreds of thousands (to millions) of such molecules, for use in molecular or phenotypic screening assays. The small size and elemental simplicity of these molecules has resulted in a variety of storage file formats (e.g., mol, SMILES, sdf, etc) and many supporting software packages (e.g., RDkit, CDK, ChemAxon, etc) for visualization and manipulation that support them. KNIME Analytics Platform provides easy access to those file formats and software packages.

Read more


02 Sep 2019admin

Recently on social media we asked you for tips on tidying up and improving workflows. Our aim was to find out how you declutter to make your workflows not just superficially neater, but faster, more efficient, and smaller: ultimately an elegant masterpiece! Check out the original posts on LinkedIn and Twitter.

Declutter - Four Tips for an Efficient, Fast Workflow
Fig. 1 From confusion to clarity - decluttering your workflow

Read more


29 Jul 2019admin

Using Okta to Modernize LDAP

Author: James Weakley

We’d like to introduce James Weakley, a Data Architect at nib health funds, who recently wrote a short blog post on the topic of KNIME Server and Okta. James has given us permission to republish it here. But first a few words about James.

Read more


22 Jul 2019berthold

The data science dilemma: Automation, APIs, or custom data science?

As companies place an increasing premium on data science, there is some debate about which approach is best to adopt — and there is no straight up, one-size-fits-all answer. It really depends on your organization’s needs and what you hope to accomplish.

There are three main approaches that have been discussed over the past couple of years; it’s worth taking a look at the merits and limitations of each as well as the human element involved. After all, knowing the capabilities of your team and who you’re attempting to serve with data science influences heavily how to implement it.

Read more


15 Jul 2019paolotamag

Authors: Paolo Tamagnini & Christian Dietz

Where To Get Answers to Your Data Science Questions?

When I start a new data science project with KNIME Analytics Platform, there are always a few questions I need to ask myself before I even pull in a single node to my blank workbench.

  • “Can I train this kind of a model in KNIME?”
  • “Which KNIME nodes will I need for this task?”
  • “Has anyone else put together a use case like this with KNIME before?”
  • “Can I download any KNIME workflows as inspiration?” 

To answer all these questions, all I need to do is ask the KNIME Hub. The KNIME Hub has been available at hub.knime.com since March 2019 but many new features have now been added with the release of KNIME Analytics Platform 4.0.

Read more


08 Jul 2019Redfield

In this blog series we’ll be experimenting with the most interesting blends of data and tools. Whether it’s mixing traditional sources with modern data lakes, open-source devops on the cloud with protected internal legacy tools, SQL with noSQL, web-wisdom-of-the-crowd with in-house handwritten notes, or IoT sensor data with idle chatting, we’re curious to find out: will they blend? Want to find out what happens when IBM Watson meets Google News, Hadoop Hive meets Excel, R meets Python, or MS Word meets MongoDB?

Read more


27 Jun 2019michael.berthold
With this release we are continuing our progress toward a community oriented data science platform, adding lots of functionality that enables easier sharing with the KNIME Community. Most noticeably, of course, the KNIME Hub itself but there are also a number of changes in KNIME Analytics Platform making sharing and collaborating with the community easier...

Read more


17 Jun 2019Maarit

Authors: Maarit Widmann (KNIME) and Alfredo Roccato (Data Science Trainer and Consultant)

Wheeling like a hamster in the data science cycle? Don’t know when to stop training your model?

Model evaluation is an important part of a data science project and it’s exactly this part that quantifies how good your model is, how much it has improved from the previous version, how much better it is than your colleague’s model, and how much room for improvement there still is.

In this series of blog posts, we review different scoring metrics: for classification, numeric prediction, unbalanced datasets, and other similar more or less challenging model evaluation problems.

Read more


11 Jun 2019Lukasa

In this blog series we’ll be experimenting with the most interesting blends of data and tools. Whether it’s mixing traditional sources with modern data lakes, open-source devops on the cloud with protected internal legacy tools, SQL with noSQL, web-wisdom-of-the-crowd with in-house handwritten notes, or IoT sensor data with idle chatting, we’re curious to find out: will they blend? Want to find out what happens when IBM Watson meets Google News, Hadoop Hive meets Excel, R meets Python, or MS Word meets MongoDB?

Read more


03 Jun 2019admin

Author: Brendan Doherty, Seagate Technology

In this blog post I will discuss some of the processes and steps that were taken on the journey to embed KNIME in a High Volume Manufacturing Environment within Seagate Technology.

Seagate Technology are one of the world's largest manufacturers of electronic data storage technologies and solutions. Seagate Technology creates products and services that include network attached storage, high performance computing, data protection appliances, internal hard drives, backup and recovery services, flash storage, and related solutions. They are a vertically integrated company and have manufacturing plants based in many locations worldwide. The read/write heads for the HDDs are manufactured in two locations, one of which is in Derry City, Northern Ireland. These devices are highly complex, have a long manufacturing cycle time, and generate a lot of data during their fabrication. The plant has many different groups located at the site all of which use data from a wide variety of sources on a daily basis.

Read more


Subscribe to KNIME news, usage, and development