KNIME Integrations

Integrate Big Data, Machine Learning, AI, Scripting, and more.
KNIME Integrations
KNIME Integrations

Open source integrations for KNIME Analytics Platform (also developed and maintained by KNIME), provide seamless access to large open source projects such as Keras for deep learning, H2O for high performance machine learning, Apache Spark for big data processing, Python and R for scripting, and more.

Big Data
Big Data

Conduct predictive analytics and scoring on Apache Spark using PMML models and integrate complex statistics and machine learning with SparkML or H2O Sparkling Water. Visual programming allows code-free, big data science, while scripting of jobs allows detailed control when needed.

Import, export, and access data with Hive, Impala, H2, HDFS, or KNIME Analytics Platform.

Mix and match local and Hadoop workflow executions within the same workflow.

Add PySpark jobs to your existing Spark workflow. Our Python editor allows code validation directly within the cluster for rapid prototyping.


R and Python Scripting
R and Python Scripting

Add custom functionality ​with native R, Python (versions 2 and 3), and Java scripting capabilities - from custom Apache Spark jobs, to visualizations or advanced analytics, and machine learning.

Run scripts seamlessly in combination with other KNIME nodes within a single workflow. Document individual steps, allowing for large scale deployment.

Import and run code from Jupyter notebooks - your code can stay in Jupyter but still be used from within your KNIME workflows. 


H2O Machine Learning
H2O Machine Learning

Take advantage of H2O machine learning and choose from a variety of high performance algorithms (Gradient Boosted Trees, Generalized Linear Models, Random Forest etc).

Train and validate ​models in H2O using data partitioners, cross validation, binomial, and multinomial scoring.

Scale execution with H2O Sparkling Water. Seamlessly combine H2O nodes with KNIME Extension for Apache Spark.

Integrate with existing KNIME nodes for data prep and cleansing, visualization, or hyperparameter optimization, combining them directly with H2O functionality.

Deep Learning
Deep Learning

Load, create, edit, train, and execute deep neural networks within KNIME Analytics Platform.

Access a variety of cutting edge deep learning frameworks, such as TensorFlow or CNTK via Keras, or Tensorflow directly.

Create and train deep network architectures without writing a single line of code using the KNIME Keras Integration.

Fine tune trained networks to your analysis problem. A rich variety of unstructured (text, images, etc) and structured data types can directly be used for training and prediction.

Google Sheets
Google Drive Connectivity

Read from and write to both “My Drive” and “Team Drive” and use files you’ve stored in Google Drive. Access data from a Google Sheet, write information to new sheets, or modify existing sheets.

Carry out various tasks ​such as reading or adding headers, substituting missing values, and automatically opening Google Sheets.

Log in directly from the node configuration​ or provide credential files (if preferred).

And More...
And more...

Create custom JavaScript visualizations​, utilizing state of the art visualization libraries, for example D3.

Connect to Azure or AWS​ and work with your cloud data, stored in S3 or Azure Blobstore.

Search for Tweets on Twitter​, retrieve information about users, Tweet directly via KNIME, and more.

Visualize geo-spatial information ​with open street maps.

Integrate XGBoost’s Linear Ensemble or Tree Ensemble learners for either classification or regression in your KNIME workflows.

Get started with KNIME
Download KNIME Analytics Platform and get started on your first workflow.
Installing Extensions
Learn how to install extensions in KNIME Analytics Platform.
Join the KNIME community
Create a user profile and start a conversation with our active, global community.