Modularized Spark Scripting

This workflow demonstrates how to use the different Spark Java Snippet nodes to read a text file from HDFS, parse it, filter it, and write the result back to HDFS.
You might also want to look at the snippet templates that each node provides. To do so, open the configuration dialog of a Spark Java Snippet node and go to the Templates tab.
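
The read, parse, filter, and write steps described above can be sketched as small, separately testable stages. The following is a minimal illustration in plain Java and is not the code contained in the workflow's snippet nodes: it uses `java.util.stream` on an in-memory list as a stand-in for a Spark RDD, and the `Entry` type, the "name,value" line layout, and the threshold are hypothetical. In a Spark job, the same stage functions would typically be passed to `JavaRDD.map` and `JavaRDD.filter`, with `JavaSparkContext.textFile` and `JavaRDD.saveAsTextFile` handling the HDFS input and output.

```java
import java.util.Arrays;
import java.util.List;
import java.util.stream.Collectors;

public class ModularPipeline {

    /** Hypothetical record type: one "name,value" line of the input file. */
    static final class Entry {
        final String name;
        final int value;

        Entry(String name, int value) {
            this.name = name;
            this.value = value;
        }
    }

    /** Parse stage: split a raw text line into a typed entry. */
    static Entry parse(String line) {
        String[] parts = line.split(",");
        return new Entry(parts[0].trim(), Integer.parseInt(parts[1].trim()));
    }

    /** Filter stage: keep entries whose value is at least the threshold. */
    static boolean keep(Entry entry, int threshold) {
        return entry.value >= threshold;
    }

    /** Format stage: turn an entry back into an output line. */
    static String format(Entry entry) {
        return entry.name + "," + entry.value;
    }

    /** Chains the stages; a local list stands in for the lines read from HDFS. */
    static List<String> run(List<String> lines, int threshold) {
        return lines.stream()
                .map(ModularPipeline::parse)
                .filter(entry -> keep(entry, threshold))
                .map(ModularPipeline::format)
                .collect(Collectors.toList());
    }

    public static void main(String[] args) {
        // Assumed sample input; the real workflow reads its lines from HDFS.
        List<String> input = Arrays.asList("a, 1", "b, 5", "c, 10");
        System.out.println(run(input, 5));
    }
}
```

Keeping each stage in its own function mirrors the idea of one snippet node per step: a stage can be changed or replaced without touching the others.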

Resources

EXAMPLES Server: 10_Big_Data/02_Spark_Executor/06_Modularized_Spark_Scripting*
Download a zip-archive

* Find more about the Examples Server here.
The link opens the workflow directly in KNIME Analytics Platform (requirements: Windows; KNIME Analytics Platform must be installed with the installer, version 3.2.0 or higher). Otherwise, use the link to the zip-archive or open the provided path manually.