Using Pentaho With Apache Ignite
Overview
Pentaho is a comprehensive platform that provides the ability to extract, transform, visualize, and analyze your data easily. Pentaho Data Integration uses the Java Database Connectivity (JDBC) API in order to connect to your database.
Apache Ignite is shipped with its own implementation of the JDBC driver which makes it possible to connect to Ignite from the Pentaho platform and analyze the data stored in a distributed Ignite cluster.
Installation and Configuration
-
Download and Install Pentaho platform. Refer to the official Pentaho documentation.
-
After Pentaho is successfully installed, you will need to install the Apache Ignite JDBC Driver using the JDBC Distribution Tool. To do so, download Apache Ignite and locate
{apache-ignite}/libs/ignite-core-2.16.0.jar
and copy the file to the{pentaho}/jdbc-distribution
directory. -
Open a command line tool, navigate to the
{pentaho}/jdbc-distribution
directory and run the following script./distribute-files.sh ignite-core-2.16.0.jar
Ignite JDBC Driver Setup
The next step is to set up the JDBC driver and connect to the cluster. Below you will find the minimal number of actions that need to be taken. Refer to the JDBC Thin Driver documentation for the more details.
-
Open your command line tool, go to the
{pentaho}/design-tools/data-integration
directory and launch Pentaho Data Integration using the./spoon.sh
script. -
Once the screen like the one below appears, click on the
File
menu option and create a new transformation -New → Transformation
-
You can create a new Database Connection using setting the following parameters in Pentaho:
Pentaho Property Name Value Connection Name
Set some custom name such as
IgniteConnection
Connection Type
Select the
Generic database
option.Access
Select the
Native (JDBC)
option.Custom Connection URL
jdbc:ignite:thin://localhost:10800
or the real address of a cluster node to connect to.Custom Driver Class Name
org.apache.ignite.IgniteJdbcThinDriver
-
Click the
Test
button to check that the connection can be established:
Data Querying and Analysis
Once the connection between Ignite and Pentaho is established , you can query, transform, and analyze the data in a variety of ways supported by Pentaho. For more details, refer to the official Pentaho documentation.
Apache, Apache Ignite, the Apache feather and the Apache Ignite logo are either registered trademarks or trademarks of The Apache Software Foundation.