October 20th, Q&A session: Get you issues solved and questions answered!

GitHub logo
Edit

Using Pentaho With Apache Ignite

Overview

Pentaho is a comprehensive platform that provides the ability to extract, transform, visualize, and analyze your data easily. Pentaho Data Integration uses the Java Database Connectivity (JDBC) API in order to connect to your database.

Apache Ignite is shipped with its own implementation of the JDBC driver which makes it possible to connect to Ignite from the Pentaho platform and analyze the data stored in a distributed Ignite cluster.

Installation and Configuration

  • Download and Install Pentaho platform. Refer to the official Pentaho documentation.

  • After Pentaho is successfully installed, you will need to install the Apache Ignite JDBC Driver using the JDBC Distribution Tool. To do so, download Apache Ignite and locate {apache-ignite}/libs/ignite-core-2.9.0.jar and copy the file to the {pentaho}/jdbc-distribution directory.

  • Open a command line tool, navigate to the {pentaho}/jdbc-distribution directory and run the following script ./distribute-files.sh ignite-core-2.9.0.jar

Ignite JDBC Driver Setup

The next step is to set up the JDBC driver and connect to the cluster. Below you will find the minimal number of actions that need to be taken. Refer to the JDBC Thin Driver documentation for the more details.

  1. Open your command line tool, go to the {pentaho}/design-tools/data-integration directory and launch Pentaho Data Integration using the ./spoon.sh script.

  2. Once the screen like the one below appears, click on the File menu option and create a new transformation - New → Transformation

    Pentaho New Transformation
  3. You can create a new Database Connection using setting the following parameters in Pentaho:

    Pentaho Property Name Value

    Connection Name

    Set some custom name such as IgniteConnection

    Connection Type

    Select the Generic database option.

    Access

    Select the Native (JDBC) option.

    Custom Connection URL

    jdbc:ignite:thin://localhost:10800 or the real address of a cluster node to connect to.

    Custom Driver Class Name

    org.apache.ignite.IgniteJdbcThinDriver

  4. Click the Test button to check that the connection​ can be established:

    Pentaho Ignite Connection

Data Querying and Analysis

Once the connection between Ignite and Pentaho is established , you can query, transform, and analyze the data in a variety of ways supported by Pentaho. For more details, refer to the official Pentaho documentation.

Pentaho Running Queries