Posted and edited by Erick Ramirez

HOW TO - Connect to Astra DB from Pentaho Data Integration


This article provides the steps for connecting to Astra DB from Pentaho Data Integration (PDI), also known as "Spoon" and formerly as Kettle.


Prerequisites

This article assumes you have installed Pentaho Data Integration on your laptop or PC. It was written for version 9.1 on macOS, but it should also work for the Windows version.

You will also need to generate an application token and download the secure bundle for your Astra DB.



Download the JDBC driver from the DataStax website:

  1. Go to the DataStax downloads page.
  2. Select Simba JDBC Driver for Apache Cassandra.
  3. Select JDBC 4.2.
  4. Read the license terms and accept them (click the checkbox).
  5. Hit the blue Download button.
  6. Once the download completes, unzip the downloaded file.
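
After unzipping, the driver JAR can be located from the command line. The following is a minimal sketch, assuming the archive was extracted somewhere under a directory you pass in. The function name `find_simba_jar` is hypothetical; only the JAR file name `CassandraJDBC42.jar` comes from this article.

```shell
#!/usr/bin/env bash
# Hypothetical helper: print the path of the first CassandraJDBC42.jar
# found under the given directory (defaults to the current directory).
find_simba_jar() {
  local search_dir="${1:-.}"
  local jar
  jar="$(find "$search_dir" -type f -name 'CassandraJDBC42.jar' 2>/dev/null | head -n 1)"
  if [ -z "$jar" ]; then
    echo "CassandraJDBC42.jar not found under $search_dir" >&2
    return 1
  fi
  printf '%s\n' "$jar"
}
```

For example, `find_simba_jar ~/Downloads` prints the path to the JAR if it exists anywhere below your downloads folder.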


Deploy the Simba driver to Pentaho servers using the distribution tool:

  1. On your laptop or PC, copy the Simba JAR to the JDBC distribution directory:
    $ cp CassandraJDBC42.jar pentaho/jdbc-distribution/
  2. Run the distribution tool (distribute-files.sh on macOS, distribute-files.bat on Windows):
    $ cd /Applications/Pentaho/jdbc-distribution
    $ ./distribute-files.sh CassandraJDBC42.jar
  3. Verify that the JAR has been copied to the PDI library:
    $ cd /Applications/Pentaho
    $ ls -lh design-tools/data-integration/lib/CassandraJDBC42.jar
    -rw-r--r--  1 erick  vaxxed   16M 14 Sep 22:18 design-tools/data-integration/lib/CassandraJDBC42.jar
    $ file design-tools/data-integration/lib/CassandraJDBC42.jar
    design-tools/data-integration/lib/CassandraJDBC42.jar: Java archive data (JAR)
  4. Restart Pentaho on your workstation for the Simba driver to be loaded.
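
The copy, distribute, and verify steps above could be wrapped in a single function. This is only a sketch under stated assumptions: Pentaho is installed under `/Applications/Pentaho` as in the commands above, and the macOS/Linux distribution script is named `distribute-files.sh` (inferred from the Windows `distribute-files.bat`). The function name `deploy_simba_jar` is hypothetical.

```shell
#!/usr/bin/env bash
# Sketch: deploy the Simba JAR into a Pentaho installation.
# Assumptions (adjust to your setup): the directory layout shown in the
# steps above, and a distribution script named distribute-files.sh.
deploy_simba_jar() {
  local jar_path="$1"
  local pentaho_home="${2:-/Applications/Pentaho}"
  local jar_name
  jar_name="$(basename "$jar_path")"
  local dist_dir="$pentaho_home/jdbc-distribution"
  local pdi_lib="$pentaho_home/design-tools/data-integration/lib"

  [ -f "$jar_path" ] || { echo "JAR not found: $jar_path" >&2; return 1; }
  [ -d "$dist_dir" ] || { echo "Missing directory: $dist_dir" >&2; return 1; }

  # Step 1: copy the JAR to the JDBC distribution directory.
  cp "$jar_path" "$dist_dir/" || return 1

  # Step 2: run the distribution tool from inside that directory.
  ( cd "$dist_dir" && ./distribute-files.sh "$jar_name" ) || return 1

  # Step 3: verify that the JAR reached the PDI library.
  if [ -f "$pdi_lib/$jar_name" ]; then
    echo "Deployed: $pdi_lib/$jar_name"
  else
    echo "JAR not found in $pdi_lib after distribution" >&2
    return 1
  fi
}
```

Calling `deploy_simba_jar ./CassandraJDBC42.jar` covers steps 1 to 3; restarting Pentaho (step 4) remains a manual action.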


Connect to your Astra DB in PDI:

  1. Create a new Transformation.
  2. Open a new Database Connection dialog box.
  3. In the Connection name field, give your DB connection a name.
  4. Under Connection type, select Generic database.
  5. Set the Custom connection URL field to the JDBC URL for your database.
    Note that you will need to specify the full path to your secure bundle in the URL.
  6. Set the Custom driver class name field to com.simba.cassandra.jdbc42.Driver.
  7. In the Username field, enter the string token.
  8. In the Password field, paste the value of the token you created in the Prerequisites section above. The token looks like AstraCS:AbC...XYz:123...edf0.


  9. Click on the Test Connection button to confirm that the driver configuration is working.


  10. Click on the OK button to save the connection settings.
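
The values entered in steps 5 to 8 can be assembled and sanity-checked in a terminal before pasting them into the dialog. The sketch below resolves the secure bundle to a full path and checks the token prefix. The URL format used here (`jdbc:cassandra://;AuthMech=2;SecureConnectionBundlePath=...`) is an assumption based on the Simba driver's connection properties, so check it against the driver documentation. Both function names are hypothetical.

```shell
#!/usr/bin/env bash
# Sketch: build a JDBC URL for the Custom connection URL field.
# Assumption: the Simba driver accepts AuthMech=2 together with
# SecureConnectionBundlePath; verify this against the driver documentation.
build_astra_jdbc_url() {
  local bundle="$1"
  [ -f "$bundle" ] || { echo "Secure bundle not found: $bundle" >&2; return 1; }
  # The URL needs the full (absolute) path to the secure bundle.
  local bundle_dir
  bundle_dir="$(cd "$(dirname "$bundle")" && pwd)" || return 1
  printf 'jdbc:cassandra://;AuthMech=2;SecureConnectionBundlePath=%s/%s\n' \
    "$bundle_dir" "$(basename "$bundle")"
}

# Check that a value starts with the AstraCS: prefix of an application token.
looks_like_astra_token() {
  case "$1" in
    AstraCS:*) return 0 ;;
    *) return 1 ;;
  esac
}
```

For example, `build_astra_jdbc_url ./secure-connect-mydb.zip` (a placeholder file name) prints a URL containing the absolute path of the bundle, and `looks_like_astra_token "$TOKEN"` succeeds only when the value starts with `AstraCS:`.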

Final test

Connect to your Astra DB by launching the SQL Editor in Pentaho and running a simple CQL statement.

You should also be able to browse the keyspaces in your Astra DB using the Database Explorer.


Amar@Itedu commented:

Very nicely explained.



Erick Ramirez contributed to this article