Install rJava and RJDBC libraries

Learn how to install rJava and RJDBC libraries on your Databricks cluster.

Written by Adam Pavlacka

Last published at: December 22nd, 2022

This article explains how to install the rJava and RJDBC libraries.

Problem

You try to install the rJava and RJDBC libraries by running the following command in a notebook cell:

%r

install.packages(c("rJava", "RJDBC"))

The installation fails with the following error:

ERROR: configuration failed for package 'rJava'

Cause

The rJava and RJDBC packages check for Java dependencies and file paths that are not present in the Databricks R directory.
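
To see which Java paths are missing on a given cluster, a small check along the following lines may help. This is only a sketch: `report_path` is a hypothetical helper, and the two paths are the ones used in the Solution steps of this article, which may differ on other runtime versions. In a notebook, the commands would go in a %sh cell.

```shell
# report_path is a hypothetical helper, not part of Databricks or R.
# It prints whether the given path exists on the driver node.
report_path() {
    if [ -e "$1" ]; then
        echo "present: $1"
    else
        echo "missing: $1"
    fi
}

# Paths taken from the Solution steps; adjust them if your JDK lives elsewhere.
report_path /usr/lib/jvm/java-8-openjdk-amd64
report_path /usr/lib/jvm/default-java
```

If the second path is reported as missing while the first is present, the symlink step in the Solution is the likely fix.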

Solution

Follow the steps below to install these libraries on running clusters.

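  1. Run the following commands in a %sh cell.
    %sh
    
    # Show where the java binary and its alternatives link point.
    ls -l /usr/bin/java
    ls -l /etc/alternatives/java
    # Link the Java 8 JDK to /usr/lib/jvm/default-java.
    ln -s /usr/lib/jvm/java-8-openjdk-amd64 /usr/lib/jvm/default-java
    # Update R's Java configuration.
    R CMD javareconf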
  2. Install the rJava and RJDBC packages.
    %r
    
    install.packages(c("rJava", "RJDBC"))
  3. Verify that the rJava package is installed.
    %r
    
    dyn.load('/usr/lib/jvm/java-8-openjdk-amd64/jre/lib/amd64/server/libjvm.so')
    library(rJava)
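
The plain `ln -s` in step 1 reports an error if `/usr/lib/jvm/default-java` already exists, for example when the cell is run a second time. The sketch below shows one way to make that step repeatable. `link_default_java` is a hypothetical helper name, and the approach of replacing an existing link with `ln -sfn` is an assumption about what you want, not something the steps above prescribe.

```shell
# link_default_java TARGET LINK
# Hypothetical helper: points LINK at the JDK directory TARGET.
# Safe to run more than once; an existing symlink is replaced.
link_default_java() {
    target="$1"
    link="$2"
    if [ ! -d "$target" ]; then
        echo "JDK directory not found: $target" >&2
        return 1
    fi
    # Refuse to touch a real file or directory that is not a symlink.
    if [ -e "$link" ] && [ ! -L "$link" ]; then
        echo "Refusing to replace non-symlink: $link" >&2
        return 1
    fi
    # -f replaces an existing link; -n stops ln from descending into
    # the directory that an existing link points to.
    ln -sfn "$target" "$link"
}
```

In a %sh cell, calling `link_default_java /usr/lib/jvm/java-8-openjdk-amd64 /usr/lib/jvm/default-java` and then `R CMD javareconf` would mirror step 1 while tolerating repeated runs.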