Skip to content

Install and configure Apache Sqoop

  1. Download, unpack and move
  2. Setting up the environment variable
  3. Configure Sqoop with Hadoop
  4. Get JDBC required for your database
  5. Verifying Sqoop

Apache Sqoop(TM) is a tool designed for efficiently transferring bulk data between Apache Hadoop and structured datastores such as relational databases. Unfortunately, this project has retired and was moved into the Attic in 2021-06.

Last update on Apache Sqoop page was in 2019-01-18.


Download, unpack and move


Setting up the environment variable

Run text editor

and paste

Activate the environment variables with the following command:


Configure Sqoop with Hadoop

Move to Sqoop config directory and copy the template file using the following command:

Open sqoop-env.sh:

and edit the following lines:


Get JDBC required for your database

Download JDBC required for your database - in my case it is PostgreSQL driver, and move jar file to a correct location:


Verifying Sqoop