Introduction to Apache Sqoop


Sqoop is a tool which can transfer bulk data from a relational database to Hadoop and vise-versa. For better performance and optimal system utilization it does parallel data transfer and load balancing among the nodes. It can read/write data from/to Oracle, Teradata, Netezza, MySQL, Postgres, and HSQLDB. While importing the data to hdfs it can save the data in different format e.g. ORC, Avro, Parquet etc.