Sqoop is a tool that transfers bulk data from a relational database to Hadoop
and vice versa. To improve performance and make good use of the cluster, it
transfers data in parallel and spreads the load across the nodes. It can read
from and write to Oracle, Teradata, Netezza, MySQL, PostgreSQL, and HSQLDB.
When importing data into HDFS, it can save the data in different formats,
e.g. ORC, Avro, Parquet, etc.
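To make the parallel import and the choice of output format more concrete, here is a minimal sketch in Python that only assembles a `sqoop import` command line; it does not run it. The helper name `build_sqoop_import`, the JDBC URL, the table, the paths, and the split column are all hypothetical placeholders, not values from this post. The flags themselves (`--connect`, `--table`, `--target-dir`, `--split-by`, `--num-mappers`, `--as-parquetfile`, and so on) are standard Sqoop 1 import options. ORC is left out of this sketch.

```python
import shlex

# Sqoop 1 import flags for the file formats covered by this sketch.
FORMAT_FLAGS = {
    "text": "--as-textfile",
    "sequence": "--as-sequencefile",
    "avro": "--as-avrodatafile",
    "parquet": "--as-parquetfile",
}


def build_sqoop_import(jdbc_url, username, password_file, table,
                       target_dir, split_by, num_mappers=4,
                       file_format="parquet"):
    """Return the argument list for a `sqoop import` run (hypothetical helper)."""
    if file_format not in FORMAT_FLAGS:
        raise ValueError(f"unsupported format: {file_format}")
    if num_mappers < 1:
        raise ValueError("num_mappers must be at least 1")
    return [
        "sqoop", "import",
        "--connect", jdbc_url,
        "--username", username,
        "--password-file", password_file,   # file holding the DB password
        "--table", table,
        "--target-dir", target_dir,         # HDFS output directory
        "--split-by", split_by,             # column used to partition the work
        "--num-mappers", str(num_mappers),  # number of parallel map tasks
        FORMAT_FLAGS[file_format],
    ]


# Example values below are assumptions for illustration only.
cmd = build_sqoop_import(
    jdbc_url="jdbc:mysql://db.example.com/sales",
    username="etl_user",
    password_file="/user/etl/.db_password",
    table="orders",
    target_dir="/data/raw/orders",
    split_by="order_id",
    num_mappers=8,
    file_format="parquet",
)
print(shlex.join(cmd))
```

Each mapper imports one slice of the table, split on the `--split-by` column, which is how the transfer is parallelized across the nodes. Raising `--num-mappers` increases parallelism but also the number of concurrent connections to the source database, so the right value depends on what that database can handle.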