May 2017
Beginner to intermediate
596 pages
15h 2m
English
Adding the layer between the client and Hadoop is the reason for a lot of the ease which Sqoop 2 brings as compared to Sqoop 1. The following table compares this design though between the two versions.
| Sqoop 1 | Sqoop 2 |
| Command line is the only client option | Command line along with browser interface (via Hue) are the client options |
| Client only architecture | Client-server architecture |
| Client works only on the same machine where Sqoop is installed | Server setup allows access to Sqoop from different machines |
| Tight coupling between other tools (Apache Oozie) when integration is brought about. | Integration is quite easy using the exposed REST API’s |
| Connectors and drivers need to be configured for each client installation separately. ... |