Apache Tez is an extensible framework for building high performance batch and interactive data processing applications, coordinated by YARN in Apache Hadoop. Tez improves the MapReduce paradigm by dramatically improving its speed, while maintaining MapReduce’s ability to scale to petabytes of data. Important Hadoop ecosystem projects like Apache Hive and Apache Pig use Apache Tez, as do a growing number of third party data access applications developed for the broader Hadoop ecosystem.
- hortonworks.com (http://hortonworks.com/apache/tez/)
Apache Tez is designed for Yarn on top of Hadoop 2. Tez is designed for high performance, functioning at low latency mainly for processing. Tez is developed with extensibility in mind and allows ...