Apache Hive Essentials by Dayong Du

Summary

In this chapter, we first covered how to identify performance bottlenecks using the EXPLAIN and ANALYZE statements. Next, we discussed design optimizations for tables, partitions, and indexes. We then covered data file optimization, including file format, compression, and storage. Finally, we discussed job and query optimization in Hive. After working through this chapter, we should be able to troubleshoot and tune performance in Hive.

In the next chapter, we'll talk about function extensions for Hive.
