Hive Query Optimisation

Chloooo
2 min readSep 6, 2020

Hive query small tips

Photo by Vivek Doshi on Unsplash

This article only talks about the optimisation on DML layer with an explanation on Hadoop MapReduce mechanisms.

Suppose we have two tables: fact_order, dim_customer. fact_order keeps all customer order transaction history, dim_customer is the main dimension table for customer attributes.

--

--