Hive query small tips
This article only talks about the optimisation on DML layer with an explanation on Hadoop MapReduce mechanisms.
Suppose we have two tables: fact_order, dim_customer. fact_order keeps all customer order transaction history, dim_customer is the main dimension table for customer attributes.