From 6e768a1fcbfe8c800527462d76ee49d62a64ef3c Mon Sep 17 00:00:00 2001 From: msabhi Date: Fri, 16 Dec 2016 14:33:11 -0500 Subject: Fixing Hive transformation flow --- chapter/8/big-data.md | 6 +++--- 1 file changed, 3 insertions(+), 3 deletions(-) (limited to 'chapter/8') diff --git a/chapter/8/big-data.md b/chapter/8/big-data.md index 018894d..2590095 100644 --- a/chapter/8/big-data.md +++ b/chapter/8/big-data.md @@ -615,9 +615,9 @@ The Hive execution model {% cite thusoo2010hive --file big-data%} composes of th - Optimization: Optimization forms the core of any declarative interface. In the case of Hive, optimization happens through chains of transformation of DAG. A transformation could include even a user defined optimization and it applies an action on the DAG only if a rule is satisfied. Every node in the DAG implements a special interface called as Node interface which makes it easy for the manipulation of the operator DAG using other interfaces like GraphWalker, Dispatcher, Rule, and Processor. Hence, by transformation, we mean walking through a DAG and for every Node we encounter we perform a Rule satisfiability check. If a Rule is satisfied, a corresponding processor is invoked. A Dispatcher maintains a list of Rule to Processor mappings. -
- Hive transformation -
+
+ Hive transformation +
*Figure to depict the transformation flow during optimization, from:* {%cite thusoo2010hive --file big-data %} -- cgit v1.2.3