update diagram

author: msabhi <abhi.is2006@gmail.com> 2016-12-04 08:50:30 -0500
committer: GitHub <noreply@github.com> 2016-12-04 08:50:30 -0500
commit: b92cacd9c46dd9da407eacad33a6fdb9acbf2ff2 (patch)
tree: 57ed782c1f149bedcb62956bfb8ed495db07faf7 /chapter/8
parent: 07499f7dac53ad9fcb92594a0069f23e9740e669 (diff)
1 files changed, 3 insertions, 1 deletions
diff --git a/chapter/8/big-data.md b/chapter/8/big-data.md
index cc11e28..1ca16aa 100644
--- a/chapter/8/big-data.md
+++ b/chapter/8/big-data.md
@@ -97,7 +97,9 @@ The properties that power RDD with the above mentioned features :
 - A compute function to do a computation on partitions.
 - Optionally, a Partitioner for key-value RDDs (e.g. to say that the RDD is hash-partitioned)
 - Optional preferred locations (aka locality info), (e.g. block locations for an HDFS file)
-
+<figure class="main-container">
+  <img src="./spark_pipeline.png" alt="MapReduce Execution Overview" />
+</figure>
 Spark API provide two kinds of operations on a RDD:
 Transformations - lazy operations that return another RDD.
 `map (f : T => U) : RDD[T] ⇒ RDD[U]` : Return a MappedRDD[U] by applying function f to each element
author	msabhi <abhi.is2006@gmail.com>	2016-12-04 08:50:30 -0500
committer	GitHub <noreply@github.com>	2016-12-04 08:50:30 -0500
commit	b92cacd9c46dd9da407eacad33a6fdb9acbf2ff2 (patch)
tree	57ed782c1f149bedcb62956bfb8ed495db07faf7 /chapter/8
parent	07499f7dac53ad9fcb92594a0069f23e9740e669 (diff)