From a9883554b8e4ab00e41dbd8a358f97628f35f392 Mon Sep 17 00:00:00 2001 From: msabhi Date: Sun, 4 Dec 2016 08:51:00 -0500 Subject: Fix diagram --- chapter/8/big-data.md | 4 ++++ 1 file changed, 4 insertions(+) (limited to 'chapter/8/big-data.md') diff --git a/chapter/8/big-data.md b/chapter/8/big-data.md index 1ca16aa..bae6b83 100644 --- a/chapter/8/big-data.md +++ b/chapter/8/big-data.md @@ -97,9 +97,13 @@ The properties that power RDD with the above mentioned features : - A compute function to do a computation on partitions. - Optionally, a Partitioner for key-value RDDs (e.g. to say that the RDD is hash-partitioned) - Optional preferred locations (aka locality info), (e.g. block locations for an HDFS file) + +
MapReduce Execution Overview
+ + Spark API provide two kinds of operations on a RDD: Transformations - lazy operations that return another RDD. `map (f : T => U) : RDD[T] ⇒ RDD[U]` : Return a MappedRDD[U] by applying function f to each element -- cgit v1.2.3