From daed05ae775538ad2edabe0693da3fb832c721e6 Mon Sep 17 00:00:00 2001 From: msabhi Date: Sun, 4 Dec 2016 15:21:43 -0500 Subject: Update big-data.md --- chapter/8/big-data.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) (limited to 'chapter/8/big-data.md') diff --git a/chapter/8/big-data.md b/chapter/8/big-data.md index bf96553..ba9affe 100644 --- a/chapter/8/big-data.md +++ b/chapter/8/big-data.md @@ -28,9 +28,9 @@ by: "Jingjing and Abhilash" Overview of Pregel. Its implementation and working. its limitations. Do not stress more since we have a better model GraphX to explain a lot. - SparkSQL Catalyst & Spark execution model : Discuss Parser, LogicalPlan, Optimizer, PhysicalPlan, Execution Plan. Why catalyst? how catalyst helps in SparkSQL , data flow from sql-core-> catalyst->spark-core -- Evaluation: Given same algorithm, what is the performance differences between Hadoop, Spark, Dryad and SparkSQL? There are no direct comparison for all those models, so we may want to compare separately: +- Evaluation: Given same algorithm, what is the performance differences between Hadoop, Spark, Dryad? There are no direct comparison for all those models, so we may want to compare separately: - Hadoop vs. Spark - - Spark vs. SparkSQL + - Spark vs. SparkSQL from SparkSQL paper - Big Data Ecosystem Everything interoperates with GFS or HDFS, or makes use of stuff like protocol buffers so systems like Pregel and MapReduce and even MillWheel... -- cgit v1.2.3