aboutsummaryrefslogtreecommitdiff
diff options
context:
space:
mode:
authormsabhi <abhi.is2006@gmail.com>2016-12-04 15:21:43 -0500
committerGitHub <noreply@github.com>2016-12-04 15:21:43 -0500
commitdaed05ae775538ad2edabe0693da3fb832c721e6 (patch)
tree14497fb36d137477675d33aa3b6aa50d530a7d6b
parent7f8bf61a2cfbf36f5b3e55043244d963ebc92bec (diff)
Update big-data.md
-rw-r--r--chapter/8/big-data.md4
1 files changed, 2 insertions, 2 deletions
diff --git a/chapter/8/big-data.md b/chapter/8/big-data.md
index bf96553..ba9affe 100644
--- a/chapter/8/big-data.md
+++ b/chapter/8/big-data.md
@@ -28,9 +28,9 @@ by: "Jingjing and Abhilash"
Overview of Pregel. Its implementation and working. its limitations. Do not stress more since we have a better model GraphX to explain a lot.
- SparkSQL Catalyst & Spark execution model : Discuss Parser, LogicalPlan, Optimizer, PhysicalPlan, Execution Plan. Why catalyst? how catalyst helps in SparkSQL , data flow from sql-core-> catalyst->spark-core
-- Evaluation: Given same algorithm, what is the performance differences between Hadoop, Spark, Dryad and SparkSQL? There are no direct comparison for all those models, so we may want to compare separately:
+- Evaluation: Given same algorithm, what is the performance differences between Hadoop, Spark, Dryad? There are no direct comparison for all those models, so we may want to compare separately:
- Hadoop vs. Spark
- - Spark vs. SparkSQL
+ - Spark vs. SparkSQL from SparkSQL paper
- Big Data Ecosystem
Everything interoperates with GFS or HDFS, or makes use of stuff like protocol buffers so systems like Pregel and MapReduce and even MillWheel...