diff options
| author | msabhi <abhi.is2006@gmail.com> | 2016-12-04 15:21:43 -0500 |
|---|---|---|
| committer | GitHub <noreply@github.com> | 2016-12-04 15:21:43 -0500 |
| commit | daed05ae775538ad2edabe0693da3fb832c721e6 (patch) | |
| tree | 14497fb36d137477675d33aa3b6aa50d530a7d6b | |
| parent | 7f8bf61a2cfbf36f5b3e55043244d963ebc92bec (diff) | |
Update big-data.md
| -rw-r--r-- | chapter/8/big-data.md | 4 |
1 files changed, 2 insertions, 2 deletions
diff --git a/chapter/8/big-data.md b/chapter/8/big-data.md index bf96553..ba9affe 100644 --- a/chapter/8/big-data.md +++ b/chapter/8/big-data.md @@ -28,9 +28,9 @@ by: "Jingjing and Abhilash" Overview of Pregel. Its implementation and working. its limitations. Do not stress more since we have a better model GraphX to explain a lot. - SparkSQL Catalyst & Spark execution model : Discuss Parser, LogicalPlan, Optimizer, PhysicalPlan, Execution Plan. Why catalyst? how catalyst helps in SparkSQL , data flow from sql-core-> catalyst->spark-core -- Evaluation: Given same algorithm, what is the performance differences between Hadoop, Spark, Dryad and SparkSQL? There are no direct comparison for all those models, so we may want to compare separately: +- Evaluation: Given same algorithm, what is the performance differences between Hadoop, Spark, Dryad? There are no direct comparison for all those models, so we may want to compare separately: - Hadoop vs. Spark - - Spark vs. SparkSQL + - Spark vs. SparkSQL from SparkSQL paper - Big Data Ecosystem Everything interoperates with GFS or HDFS, or makes use of stuff like protocol buffers so systems like Pregel and MapReduce and even MillWheel... |
