2016-03-10 最后更新时间:
写在前面
本系列是综合了自己在学习spark过程中的理解记录 + 对参考文章中的一些理解 + 个人实践spark过程中的一些心得而来。写这样一个系列仅仅是为了梳理个人学习spark的笔记记录,所以一切以能够理解为主,没有必要的细节就不会记录了,而且文中有时候会出现英文原版文档,只要不影响理解,都不翻译了。若想深入了解,最好阅读参考文章和官方文档。
其次,本系列是基于目前最新的 spark 1.6.0 系列开始的,spark 目前的更新速度很快,记录一下版本号还是必要的。
最后,如果各位觉得内容有误,欢迎留言备注,所有留言 24 小时内必定回复,非常感谢。
Tips: 如果插图看起来不明显,可以:1. 放大网页;2. 新标签中打开图片,查看原图哦。
1. 书籍
2. 网站
3. 文章,博客
4. 视频
- YouTube: what is apache spark
- Top 5 Mistakes When Writing Spark Applications
- Tuning and Debugging Apache Spark
- A Deeper Understanding of Spark Internals - Aaron Davidson (Databricks)
-
slideA Deeper Understanding of Spark Internals - Aaron Davidson (Databricks) - Building, Debugging, and Tuning Spark Machine Learning Pipelines - Joseph Bradley (Databricks)
-
slideBuilding, Debugging, and Tuning Spark Machine Learning Pipelines - Spark DataFrames Simple and Fast Analysis of Structured Data - Michael Armbrust (Databricks)
-
slideSpark DataFrames Simple and Fast Analysis of Structured Data - Michael Armbrust (Databricks) - Spark Tuning for Enterprise System Administrators
- Structuring Spark: DataFrames, Datasets, and Streaming
-
slideStructuring Spark: DataFrames, Datasets, and Streaming - Spark in Production: Lessons from 100+ Production Users
-
slideSpark in Production: Lessons from 100+ Production Users - Production Spark and Tachyon use Cases
- SparkUI Visualization
- Everyday I’m Shuffling - Tips for Writing Better Spark Programs, Strata San Jose 2015
-
slideEveryday I’m Shuffling - Tips for Writing Better Spark Programs, Strata San Jose 2015 - Securing your Spark Applications
- Building a REST Job Server for Interactive Spark as a Service
-
slideBuilding a REST Job Server for Interactive Spark as a Service - Exploiting GPUs for Columnar DataFrame Operations
- Easy JSON Data Manipulation in Spark - Yin Huai (Databricks)
-
slideEasy JSON Data Manipulation in Spark - Yin Huai (Databricks) - Sparkling: Speculative Partition of Data for Spark Applications - Peilong Li
-
slideSparkling: Speculative Partition of Data for Spark Applications - Peilong Li - Advanced Spark Internals and Tuning – Reynold Xin
- The Future of Real Time in Spark
- Spark 2 0
- Democratizing Access to Data
- Not Your Father’s Database: How to Use Apache Spark Properly in Your Big Data Architecture
slideNot Your Father’s Database: How to Use Apache Spark Properly in Your Big Data Architecture
5. next
上面的资源我都会不断更新的,里面 80% 以上的都是我亲自看过并且觉得有价值的,可不是胡乱收集一通的,推荐欣赏哦。