Big Data Computing with Spark
Big data systems such as Hadoop and Spark have emerged as enabling technologies for managing massive amounts of data across hundreds or even thousands of computing nodes. However, these new systems have very different programming models and call for different algorithm design techniques. This course covers both the theory and hands-on practice of these big data systems, with ample examples to illustrate the basic concepts.
- Learning Spark: Lightning-Fast Big Data Analysis, O'Reilly Media
- 7 Steps for a Developer to Learn Apache® Spark
- Spark internals
- Algorithm design for big data systems
- Basic Python programming
- Basic knowledge of SQL