Sections
Text Area
Big Data Computing with Spark
Left Column
Text Area
Course Description
Big data systems such as Hadoop and Spark emerge as enabling technologies in managing massive amounts of data across hundreds or even thousands of computing nodes. Meanwhile, these new systems have very different programming models and require different algorithm design techniques. This course covers both the theory and hands-on experiences of these big data systems, with ample examples to help understand the basic concepts.
References
- Learning Spark: Lightning-Fast Big Data Analysis, O'Reilly Media
- 7 Steps for a Developer to Learn Apache® Spark
- https://spark.apache.org/docs/latest/index.html
Syllabus
- Spark internals
- SparkSQL
- Algorithm design for big data systems
Right Column
Image
Image
Text Area
Prerequisite
- Basic Python programming
- Basic knowledge about SQL