Sections
Text Area

Big Data Computing with Spark

Left Column
Text Area

Course Description

Big data systems such as Hadoop and Spark emerge as enabling technologies in managing massive amounts of data across hundreds or even thousands of computing nodes. Meanwhile, these new systems have very different programming models and require different algorithm design techniques. This course covers both the theory and hands-on experiences of these big data systems, with ample examples to help understand the basic concepts.

References

  1. Learning Spark: Lightning-Fast Big Data Analysis, O'Reilly Media
  2. 7 Steps for a Developer to Learn Apache® Spark
  3. https://spark.apache.org/docs/latest/index.html

 

Syllabus

  1. Spark internals
  2. SparkSQL
  3. Algorithm design for big data systems
Right Column
Image
Image
Program - Big Data Computing with Spark
Text Area
Prerequisite
  • Basic Python programming
  • Basic knowledge about SQL