Course Syllabus

Overview

Students should watch Udacity course videos according to the following schedule. It is recommended for students to do lab sessions on the schedule by yourself as early as possible since some of homework may cover the lab materials scheduled later than the homework.

Schedule

# Week of Lesson Lab Deliverable Due
1 8/22/2016 1. Intro to Big Data Analytics & 2. Course Overview
2 8/29/2016 3. Predictive Modeling 1. Hadoop & HDFS Basics
3 9/5/2016 4. MapReduce 2. Hadoop Streaming & HBase HW 1 (Sep 11)
4 9/12/2016 5. Classification evaluation metrics & 6. Classification ensemble methods 3. Hadoop Pig & Hive
5 9/19/2016 7. Phenotyping & 8. Clustering 4. Scala Basic HW 2 (Sep 25)
6 9/26/2016 9. Spark 5. Spark Basic
7 10/3/2016 10. Medical ontology 6. Spark SQL HW 3 (Oct 9)
8 10/10/2016 11. Graph analysis 7. Spark MLlib Project Proposal (Oct 23)
9 10/17/2016 12. Dimensionality Reduction 8. Spark GraphX
10 10/24/2016 13. Patient similairty HW 4 (Oct 31)
11 10/31/2016 Guest Lecture Peer review bidding (Nov 6)
12 11/7/2016 Guest Lecture
13 11/14/2016 Guest Lecture Paper Draft (Nov 20)
14 11/21/2016 Guest Lecture Peer Review for Draft (Nov 27) 
15 11/28/2016 Guest Lecture Final Project (code+presentation+final paper) (Dec 4)
16 12/5/2016 Conclusions Peer Review for Final Project (Dec 11)
17 12/12/2016 Final Exam Week

Past Guest Lectures

See RESOURCE section.