Course Syllabus

Overview

Students should watch Udacity course videos according to the following schedule. It is recommended for students to do lab sessions on the schedule by yourself as early as possible since some of homework may cover the lab materials scheduled later than the homework.

Schedule

Week # 2017 Dates Lesson Lab Deliverable Due
1 Aug 21 to 27 1. Intro to Big Data Analytics
2. Course Overview
Scala Basic
2 Aug 28 to Sep 3 3. Predictive Modeling Hadoop & HDFS Basics

3 Sep 4 to 10 4.MapReduce

& HBase

HW 1 (Sep 4)
4 Sep 11 to 17 5.Classification evaluation metrics
6.Classification ensemble methods
Hadoop Pig & Hive

5 Sep 18 to 24 7. Phenotyping & 8. Clustering NLP Lab

HW 2 (Sep 24)
6 Sep 25 to Oct 1 9. Spark

Spark Basic, Spark SQL

Project group formation (Oct 1)
7 Oct 2 to 8 10. Medical ontology

HW 3 (Oct 8)
8 Oct 9 to 15 11. Graph analysis

Spark Application

Project Proposal (Oct 15)
9 Oct 16 to 22 12. Dimensionality Reduction Spark MLlib & Spark GraphX
10 Oct 23 to 29 13. Patient similairty

HW 4 & Peer review bidding (Oct 29)
11 Oct 30 to Nov 5 Spring break
12 Nov 6 to 12 Potential guest Lecture
13 Nov 13 to 19 Potential guest Lecture Paper Project draft (Nov 12)
14 Nov 20 to 26 Potential guest Lecture Peer Review for Draft (Nov 19) 
15 Nov 27 to Dec 3 Potential guest Lecture
16 Dec 4 to 10 working on projects Final Project (code+presentation+final paper) (Dec 10)
17 Dec 11 to 17 Final Exam Week

Previous Guest Lectures

See RESOURCE section.